Commit graph

540 commits

Author SHA1 Message Date
Ning Xu
88e554b513 Move label check failure to mergebot (#94707)
Fixes https://github.com/pytorch/pytorch/issues/88098

This is a mirror of the same PR (https://github.com/Goldspear/pytorch/pull/3) that has been reviewed in my fork (due to it's a stacked PR).

==============
## Context
This the 3rd of the 3 PRs to address the issue 88098.

## What Changed
1. check_labels.py no longer fails, but only leaving a comment
2. trymerge.py now would fail if no required labels provided

## Tests
* dummy-repo trymerge run [fails without required label](https://github.com/Goldspear/pytorch-dummy/actions/runs/4162819216) and resulted in [a label error comment](https://github.com/Goldspear/pytorch-dummy/pull/3#issuecomment-1427756769)
* the above pr was [correctly merged](https://github.com/Goldspear/pytorch-dummy/pull/3) after label is added.

## Note to Reviewers
1st PR: https://github.com/pytorch/pytorch/pull/94179
2nd PR: https://github.com/pytorch/pytorch/pull/94899
3rd PR: this one
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94707
Approved by: https://github.com/ZainRizvi
2023-03-03 15:09:14 +00:00
Nikita Shulga
61fa43a1f2 [GHF] Add submodule updates check (#95885)
Originally planned to integrate it somehow into the `lintrunner`, but this poses too many challenges, one of them is that it deliberately ignores submodule updates.

On the other hand, almost all the information, other than list of the submodules is already present in the GitHubPR info.

Incorporate small BE change into `test_trymerge.py`, that moves `@mock.patch` from individual test to the class definition.

Fixes https://github.com/pytorch/pytorch/issues/74326 and https://github.com/pytorch/test-infra/issues/1521
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95885
Approved by: https://github.com/ZainRizvi, https://github.com/huydhn
2023-03-02 18:05:26 +00:00
Ning Xu
5ba4dafccd Retry Merge: extract utils from check labels ptr (#94899)
Fixes #88098

This is the rebased and retry merging branch of the reverted PR: https://github.com/pytorch/pytorch/pull/94597

Pull Request resolved: https://github.com/pytorch/pytorch/pull/94899
Approved by: https://github.com/kit1980
2023-03-01 20:40:30 +00:00
Huy Do
40d54cf8bf Apply filter logic to disabled jobs dynamically (#95442)
Apply filter logic to disabled jobs dynamically.  The list of disabled jobs is published at https://ossci-metrics.s3.amazonaws.com/disabled-jobs.json.  When the workflow (i.e. `pull`) and the platform (i.e. `linux-bionic-py3.8-clang9`) names match, job will be disabled (skipped) if they are in the list.

Note that getting the current job name within the GitHub action is fairly hacky.  This is a TODO item.

### Testing

* Unit testing
* This PR. https://github.com/pytorch/pytorch/issues/94861 disables `pull / linux-bionic-py3.8-clang9 / test (dynamo)` in the CI.  We have:
   * No dynamo tests running in `pull / linux-bionic-py3.8-clang9` https://github.com/pytorch/pytorch/actions/runs/4272505289/jobs/7437706181
   * Other dynamo tests, i.e. `pull / linux-bionic-py3.11-clang9`, are run normally https://github.com/pytorch/pytorch/actions/runs/4272505289/jobs/7437706054
 * This PR. https://github.com/pytorch/pytorch/issues/95642 disables `pull / linux-bionic-cuda11.7-py3.10-gcc7-sm86 / test`.  All test jobs for `pull / linux-bionic-cuda11.7-py3.10-gcc7-sm86` are skipped https://github.com/pytorch/pytorch/actions/runs/4287330986/jobs/7468179694
 * This PR. https://github.com/pytorch/pytorch/issues/95656 disables `pull / linux-bionic-py3_8-clang8-xla / build`.  All build and test jobs for `pull / linux-bionic-py3_8-clang8-xla` are skipped https://github.com/pytorch/pytorch/actions/runs/4287330986/jobs/7470478905
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95442
Approved by: https://github.com/clee2000
2023-03-01 00:10:35 +00:00
Huy Do
b55d0d2aef Fix trymerge changed files count (#95720)
The value from the PR info includes only unique files != The number of files changed (both are technically correct, depending on how you view it)

I'm trying to merge this PR https://github.com/pytorch/pytorch/pull/95233 which makes `.github/ci_commit_pins/triton.txt` a softlink.  So the PR includes 2 changes to that file 1) to delete the file and 2) to add it as a symlink.

```
[
  ".ci/docker/build.sh",
  ".ci/docker/ci_commit_pins/triton.txt",
  ".ci/docker/common/common_utils.sh",
  ".ci/docker/common/install_triton.sh",
  ".ci/docker/requirements-ci.txt",
  ".ci/docker/ubuntu-cuda/Dockerfile",
  ".ci/docker/ubuntu/Dockerfile",
  ".github/ci_commit_pins/triton.txt", <--
  ".github/ci_commit_pins/triton.txt", <--
  ".github/workflows/build-triton-wheel.yml"
]
```

Trymerge doesn't like that and rejects the merge due to `Changed file count mismatch` https://github.com/pytorch/pytorch/actions/runs/4295438799/jobs/7485853815 . This is because the PRInfo GraphQL result from GitHub only counts 9 of them https://paste.sh/zVsOnWoT#p_3RKX_VMjj-e71vwsTeA01W (search for `changedFiles`).  It means that the name are dedup, so that only unique file names are counted.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/95720
Approved by: https://github.com/kit1980, https://github.com/malfet, https://github.com/ZainRizvi
2023-02-28 21:55:21 +00:00
Sergii Dymchenko
11f293a74e Comment about Meta-internal usage of trymerge.py (#95536)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95536
Approved by: https://github.com/malfet
2023-02-27 14:16:04 +00:00
Ivan Zaitsev
ca59b2d375 Fix co-dev regresssion in github-exports-check job (#95345)
Summary:
Regression introduced in #91134 (github-exports-check calls git, which is not available internally at Meta).

Meta employees, see T145865943 for the context.

Test Plan: Unit tests, `github-export-checks` job.

Differential Revision: D43521051

Pull Request resolved: https://github.com/pytorch/pytorch/pull/95345
Approved by: https://github.com/kit1980
2023-02-24 22:40:28 +00:00
Catherine Lee
fb3ff77438 [mergebot] Fix for pagination error (#95333)
Fix for weird bug that happens very rarely.  My solution is to retrieve all checksuites before going to retrieve their checkruns.

Sometimes `cs_cursor=edges[edge_idx - 1]["cursor"] if edge_idx > 0 else None,` is None when it shouldn't be because of how we reset `checksuites = get_next_checksuites(checksuites)` on every loop.

Ex
page 1 of checksuites contains some stuff
page 2 of checksuites: pull {a bunch of checkruns}
cs_cursor gets set to none for the pull checksuite on page 2 because `checksuites = get_next_checksuites(checksuites)` resets the edges on every loop.  Then the checkruns can't be retrieved.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/95333
Approved by: https://github.com/huydhn
2023-02-23 17:48:56 +00:00
Zain Rizvi
d6a8d397da Fix formatting for merge failed message (#95234)
Fixes formatting so that the merge rule shows up on a different line than the "Raised by" text

Follow up to https://github.com/pytorch/pytorch/pull/94932

New version
<img width="433" alt="image" src="https://user-images.githubusercontent.com/4468967/220441349-ac99096d-590a-42c1-b995-4a23b2d9b810.png">
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95234
Approved by: https://github.com/huydhn
2023-02-22 18:11:22 +00:00
Zain Rizvi
da98053c6d Fix bug where a github api failure would prevent the label check from failing (#95098)
Fix bug where a github api failure would prevent the check from failing even if we already saw that labels were needed.

Also adds more debugging info to the rate limit exceeded error since it's weird to see an error claiming the rate limit has exceeded when the "Used" amount is way below the limit.  I suspect these happen when the request arrived just before the rate reset time, but the response was generated right after the reset time, hence the apparently tiny "used" amounts

Example run where the check should have failed, but passed instead:
https://github.com/pytorch/pytorch/actions/runs/4200205209/jobs/7285979824
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95098
Approved by: https://github.com/huydhn
2023-02-21 18:42:12 +00:00
Zain Rizvi
976d289e86 Fix update_pytorch_labels workflow (#95227)
Pass in repo args now that they're required (after a recent refactor). Also changes the script to pass in the repo name instead of being hardcoded to pytorch/pytorch.

I'm guessing this wasn't noticed earlier since the workflow is only triggered when a label is created/edited/deleted
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95227
Approved by: https://github.com/huydhn
2023-02-21 18:26:21 +00:00
Zain Rizvi
a863d5e37c Hide failing merge rule's name in the internal debugging section (#94932)
Fixes https://github.com/pytorch/test-infra/issues/1081

The merge rule name is not helpful to most readers, and most of the time it's just "superuser."  Move this to a less prominent place in the "Details for Dev Infra team" section
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94932
Approved by: https://github.com/huydhn
2023-02-16 04:20:10 +00:00
zhuhong61
73ee4964d3 Add new checks in CI system to verify the built linux pip wheel with cpu-cxx11-abi (#79409)
We added the linux pip wheel with cpu-cxx11-abi in pytorch/builder, see: https://github.com/pytorch/builder/pull/990 and https://github.com/pytorch/builder/pull/1023

The purpose of this PR is to add new checks in pytorch CI system to verify the linux pip wheel with cpu-cxx11-abi.

Co-authored-by: Zhu Hong <hong.zhu@intel.com>
Co-authored-by: Guo Yejun <yejun.guo@intel.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79409
Approved by: https://github.com/malfet
2023-02-14 12:59:03 +00:00
PyTorch MergeBot
7c3fc2c7f0 Revert "Issue-88098: extract utils from check labels (#94597)"
This reverts commit 2c76838d7f.

Reverted https://github.com/pytorch/pytorch/pull/94597 on behalf of https://github.com/jeanschmidt due to reverting due internal breakages https://fburl.com/sandcastle/3ukij9xp
2023-02-13 20:19:50 +00:00
Ning Xu
2c76838d7f Issue-88098: extract utils from check labels (#94597)
Fixes #88098

This is a mirror of the same PR (https://github.com/Goldspear/pytorch/pull/2) that has been reviewed in my fork (due to it's a stacked PR).

======================
## Context

This is the 2nd of the 3 PRs to address issue-88098.

## What Changed
1. Extract comment related utils from trymerge.py to github_utils.py
2. Extract label related utils from trymerge.py and check_labels.py to label_utils.py

## Tests
* pytorch-dummy repo [trymerge run ](https://github.com/Goldspear/pytorch-dummy/actions/runs/4118944174)merged the test PR [OK](https://github.com/Goldspear/pytorch-dummy/pull/2).

## Note to Reviewers
Due to higher degree of complexity involved to extract GitHubPR class, it's worth having a separate issue to handle that part of refactoring. This issue only focusing on refactoring where necessary to ship the functional diff.

* 1st PR: https://github.com/pytorch/pytorch/pull/94179
* 2nd PR: this one
* 3rd PR: https://github.com/Goldspear/pytorch/pull/3

Pull Request resolved: https://github.com/pytorch/pytorch/pull/94597
Approved by: https://github.com/ZainRizvi
2023-02-12 12:18:53 +00:00
Nikita Shulga
89df0e4253 Enable Python-3.11 binary builds across the board (#94430)
Most of the work is outside of repositories and consists of cloning projects https://github.com/AnacondaRecipes/ and building:
- [typing_extensions](https://github.com/AnacondaRecipes/typing_extensions-feedstock)
- [pyyaml](https://github.com/AnacondaRecipes/pyyaml-feedstock)
- [setuptools](https://github.com/AnacondaRecipes/setuptools-feedstock) v 59.8.0, needed to build `numpy`. Trick here is to add `add_pip_as_python_dependency: off` to ones `.condarc`
- [cython](https://github.com/AnacondaRecipes/cython-feedstock)
- [mkl-service](https://github.com/AnacondaRecipes/mkl-service-feedstock)
- [numpy-base](https://github.com/AnacondaRecipes/numpy-feedstock) (against mkl-2021.4), i.e. add `blas_impl: "mkl"` and `mkl: ">=2021.4.0,<2022.0a0"` to ones `conda_build_config.yaml`
- [mkl_random](https://github.com/AnacondaRecipes/mkl_random-feedstock)
- [mkl_fft](https://github.com/AnacondaRecipes/mkl_fft-feedstock)
- [numpy](https://github.com/AnacondaRecipes/numpy-feedstock)
- [mpmath](https://github.com/AnacondaRecipes/mpmath-feedstock)
- [sympy](https://github.com/AnacondaRecipes/sympy-feedstock)

Anaconda build system is really modern, so in order to be able to build:
- x86 MacOS packages, one need to install Macos 10.10 SDK from 2014, still available at https://github.com/phracker/MacOSX-SDKs/releases and reference it as conda build sysroot, as follows: `CONDA_BUILD_SYSROOT: /Library/Developer/CommandLineTools/SDKs/MacOSX10.10.sdk`
- Windows packages "MSVC v141 - VS 2017 C++ x64/86 build tools (v14.16)" is needed, which likely is still available as Visual Studio component

As well as make a pretty trivial tweak to build rules in cf4fa8900b
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94430
Approved by: https://github.com/seemethere, https://github.com/weiwangmeta, https://github.com/albanD, https://github.com/atalman
2023-02-10 06:10:27 +00:00
Xuehai Pan
69e0bda999 [BE] Import Literal, Protocol, and Final from standard library typing as of Python 3.8+ (#94490)
Changes:

1. `typing_extensions -> typing-extentions` in dependency. Use dash rather than underline to fit the [PEP 503: Normalized Names](https://peps.python.org/pep-0503/#normalized-names) convention.

```python
import re

def normalize(name):
    return re.sub(r"[-_.]+", "-", name).lower()
```

2. Import `Literal`, `Protocal`, and `Final` from standard library as of Python 3.8+
3. Replace `Union[Literal[XXX], Literal[YYY]]` to `Literal[XXX, YYY]`.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/94490
Approved by: https://github.com/ezyang, https://github.com/albanD
2023-02-09 19:17:49 +00:00
Ning Xu
527b646f4b Refactor to extract label_utils from export_pytorch_labels (#94179)
Part of fixing #88098

## Context

This is 1/3 PRs to address issue 88098 (move label check failure logic from `check_labels.py` workflow to `trymerge.py` mergebot. Due to the messy cross-script imports and potential circular dependencies, it requires some refactoring to the scripts before, the functional PR can be cleanly implemented.

## What Changed
1. Extract extracts label utils fcns to a `label_utils.py` module from the `export_pytorch_labels.py` script.
2. Small improvements to naming, interface and test coverage

## Note to Reviewers
This series of PRs is to replace the original PR https://github.com/pytorch/pytorch/pull/92682 to make the changes more modular and easier to review.

* 1st PR: this one
* 2nd PR: https://github.com/Goldspear/pytorch/pull/2
* 3rd PR: https://github.com/Goldspear/pytorch/pull/3

Pull Request resolved: https://github.com/pytorch/pytorch/pull/94179
Approved by: https://github.com/ZainRizvi
2023-02-09 19:17:05 +00:00
blorange-amd
11f51e798f Upgrade nightly wheels to ROCm5.4.2 (#93090)
Test PR1225: https://github.com/pytorch/builder/pull/1225

Pull Request resolved: https://github.com/pytorch/pytorch/pull/93090
Approved by: https://github.com/atalman
2023-02-09 04:53:11 +00:00
Nikita Shulga
6ac0198c02 [CI] Add known ciflow labels to probot (#94368)
Add `collect_ciflow_labels.py` that automatically extracts all labels from workflow files and adds the to pytorch-probot.yml
Same script can also be used to validate that all tags are referenced in the config

Add this validation to quickchecks
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94368
Approved by: https://github.com/jeanschmidt
2023-02-08 17:37:27 +00:00
Nikita Shulga
230c4fe93d [GHF] Fix pushDate handling (#94364)
Merge commits does not have a merge date, which is also clear from [GraphQL schema](https://docs.github.com/en/graphql/reference/objects#commit).
Modify return signature of `GitHubPR.last_pushed_at`, print warning when one can not be queried and add regression test.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94364
Approved by: https://github.com/huydhn
2023-02-08 05:52:03 +00:00
Catherine Lee
e9533767af trymerge to ignore certain failures (#91134)
For any failure in dr ci listed as "flaky" or "broken trunk" (aka anything not "new failures"), these get marked as "ok to fail".

If there are a small number (currently set to 3) ok to fail jobs, merge can still continue.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91134
Approved by: https://github.com/huydhn, https://github.com/malfet, https://github.com/seemethere
2023-02-07 17:19:57 +00:00
Nikita Shulga
43f6ed4abd Extend torch-trition conda to 3.11 (#93117)
Also drop 3.7 from both builds and add proper names to the steps
Add `pytorch-nightly` for `conda` builds to test the installation against `pytorch` from the nightly channel as well as get [`filelock`](https://anaconda.org/pytorch-nightly/filelock) dependency for 3.11)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/93117
Approved by: https://github.com/atalman
2023-02-06 22:14:57 +00:00
PyTorch MergeBot
7fb2ac2bd5 Revert "trymerge to ignore certain failures (#91134)"
This reverts commit 8b7bd5dffc.

Reverted https://github.com/pytorch/pytorch/pull/91134 on behalf of https://github.com/seemethere due to Breaks internal `github-export-checks` see failure: https://fburl.com/sandcastle/ggqj29pz
2023-02-04 08:08:32 +00:00
Catherine Lee
1c30268ff1 Update rockset version (#94005)
upgrading rockset to 1.0.3

the diff looks like it gets rid of dependency on six but i think python-dateutils still uses it but is better about downloading it
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94005
Approved by: https://github.com/huydhn
2023-02-03 21:38:35 +00:00
Catherine Lee
8b7bd5dffc trymerge to ignore certain failures (#91134)
For any failure in dr ci listed as "flaky" or "broken trunk" (aka anything not "new failures"), these get marked as "ok to fail".

If there are a small number (currently set to 3) ok to fail jobs, merge can still continue.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91134
Approved by: https://github.com/huydhn, https://github.com/malfet
2023-02-03 20:56:39 +00:00
atalman
2910695942 Remove cuda 11.6 from nightly (#93979)
Remove cuda 11.6 from CI replace with 11.7
Following the Release readme here: https://github.com/pytorch/pytorch/blob/master/RELEASE.md#release-compatibility-matrix
Pull Request resolved: https://github.com/pytorch/pytorch/pull/93979
Approved by: https://github.com/Skylion007, https://github.com/clee2000, https://github.com/malfet
2023-02-02 20:27:19 +00:00
PyTorch MergeBot
f5e9c8ce54 Revert "Remove CUDA 11.6 from nightly builds (#93404)"
This reverts commit c76ac8eef2.

Reverted https://github.com/pytorch/pytorch/pull/93404 on behalf of https://github.com/clee2000 due to breaking lint
2023-02-02 17:10:01 +00:00
atalman
c76ac8eef2 Remove CUDA 11.6 from nightly builds (#93404)
Remove CUDA 11.6 from nightly builds.
Following the Release readme here: https://github.com/pytorch/pytorch/blob/master/RELEASE.md#release-compatibility-matrix
Pull Request resolved: https://github.com/pytorch/pytorch/pull/93404
Approved by: https://github.com/malfet
2023-02-02 14:26:52 +00:00
Nikita Shulga
31d466f925 [BE][ez] Move hardcoded constants to function args (#93874)
Also use tail-recursion instead of for loop to dismantle pyramid of doom

Pull Request resolved: https://github.com/pytorch/pytorch/pull/93874
Approved by: https://github.com/clee2000
2023-02-02 00:47:18 +00:00
Nikita Shulga
87b9ab4870 [CI] Add Py-3.11 wheels for all platforms (#93400)
As python-3.11 is now available on Conda for both MacOS and Windows

Disable dimtorch for Python-3.11 on Windows as its current implementation relies on internal symbols which are not exposed on Windows runtime (and to be frank, not sure why they are exposed on Linux/Mac), see https://github.com/pytorch/pytorch/issues/93854

As with the previous PR, most of the changes are not in PyTorch repo, but in builder, namely:
b71049dcbc
ece340ef7e
b0071ac366

Pull Request resolved: https://github.com/pytorch/pytorch/pull/93400
Approved by: https://github.com/weiwangmeta, https://github.com/atalman
2023-02-01 19:51:19 +00:00
Catherine Lee
1dcd2609b5 Add retries for get_workflow_job_id and try catch in upload_test_stats (#93401)
upload_test_stats keeps failing b/c it can't handle when the id is workflow-<workflow_id> so add a try catch for this.

Add retries to get_workflow_job_id to try and reduce the number of times the id can't be found

Failure to upload test stats and inability to get the job id cause our sharding infra and slow test infra (probably also flaky test detection) to be less effective.  This does not completely resolve the issue since we do rely on the job id

Failure to get the workflow job id happens tragically often, hopefully retries will help
Pull Request resolved: https://github.com/pytorch/pytorch/pull/93401
Approved by: https://github.com/huydhn
2023-02-01 18:33:32 +00:00
🌌
b179a097ea Add platform markers for linux x86_64 only extra_install_requires (#93066)
Like #89924 #91083

#85097 added new extra dependencies on nvidia-*. They are linux x86_64 (GPU) only packages, but were not marked as such, causing issues installing pytorch 1.13 via Poetry (and possibly other tools that follow PyPI's metadata API) on Linux aarch64 systems. This "fixes" the issue by adding the `and platform_machine == 'x86_64'` marker on these dependencies.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/93066
Approved by: https://github.com/malfet
2023-01-31 22:23:51 +00:00
Nikita Shulga
295fd20eb5 [CI] Add Python-3.11 Linux conda builds (#93186)
This PR almost a no-op, as most of the logic resides in the builder repo, namely:
6342242c50
8f361d91e1

Remove `conda-forge` channel dependency for test job, but add `malfet` channel for 3.11 testing (as numpy is not in default channel yet)
Build and upload following dependencies to `pytorch-nightly` channel:
```
anaconda copy --to-owner pytorch-nightly malfet/numpy/1.23.5
anaconda copy --to-owner pytorch-nightly malfet/numpy-base/1.23.5
anaconda copy --to-owner pytorch-nightly malfet/mkl-service/2.4.0
anaconda copy --to-owner pytorch-nightly malfet/mkl_random/1.2.2
anaconda copy --to-owner pytorch-nightly malfet/mkl_fft/1.3.1

anaconda copy --to-owner pytorch-nightly malfet/sympy/1.11.1
anaconda copy --to-owner pytorch-nightly malfet/mpmath/1.2.1
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/93186
Approved by: https://github.com/atalman, https://github.com/ZainRizvi
2023-01-31 20:24:03 +00:00
Nikita Shulga
7a621c443b [GHF] Fix ghstack branches in sync logic (#93298)
Test plan:
```python
from git_utils import are_ghstack_branches_in_sync,GitRepo
repo=GitRepo("/Users/nshulga/git/pytorch/pytorch")
are_ghstack_branches_in_sync(repo, "gh/SS-JIA/206/head")
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/93298
Approved by: https://github.com/clee2000, https://github.com/ZainRizvi
2023-01-30 21:00:51 +00:00
Nikita Shulga
5976f0bdfe Set min supported Python version to 3.8 (#93155)
Also, grep for `if sys.version_info .cond. (3, 8)` and replaces them with appropriate action.

This is a last in a series of PRs that moved CI/CD away from testing PyTorch behavior against Python-3.7.

Fixes https://github.com/pytorch/pytorch/issues/80513

Pull Request resolved: https://github.com/pytorch/pytorch/pull/93155
Approved by: https://github.com/huydhn
2023-01-29 18:28:46 +00:00
Wei Wang
345695e8f7 Remove PY37 from binary build matrix (#92919)
Similar to https://github.com/pytorch/test-infra/pull/1416 but for binary build
Pull Request resolved: https://github.com/pytorch/pytorch/pull/92919
Approved by: https://github.com/atalman
2023-01-26 01:25:47 +00:00
pbialecki
9bfd1357d5 Add CUDA 11.8 CI workflows (#92137)
Fixes #92090
CC @atalman
Pull Request resolved: https://github.com/pytorch/pytorch/pull/92137
Approved by: https://github.com/atalman
2023-01-23 21:03:53 +00:00
Nikita Shulga
550983e39d Revert "Move check_label ci to mergebot (#92309)"
This reverts commit 190f7803f5.

As it broke revert workflow, see https://github.com/pytorch/pytorch/actions/runs/3963235531/jobs/6790838677
2023-01-19 15:33:10 -08:00
Ning Xu
190f7803f5 Move check_label ci to mergebot (#92309)
Fixes #88098

### What Changed
* Moved `check_label.py` logic into `trymerge.py`
* Refactored relevant unittests
* ~~Dropped~~ Refactored `check_label.py` ci job

### Tests
`python .github/scripts/test_trymerge.py`
`python .github/scripts/test_check_labels.py`
`make lint & lintrunner -a`

### Notes to reviewers
This PR replaces the [original PR](https://github.com/pytorch/pytorch/pull/92225) to workaround the sticky EasyCLA failure mark on its first commit.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/92309
Approved by: https://github.com/ZainRizvi
2023-01-19 22:31:32 +00:00
Catherine Lee
34353a402e [mergebot] Flatten workflows into jobs, fix bugs (#92097)
* flatten the workflows into just jobs in order to give more specific links (link to the specific job that failed instead of just pull), this should make it easier to implement bypass certain failures in the future
* try catch of MandatoryChecksMissingError from find_matching_merge_rule should fix error where merge loops instead of raising runtime error when trunk job fails
* remove usage of on_green and mandatory_only flags just in case.  on_green and force are the only two behaviors we currently use
* fail if ghstack pr has non ghstack change, tested locally with #92177 but unsure how to write tests b/c requires use of repo._run_git
Pull Request resolved: https://github.com/pytorch/pytorch/pull/92097
Approved by: https://github.com/huydhn, https://github.com/ZainRizvi
2023-01-18 23:38:16 +00:00
Nikita Shulga
32937f39f4 Don't raise error if job_id can't be fetched (#92192)
But always return `workflowi_d`, which is not unique across reruns but it's better than failing the entire run just because API call failed. Test it locally by feeding the program an incorrect input and observe the failure.
Fixes https://github.com/pytorch/pytorch/issues/91332

Pull Request resolved: https://github.com/pytorch/pytorch/pull/92192
Approved by: https://github.com/kit1980
2023-01-17 00:09:05 +00:00
Nikita Shulga
2293a6b95e [BE] Refactor get_workflow_job_id (#92191)
A noop change that refactors existing codebase and prints a bit more
verbose error message when request fails.

Get rid of `requests` as it inevitable results in flakiness

TODO: Remove in a few days after PR is landed
4af5939d7a/.github/actions/get-workflow-job-id/action.yml (L29)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/92191
Approved by: https://github.com/kit1980
2023-01-15 23:02:29 +00:00
Nikita Shulga
3794b4643f [GHF] Record how many times PR is revered (#92180)
Or merged, by adding "revertedX2","revertedX3",... labels

Tested in https://github.com/malfet/deleteme/pull/36

Pull Request resolved: https://github.com/pytorch/pytorch/pull/92180
Approved by: https://github.com/ZainRizvi, https://github.com/kit1980
2023-01-13 23:18:38 +00:00
Dmytro Dzhulgakov
812d774cc9 Easy: add instructions for testing pytorch/builder (#91923)
Also makes the repo name configurable for branches in forks

Pull Request resolved: https://github.com/pytorch/pytorch/pull/91923
Approved by: https://github.com/malfet, https://github.com/seemethere
2023-01-11 07:26:46 +00:00
Catherine Lee
de4e4c785a [mergebot] Fix mergebot allow revert of codev diff (#91975)
mergebot was allowing non facebook-github-bot users to revert codev diffs when it shouldnt be allowed

Fixes https://github.com/pytorch/test-infra/issues/1381
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91975
Approved by: https://github.com/ZainRizvi, https://github.com/kit1980, https://github.com/malfet
2023-01-11 01:59:07 +00:00
Catherine Lee
1e768c63c1 Add merged label to ghstack prs (#90238)
not very elegant

other option might be adding something to pytorchbot to listen to push events for master?
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90238
Approved by: https://github.com/malfet, https://github.com/kit1980
2023-01-09 22:49:20 +00:00
Syed Tousif Ahmed
66b3325304 Adds more nvidia pypi dependencies (#89944)
This PR adds more nvidia pypi dependencies for cuda 11.7 wheel. Additionally, it pins cufft version to 10.9.0.58 to resolve https://github.com/pytorch/pytorch/issues/88038

Depends on: https://github.com/pytorch/builder/pull/1196

Pull Request resolved: https://github.com/pytorch/pytorch/pull/89944
Approved by: https://github.com/atalman
2023-01-04 15:08:08 +00:00
Nikita Shulga
bc92444b34 Rename torchtriton (#91539)
to `pytorch-triton`
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91539
Approved by: https://github.com/seemethere, https://github.com/soumith
2022-12-30 22:49:17 +00:00
Catherine Lee
d7674e70f4 Fix for tryrebase after PR was merged (#91337)
rebasing certain merged prs results in the rebased branch pointing at the target branch b/c git believes the pr has already been included in the branch.  Git does not replay the changes onto the target branch because the change is already in the target branch

This usually affects PRs with only 1 commit (more commits -> trymerge squashes them when merged -> git believes that the change is not in the target branch b/c the squashed commit is different from the individual changes).

It might also affect ghstack changes b/c behind the scenes the ghstack PRs are all contained within one commit on the orig branch, but I'm not sure about this.

helps w/ https://github.com/pytorch/test-infra/issues/836
looks like https://github.com/clee2000/random-testing/pull/44#issuecomment-1363439534
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91337
Approved by: https://github.com/ZainRizvi
2022-12-28 18:44:08 +00:00