pytorch

mirror of https://github.com/saymrwulf/pytorch.git synced 2026-05-15 21:00:47 +00:00

Author	SHA1	Message	Date
Song Zhou	dabeff33b9	[pytorch] Fix fblearner flow compiling errors (#35902 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35902 Move operator registration to anonymous namespace to avoid collision. Reviewed By: soumith Differential Revision: D20822382 fbshipit-source-id: 1ab00871491668b8b85e803ac877d96477f1688b	2020-04-02 14:52:48 -07:00
Mikhail Zolotukhin	3ef5ff6012	[TensorExpr] Make Load and Store multi-dimensional. (#35800 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35800 This PR includes the following changes: * Introduce a new `Expr` type `Buf`: it plays a similar to `Var` role, but also has dimensions. * Use the new `Buf` class in `Store` and `Load` instead of `Var` for specifying where to store to or load from. `Buf` contains the dimensions info of the buffer we're loading/storing to and hence we are able to keep N-d indexes without flattening them into a 1-d index ([x,y] vs [x+yW]). Flattening of the indexes is now a separate pass that is executed in `LoopNest::prepareForCodegen` - backends still expect indexes to be flattened, and this PR preserves that. * `Tensor` now contains a `Buf` instead of `Var`, and thus Tensor now has the dimensions info (previously it was a property of a `Function`, not a `Tensor`). This brings us closer to Tensor being a combination of Buffer + Function, where Buffer specifies iteration domain and the Function defines a computation. TODOs: * Consider merging `Buffer` with `Buf` or `BufHandle`. It seems that we don't need all of them. * Harden the logic of how we create buffers in fuser pass. Currently it seems that sometimes we don't set dimensions. * Use `Buf` in `Allocate` and `Free`. * Make it clearer that `Function` doesn't "own" dimensions info and that dimensions are a property of a Tensor, not a Function. Differential Revision: D20789005 Test Plan: Imported from OSS Reviewed By: zheng-xq Pulled By: ZolotukhinM fbshipit-source-id: e04188d1d297f195f1c46669c614557d6bb6cde4	2020-04-02 11:18:28 -07:00
Christian Sarofeen	6d24f8fe21	Infrastructure for a new CUDA Fuser (#34785 ) Summary: Summary: This PR contains the infrastructure of a new CUDA fuser. This CUDA fuser is based on many of the same principles of TensorExpressions and Halide, however the implementation is ground up. The fusion pass itself is similar to the default CUDA fuser, however, it has undergone some refactoring and is using the new code generation infrastructure. For those who are interested in how the code generation in this PR works, I would recommend reviewing _test/cpp/jit/test_gpu_fusion.cpp_ as well as the long comment section at the beginning of _torch/csrc/jit/codegen/cuda/transform_replay.h_ One of the largest differences between our approach and that of TVM/Halide, is the concept of "TensorView". TensorView from a high level should be thought of similarly to how we think of working with Tensors in PyTorch. It's an N-D object which can undergo transformations that change its dimensionality. Dimensionality changes are done through the operations split/merge/reorder/computeAt. These transformations are similar to split/fuse/reorder/compute_at of TVM, they modify how a tensor is iterated over to generate GPU code. Interestingly, in our scheme these transformations are applied to tensors and only impact how that tensor is generated. Warning: This PR is purposefully not feature complete with the current fuser. We wanted to separate out the infrastructure from the fusion capabilities. Once in, smaller incremental PRs will be submitted to expand capabilities of the fuser. Short term goals: Parity with current CUDA fuser (including performance): - Dynamic shapes (no recompilation) - Implicit handling of braodcast (broadcasted tensors are treated as tensors of the braodcasted size in the generated code) - Dropout Mid-term goals: - Transposes fused with pointwise operations where transpose involves only 2 axes (across the fused operation). - 1-D reductions fused with pointwise operations Pull Request resolved: https://github.com/pytorch/pytorch/pull/34785 Reviewed By: ZolotukhinM Differential Revision: D20650977 Pulled By: soumith fbshipit-source-id: ee39c95a880e1b9822e874ed4cc180971572bf63	2020-04-02 09:22:42 -07:00
Nick Gibson	051132f119	[TensorExpr] simplification of round + mod pattern. (#35683 ) Summary: Adds capabilities to the TensorExpr IR Simplifier to simplify down Round + Mod patterns (e.g. `(x/y)y + x%y => x`) via means of lifting integer rounding into a temporary `RoundOff` node. This integrates with existing simplification mechanisms (folding, factorization, reordering, etc) to allow simplification of compound expressions: e.g. `20 (x / (16 / 2)) * 2 + (11 % 6) * (x % (7+1)) => 5 * x.`. Tests: ran tensorexpr cpp and python tests, ran a hpc benchmark and verified results and time didn't regress. Pull Request resolved: https://github.com/pytorch/pytorch/pull/35683 Differential Revision: D20811316 Pulled By: nickgg fbshipit-source-id: 0cd6a517fb9548b3bc689768304b97375df5ac58	2020-04-02 00:11:00 -07:00
Ilia Cherniavskii	bc6bd0bb1a	Debug Information Guard Summary: This diff fixes the issues with current handling of debug information passed along the execution of the model. (For example, it is possible that multiple calls to the debug guard may override each other) Test Plan: CI test/cpp/jit Reviewed By: dzhulgakov Differential Revision: D20602775 fbshipit-source-id: 4683957954028af81a1a0f1f12b243650230c9bb	2020-04-01 01:55:29 -07:00
Wojciech Baranowski	2f84a07b58	indexing: throw exception for masks with dtype=uint8 (#34418 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/33751 Pull Request resolved: https://github.com/pytorch/pytorch/pull/34418 Differential Revision: D20776164 Pulled By: ngimel fbshipit-source-id: f4ebaabf427d7967f2f317235562f91c8f9216f0	2020-03-31 20:51:56 -07:00
Ilia Cherniavskii	800d5617c0	Recording of TorchScript functions (#34710 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34710 Extending RecordFunction API to support new recording scopes (such as TorchScript functions), as well as giving more flexibility to set sampling rate. Test Plan: unit test (test_misc.cpp/testRecordFunction) Reviewed By: gdankel, dzhulgakov Differential Revision: D20158523 fbshipit-source-id: a9e0819d21cc06f4952d92d43246587c36137582	2020-03-31 00:33:23 -07:00
Nick Gibson	5b3492df18	[TensorExpr] Extend arithmetic simplifier to work with multi variable expressions (Attempt 2) (#35415 ) Summary: https://github.com/pytorch/pytorch/pull/35127 was landed and reverted because I missed a test fail (oops). I have found and fixed the issue, which was due to zero terms being introduced after the point that filtered them out (usually required NAN/INF, e.g. x / INF => 0). See https://github.com/pytorch/pytorch/pull/35127 for more info. Pull Request resolved: https://github.com/pytorch/pytorch/pull/35415 Reviewed By: ZolotukhinM Differential Revision: D20702957 Pulled By: nickgg fbshipit-source-id: 119eb41e9fa676bd78e3d1df99297a47ae312185	2020-03-28 00:19:55 -07:00
Nikita Shulga	b9adbb5002	Fix/relax CMake linter rules (#35574 ) Summary: Ignore mixed upper-case/lower-case style for now Fix space between function and its arguments violation Pull Request resolved: https://github.com/pytorch/pytorch/pull/35574 Test Plan: CI Differential Revision: D20712969 Pulled By: malfet fbshipit-source-id: 0012d430aed916b4518599a0b535e82d15721f78	2020-03-27 16:52:33 -07:00
Nikolay Korovaiko	9e22d15f14	Enable tensorexpr cpp tests in CI. try #2 (#35454 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35454 Differential Revision: D20665160 Pulled By: Krovatkin fbshipit-source-id: e04cbe92b2ee5a3288f3c4e5c83533bfea85bf85	2020-03-27 12:09:55 -07:00
anjali411	5371fdb1a0	[C++ API Parity] [Optimizers] Merged Optimizer and LossClosureOptimizer (#34957 ) Summary: 1. Removed LossClosureOptimizer, and merged Optimizer into OptimizerBase (and renamed the merged class to Optimizer) 2. Merged the LBFGS-specific serialize test function and the generic test_serialize_optimizer function. 3. BC-compatibility serialization test for LBFGS 4. Removed mentions of parameters_ in optimizer.cpp, de-virtualize all functions 5. Made defaults_ optional argument in all optimizers except SGD TODO: add BC-breaking notes for this PR Pull Request resolved: https://github.com/pytorch/pytorch/pull/34957 Test Plan: Imported from GitHub, without a `Test Plan:` line. Differential Revision: D20678162 Pulled By: yf225 fbshipit-source-id: 74e062e42d86dc118f0fbaddd794e438b2eaf35a	2020-03-26 19:53:02 -07:00
Meghan Lele	6384c2d81b	[JIT] clang-format JIT code (#35115 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35115 This commit runs the newly added tools/clang_format.py on the JIT codebase and includes all of the formatting changes thus produced. Testing: Ran the script, CI. Test Plan: Imported from OSS Reviewed By: eellison Differential Revision: D20568523 Pulled By: SplitInfinity fbshipit-source-id: e09bdb982ccf090eecfb7c7b461b8d0681eef82b	2020-03-26 11:24:51 -07:00
Edward Yang	843fd740fb	Revert D20645945: [pytorch][PR] [C++ API Parity] [Optimizers] Merged Optimizer and LossClosureOptimizer Test Plan: revert-hammer Differential Revision: D20645945 Original commit changeset: 383588065bf1 fbshipit-source-id: 6d7bc5676de64e329d9862889f32033c76b4009c	2020-03-26 06:40:34 -07:00
Suraj Menon	aa01a95c6d	Revert D20630760: [pytorch][PR] Enable NNC tests vol. i. add test_tensorexpr.py tests [WIP] Test Plan: revert-hammer Differential Revision: D20630760 Original commit changeset: 7d2f27aca6b1 fbshipit-source-id: 28ac92b3390651a4a67061d6ebf208515b9b9463	2020-03-25 20:34:46 -07:00
anjali411	efbd6b8533	[C++ API Parity] [Optimizers] Merged Optimizer and LossClosureOptimizer (#34957 ) Summary: 1. Removed LossClosureOptimizer, and merged Optimizer into OptimizerBase (and renamed the merged class to Optimizer) 2. Merged the LBFGS-specific serialize test function and the generic test_serialize_optimizer function. 3. BC-compatibility serialization test for LBFGS 4. Removed mentions of parameters_ in optimizer.cpp, de-virtualize all functions 5. Made defaults_ optional argument in all optimizers except SGD TODO: add BC-breaking notes for this PR Pull Request resolved: https://github.com/pytorch/pytorch/pull/34957 Differential Revision: D20645945 Pulled By: yf225 fbshipit-source-id: 383588065bf1859b38f0ad0a25d93d41e153c96e	2020-03-25 18:26:02 -07:00
Nikolay Korovaiko	f3a5081bd4	Enable NNC tests vol. i. add test_tensorexpr.py tests [WIP] (#34897 ) Summary: This PR add tensorexpr cpp tests to test_jit.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/34897 Differential Revision: D20630760 Pulled By: Krovatkin fbshipit-source-id: 7d2f27aca6b1e23e3ffed1c765d8f590688118e3	2020-03-25 17:23:48 -07:00
Nikita Shulga	512bcf68be	[Formatting] `if (` -> `if(` in CMakeLists.txt (#35343 ) Summary: Same to `else`, `endif` and `elseif`. Also prefer lowercase over uppercase ones Pull Request resolved: https://github.com/pytorch/pytorch/pull/35343 Test Plan: None at all Differential Revision: D20638789 Pulled By: malfet fbshipit-source-id: 8058075693185e66f5dda7b825b725e139d0d000	2020-03-25 13:48:42 -07:00
Mikhail Zolotukhin	ceb4ed3733	[TensorExpr] Methods name cleanup in LoopNest class. (#35174 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35174 Differential Revision: D20585575 Test Plan: Imported from OSS Pulled By: ZolotukhinM fbshipit-source-id: 0fa8e1e85e1502b9a86cf34608cb791ffb23d395	2020-03-25 11:51:11 -07:00
Mikhail Zolotukhin	450738662b	[TensorExpr] Replace `ExprHandle` with `const Expr*` in `Substitute`. (#35173 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35173 Differential Revision: D20585577 Test Plan: Imported from OSS Pulled By: ZolotukhinM fbshipit-source-id: 902f9740a0b97c3d2a0eef2c274d8227b975b3cb	2020-03-25 11:48:14 -07:00
Alban Desmaison	a7f8655314	Revert D20624571: [pytorch][PR] [TensorExpr] Extend arithmetic simplifier to work with multi variable expressions Test Plan: revert-hammer Differential Revision: D20624571 Original commit changeset: e49049377bee fbshipit-source-id: 7d8dda0c3b44be1c3236a0313bbfa128b7015de7	2020-03-24 16:59:51 -07:00
Nick Gibson	fce67800f4	[TensorExpr] Extend arithmetic simplifier to work with multi variable expressions (#35127 ) Summary: A new version of the IR simplifier used by the jit/tensorexpr fuser. This is capable of simplifying expressions containing (shock) multiple variables, eg: ```(m * (1 * n_1) + (n + 1)) - (m * (1 * n_1) + n) => 1``` Similar to the previous IR Simplifier it uses a two stage approach: 1. Traverse the tree combining subtree's of commutable operations in to a flat structure. In this implementation we have two intermediate Exprs: Term (expressing products of sub expressions) and Polynomial (expressing sums of sub expressions). 2. Traverse the tree expanding Term's and Polynomials into their component operators. Using the example above we execute with a process like this to simplify: ``` (m * (1 * n_1) + (n + 1)) - (m * (1 * n_1) + n) # Using PolynomialTransformer: => Sub(Add(Mul(m, Mul(1, n_1)), Add(n, 1)), Add(Mul(m, Mul(1, n_1)), n)) => Sub(Polynomial(Term(m, n_1), n, 1), Polynomial(Term(m, n_1), n)) => Polynomial(Term(m, n_1), Term(-1, m, n_1), n, -n, 1) => Polynomial(1) # Using TermExpander => 1 ``` The IRSimplifier supports arithmetic simplifications of operators Add, Sub and Mul and constant folding of all binary Exprs and Intrinsics, but does not attempt expansion of multiplication of Polynomials to the canonical form since that generally leads to less efficient representations. It will do scalar factorization if it results in removal of operators, and will merge chains of multilane primitives (such as Broadcast and Ramp) down into a single operator. The ir_simplifier unit tests are a short tour of its capabilities. The existing simplifier has a bug where it will sometimes reorder operations on floating point types which are not associative. This causes (at least) the pyhpc equation_of_state benchmark to produce incorrect results. I have fixed that issue in this version and verified that that benchmark produces the same results with and without the simplifier. Tests: all cpp & py tensorexpr tests, and pyphc benchmark: ``` benchmarks.equation_of_state ============================ Running on CPU size backend calls mean stdev min 25% median 75% max Δ ------------------------------------------------------------------------------------------------------------------ 4,194,304 pytorch 10 0.246 0.002 0.243 0.245 0.246 0.248 0.250 1.000 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/35127 Differential Revision: D20624571 Pulled By: nickgg fbshipit-source-id: e49049377beee69e02dcf26eb922bef1447ae776	2020-03-24 14:16:07 -07:00
James Reed	618c6214aa	[reapply][JIT] Namespaces for TorchBind (#35254 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35254 Reapply D20541090 with some BC fixes ghstack-source-id: 100733987 Test Plan: buck test mode/dev-nosan //caffe2/torch/fb/predictor/model_repo/tests:ai_infra_representative_model_shard_6_test -- 'RepresentativeModelTest\/ShardedRepresentativeModelTest\.RunModel\/0' Reviewed By: zdevito Differential Revision: D20607111 fbshipit-source-id: 80f148d860571208c93e9308128cd480ff089f74	2020-03-24 00:39:48 -07:00
Nikita Shulga	c46c28a7cb	Fix `JitTest.ADFormulas` intermittent failures (#35196 ) Summary: Clamp input tensor values to [3, 3] to limit how small `tanh` gradint can get Pull Request resolved: https://github.com/pytorch/pytorch/pull/35196 Test Plan: CI + `bin/test_jit --gtest_filter=JitTest.ADFormulas --gtest_repeat=60000 --gtest_break_on_failure` Differential Revision: D20611256 Pulled By: malfet fbshipit-source-id: 8640faa5d8567d6c6df8cc5df80c2e65407116eb	2020-03-23 22:21:30 -07:00
Will Feng	cfc0ff1691	Renaming: MultiLabelMarginLossFuncOptions -> MultilabelMarginLossFuncOptions, MultiLabelSoftMarginLossFuncOptions -> MultilabelSoftMarginLossFuncOptions (#35163 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35163 This PR is BC-breaking in the following way: Renaming: - `torch::nn::functional::MultiLabelMarginLossFuncOptions` -> `torch::nn::functional::MultilabelMarginLossFuncOptions` - `torch::nn::functional::MultiLabelSoftMarginLossFuncOptions` -> `torch::nn::functional::MultilabelSoftMarginLossFuncOptions` Reason for renaming: to be consistent with the corresponding functional name after camel case to snake case conversion (e.g. the `multilabel_margin_loss` functional should use `MultilabelMarginLossFuncOptions` as options) Test Plan: Imported from OSS Differential Revision: D20582598 Pulled By: yf225 fbshipit-source-id: 0f5bdb8249d901b310875a14320449a2fdfa8ecd	2020-03-21 18:34:46 -07:00
Lu Fang	a100cf5146	Revert D20541090: [JIT][torchbind] Namespaces for torchbind classes Test Plan: revert-hammer Differential Revision: D20541090 Original commit changeset: ce3d9391dd3c fbshipit-source-id: acc1d660fbda611941381315507dfe594c385db1	2020-03-21 12:20:44 -07:00
Will Feng	bbec4520c6	Add inplace tests for several torch::nn modules / functionals (#35147 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35147 Test Plan: Imported from OSS Differential Revision: D20578217 Pulled By: yf225 fbshipit-source-id: b8bafa49ee94c7dfbbca6e100ee3d9df5b2b621c	2020-03-21 10:02:56 -07:00
Mikhail Zolotukhin	95ad94c75b	[TensorExpr] Nuke tensorexpr::schedule namespace. (#35126 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35126 Test Plan: Imported from OSS Differential Revision: D20569364 Pulled By: ZolotukhinM fbshipit-source-id: c0d51ecadf411918641cdbdc6d8cb06e207d2c9b	2020-03-20 23:39:14 -07:00
Mikhail Zolotukhin	65cea95777	[TensorExpr] Rename schedule.{cpp,h} to loopnest.{cpp,h}. (#35119 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35119 Differential Revision: D20567927 Test Plan: Imported from OSS Pulled By: ZolotukhinM fbshipit-source-id: 1fb6d03bd4c6e66aca62140d2b537692577f261d	2020-03-20 23:37:51 -07:00
Will Feng	a2557970f3	Fix F::interpolate and torch::nn::Upsample implementation (#35025 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35025 This PR fixes `F::interpolate` and `torch::nn::Upsample` implementation to match the Python API implementation. This PR is BC-breaking in the following way: There are changes to `UpsampleOptions` and `InterpolateFuncOptions`: - `size` is changed from `std::vector<int64_t>` to `c10::optional<std::vector<int64_t>>`. If you want to pass a list of `int64_t` to this argument, you must pass it as `std::vector<int64_t>`. - `scale_factor` is changed from `std::vector<double>` to `c10::optional<std::vector<double>>`. If you want to pass a list of `double` to this argument, you must pass it as `std::vector<double>`. TODO: cherry-pick this PR into v1.5 release branch. Test Plan: Imported from OSS Differential Revision: D20559892 Pulled By: yf225 fbshipit-source-id: ac18609e351a9f2931eaeced8966b9491b2995f7	2020-03-20 22:37:13 -07:00
Will Feng	d7462dcea6	Fix AdaptiveAvgPool{2,3}d and AdaptiveMaxPool{2,3}d implementation (#35022 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35022 This PR fixes `AdaptiveAvgPool{2,3}d` and `AdaptiveMaxPool{2,3}d` implementation to match the Python API implementation. Particularly, `output_size` is changed to accept `c10::nullopt` in its elements, matching the Python API behavior. TODO: cherry-pick this PR into v1.5 release branch. Test Plan: Imported from OSS Differential Revision: D20559890 Pulled By: yf225 fbshipit-source-id: ccddbd278dd39165cf1dda11fc0e49387c76dbef	2020-03-20 22:36:57 -07:00
James Reed	e0496a70fc	[JIT][torchbind] Namespaces for torchbind classes (#35054 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35054 Test Plan: Imported from OSS Differential Revision: D20541090 Pulled By: jamesr66a fbshipit-source-id: ce3d9391dd3cdf619042b8f6ba2645f4c1fc875c	2020-03-20 20:07:02 -07:00
anjali411	781f590f33	[C++ API Parity] Add xor_convergence test for lbfgs (#35001 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35001 Differential Revision: D20548983 Pulled By: anjali411 fbshipit-source-id: 1f858635d0680c0109d1ef348b7df4d3844fe0a6	2020-03-20 06:57:24 -07:00
Michael Suo	8210b2054e	Move ivalue tests to aten (#34985 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34985 IValue is part of the overall runtime system, not just the JIT. So it should be tested in the ATen tests. The real motivation though is so that I can use gtest directly, not the hacked-up version the JIT uses. Test Plan: Imported from OSS Differential Revision: D20537902 Pulled By: suo fbshipit-source-id: 09897e015ecde24aa8996babeaa08d98db90ef0d	2020-03-19 17:56:37 -07:00
Edward Yang	7c06b86e42	Revert D20518647: [pytorch][PR] [C++ API Parity] [Optimizers] Merged Optimizer and LossClosureOptimizer Test Plan: revert-hammer Differential Revision: D20518647 Original commit changeset: 4760d1d29df1 fbshipit-source-id: b84f1a06c2de27e147716279223a6844ef89f760	2020-03-19 07:53:43 -07:00
Natalia Gimelshein	be82e554fe	Revert D20524479: [pytorch][PR] [C++ API Parity] Add xor_convergence test for lbfgs Test Plan: revert-hammer Differential Revision: D20524479 Original commit changeset: 3413779676ab fbshipit-source-id: ef8007ed6c184bc8b8751eb713aac2a891260048	2020-03-18 21:56:17 -07:00
anjali411	b8e043abca	[C++ API Parity] [Optimizers] Merged Optimizer and LossClosureOptimizer (#34957 ) Summary: 1. Removed LossClosureOptimizer, and merged Optimizer into OptimizerBase (and renamed the merged class to Optimizer) 2. Merged the LBFGS-specific serialize test function and the generic test_serialize_optimizer function. 3. BC-compatibility serialization test for LBFGS 4. Removed mentions of parameters_ in optimizer.cpp, de-virtualize all functions 5. Made defaults_ optional argument in all optimizers except SGD Pull Request resolved: https://github.com/pytorch/pytorch/pull/34957 Test Plan: Imported from GitHub, without a `Test Plan:` line. Differential Revision: D20518647 Pulled By: anjali411 fbshipit-source-id: 4760d1d29df1784e2d01e2a476d2a08e9df4ea1c	2020-03-18 17:28:57 -07:00
anjali411	4521477f83	[C++ API Parity] Add xor_convergence test for lbfgs (#35001 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35001 Differential Revision: D20524479 Pulled By: anjali411 fbshipit-source-id: 3413779676ab95c1ee82298f95d3441a89873107	2020-03-18 17:06:53 -07:00
anjali411	d7e4a379a0	[C++ API Parity] LBFGS optimizer step() update and added closure to the Optimizer step() function (#34564 ) Summary: Follow-ups after this PR: * Remove `LossClosureOptimizer`, and merge `Optimizer` into `OptimizerBase` (and rename the merged class to Optimizer) * Merge the LBFGS-specific serialize test function and the generic `test_serialize_optimizer` function, possibly by passing a bool `has_only_global_state` flag into the `test_serialize_optimizer` function to denote whether `size()` should be equal to 1 or 2? * https://github.com/pytorch/pytorch/pull/34564#discussion_r393780303 * It seems that we don't have the equivalent `XORConvergence_LBFGS` test like the other optimizers, and it would be good to add one * Remove mentions of `parameters_` in optimizer.cpp, de-virtualize all functions, and remove the `OptimizerBase(std::vector<Tensor> parameters)` constructor from `OptimizerBase` Pull Request resolved: https://github.com/pytorch/pytorch/pull/34564 Test Plan: Imported from GitHub, without a `Test Plan:` line. Differential Revision: D20495701 Pulled By: anjali411 fbshipit-source-id: 6d35286d2decb6f7dff93d9d3e57515770666622	2020-03-17 22:27:24 -07:00
James Reed	09a7788a2f	[torchbind] Improve IValue custom class API and remove most Capsule stuff (#34848 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34848 Test Plan: Imported from OSS Differential Revision: D20480514 Pulled By: jamesr66a fbshipit-source-id: 1c595faf34e00aab0a6202a8902426bd310551c3	2020-03-17 20:39:34 -07:00
Mikhail Zolotukhin	95833a49e6	[TensorExpr] Pull changes from bertmaher/pytorch_fusion. (#34842 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34842 This PR (hopefully the last one of such kind) is merging changes from a side branch where tensor expessions based fuser work has been done so far. This PR is is a squashed version of changes in the side branch, which is available here: https://github.com/bertmaher/pytorch Differential Revision: D20478208 Test Plan: Imported from OSS Pulled By: ZolotukhinM fbshipit-source-id: 21556e009f1fd88099944732edba72ac40e9b9c0	2020-03-17 11:02:48 -07:00
James Reed	089a0a2117	[torchbind] Test moving custom classes to/from IValue (#34847 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34847 Test Plan: Imported from OSS Differential Revision: D20480512 Pulled By: jamesr66a fbshipit-source-id: 87f5f8ea8764e26d383b17e4f72538166ddd0655	2020-03-16 23:57:42 -07:00
Mikhail Zolotukhin	ea5c86c276	[TensorExpr] Add LLVM codegen. (#34228 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34228 This PR adds LLVM codegen to tensor expressions. LLVM is added as an optional build dependency specified with `USE_LLVM=<path_to_llvm>` variable. If this variable is not set or LLVM is not found in the specified path, the LLVM codegen is completely disabled. Differential Revision: D20251832 Test Plan: Imported from OSS Pulled By: ZolotukhinM fbshipit-source-id: 77e203ab4421eb03afc64f8da17e0daab277ecc2	2020-03-16 11:49:34 -07:00
Mikhail Zolotukhin	35e7efeb9a	[TensorExpr] Add CUDA codegen. (#34227 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34227 This PR adds a CUDA support to tensor expressions. Differential Revision: D20251836 Test Plan: Imported from OSS Pulled By: ZolotukhinM fbshipit-source-id: ab36a55834cceff30c8371fef6cca1054a32f017	2020-03-16 11:49:29 -07:00
Mikhail Zolotukhin	e31d462e92	[TensorExpr] Pull changes to core classes for representing expressions and statements from the side branch. (#34224 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34224 Our development has been happening on a side branch `pytorch_fusion` in `bertmaher/pytorch` fork. This PR moves changes to the core classes representing expressions and transformations on them. At this moment, the tensor expressions are only used in tests. Subsequent PRs add LLVM and CUDA codegen for tensor expressions and implement fuser on top of these. This PR is huge as it is a squashed version of changes in the side branch. It is not practical to pull changes one by one from the branch, so here is the squashed version. If you're interested in seeing the history of changes, please refer to https://github.com/bertmaher/pytorch Differential Revision: D20251835 Test Plan: Imported from OSS Pulled By: ZolotukhinM fbshipit-source-id: 1a871acc09cf3c6f7fb4af40d408cdbb82dc7dab	2020-03-16 11:47:47 -07:00
peter	24c9e61e79	Enable JIT tests on Windows (#27029 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27029 Reviewed By: eellison Differential Revision: D20458664 Pulled By: jamesr66a fbshipit-source-id: 22be918543703869f471e89b3478423198351bf3	2020-03-16 11:26:21 -07:00
anjali411	762be86e63	[C++ API Parity] [Optimizers] added closure to optimizers (#34790 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34790 Differential Revision: D20468361 Pulled By: anjali411 fbshipit-source-id: 1c6115d735b211dc2bedf002d58931cb32cf657a	2020-03-16 07:51:44 -07:00
Will Feng	bdd7dbfd4b	[C++ API] RNN / GRU / LSTM layer refactoring (#34322 ) Summary: This PR refactors RNN / GRU / LSTM layers in C++ API to exactly match the implementation in Python API. BC-breaking changes: - Instead of returning `RNNOutput`, RNN / GRU forward method now returns `std::tuple<Tensor, Tensor>`, and LSTM forward method now returns `std::tuple<Tensor, std::tuple<Tensor, Tensor>>`, matching Python API. - RNN / LSTM / GRU forward method now accepts the same inputs (input tensor and optionally hidden state), matching Python API. - RNN / LSTM / GRU layers now have `forward_with_packed_input` method which accepts `PackedSequence` as input and optionally hidden state, matching the `forward(PackedSequence, ...)` variant in Python API. - RNN / LSTM / GRU layers no longer have these fields: `w_ih` / `w_hh` / `b_ih` / `b_hh`. Instead, to access the weights and biases of the gates, users should do e.g. `rnn->named_parameters()["weight_ih_l0"]`, which mirrors the Python API `rnn.weight_ih_l0`. - In `RNNOptions` - `tanh()` / `relu()` / `activation` are removed. Instead, `nonlinearity` is added which takes either `torch::kTanh` or `torch::kReLU` - `layers` -> `num_layers` - `with_bias` -> `bias` - In `LSTMOptions` - `layers` -> `num_layers` - `with_bias` -> `bias` - In `GRUOptions` - `layers` -> `num_layers` - `with_bias` -> `bias` The majority of the changes in this PR focused on refactoring the implementations in `torch/csrc/api/src/nn/modules/rnn.cpp` to match the Python API. RNN tests are then changed to reflected the revised API design. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34322 Differential Revision: D20458302 Pulled By: yf225 fbshipit-source-id: ffff2ae1ddb1c742c966956f6ad4d7fba03dc54d	2020-03-15 17:48:29 -07:00
Martin Yuan	d4f182d06b	Add overloaded name to prim operators (#34280 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34280 To have prim ops searchable for lite interpreter, overloaded names need to be added for the operators with the same name but different schema. For example, aten::add in register_prim_ops.cpp. The difference is a combination of args and output type. `"aten::add(str a, str b) ->str"` `"aten::add(int a, int b) ->int"` `"aten::add(float a, float b) ->float"` `"aten::add(int a, float b) ->float"` `"aten::add(float a, int b) ->float"` `"aten::add(Scalar a, Scalar b) ->Scalar"` Solution: Use the argument type and/or output type (the same to the existing overloaded names). The overloaded name should be minimum as long as the operators can be differentiated. For other operators please look into the source code change for details. `"aten::add.str(str a, str b) ->str"` `"aten::add.int(int a, int b) ->int"` `"aten::add.float(float a, float b) ->float"` `"aten::add.int_float(int a, float b) ->float"` `"aten::add.float_int(float a, int b) ->float"` `"aten::add.Scalar_Scalar(Scalar a, Scalar b) ->Scalar"` Test Plan: Imported from OSS Differential Revision: D20456997 Pulled By: iseeyuan fbshipit-source-id: 2c3dc324b4a4e045559f62c6cc2a10fbb9a72dcf	2020-03-15 17:05:54 -07:00
Will Feng	6c555e1508	Revert D20311699: [pytorch][PR] [C++ API] RNN / GRU / LSTM layer refactoring Test Plan: revert-hammer Differential Revision: D20311699 Original commit changeset: e2b60fc7bac6 fbshipit-source-id: 72f4a762189490998d6b716857eeac053a11742d	2020-03-14 16:18:48 -07:00
Will Feng	e23a9dc140	[C++ API] RNN / GRU / LSTM layer refactoring (#34322 ) Summary: This PR refactors RNN / GRU / LSTM layers in C++ API to exactly match the implementation in Python API. BC-breaking changes: - Instead of returning `RNNOutput`, RNN / GRU forward method now returns `std::tuple<Tensor, Tensor>`, and LSTM forward method now returns `std::tuple<Tensor, std::tuple<Tensor, Tensor>>`, matching Python API. - RNN / LSTM / GRU forward method now accepts the same inputs (input tensor and optionally hidden state), matching Python API. - RNN / LSTM / GRU now has `forward_with_packed_input` method which accepts `PackedSequence` as input and optionally hidden state, matching the `forward(PackedSequence, ...)` variant in Python API. - In `RNNOptions` - `tanh()` / `relu()` / `activation` are removed. Instead, `nonlinearity` is added which takes either `torch::kTanh` or `torch::kReLU` - `layers` -> `num_layers` - `with_bias` -> `bias` - In `LSTMOptions` - `layers` -> `num_layers` - `with_bias` -> `bias` - In `GRUOptions` - `layers` -> `num_layers` - `with_bias` -> `bias` The majority of the changes in this PR focused on refactoring the implementations in `torch/csrc/api/src/nn/modules/rnn.cpp` to match the Python API. RNN tests are then changed to reflected the revised API design. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34322 Differential Revision: D20311699 Pulled By: yf225 fbshipit-source-id: e2b60fc7bac64367a8434647d74c08568a7b28f7	2020-03-14 12:09:04 -07:00

1 2 3 4 5 ...

798 commits