pytorch

mirror of https://github.com/saymrwulf/pytorch.git synced 2026-05-15 21:00:47 +00:00

Author	SHA1	Message	Date
Mikhail Zolotukhin	35e7efeb9a	[TensorExpr] Add CUDA codegen. (#34227 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34227 This PR adds a CUDA support to tensor expressions. Differential Revision: D20251836 Test Plan: Imported from OSS Pulled By: ZolotukhinM fbshipit-source-id: ab36a55834cceff30c8371fef6cca1054a32f017	2020-03-16 11:49:29 -07:00
Mikhail Zolotukhin	e31d462e92	[TensorExpr] Pull changes to core classes for representing expressions and statements from the side branch. (#34224 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34224 Our development has been happening on a side branch `pytorch_fusion` in `bertmaher/pytorch` fork. This PR moves changes to the core classes representing expressions and transformations on them. At this moment, the tensor expressions are only used in tests. Subsequent PRs add LLVM and CUDA codegen for tensor expressions and implement fuser on top of these. This PR is huge as it is a squashed version of changes in the side branch. It is not practical to pull changes one by one from the branch, so here is the squashed version. If you're interested in seeing the history of changes, please refer to https://github.com/bertmaher/pytorch Differential Revision: D20251835 Test Plan: Imported from OSS Pulled By: ZolotukhinM fbshipit-source-id: 1a871acc09cf3c6f7fb4af40d408cdbb82dc7dab	2020-03-16 11:47:47 -07:00
peter	24c9e61e79	Enable JIT tests on Windows (#27029 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27029 Reviewed By: eellison Differential Revision: D20458664 Pulled By: jamesr66a fbshipit-source-id: 22be918543703869f471e89b3478423198351bf3	2020-03-16 11:26:21 -07:00
anjali411	762be86e63	[C++ API Parity] [Optimizers] added closure to optimizers (#34790 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34790 Differential Revision: D20468361 Pulled By: anjali411 fbshipit-source-id: 1c6115d735b211dc2bedf002d58931cb32cf657a	2020-03-16 07:51:44 -07:00
Will Feng	bdd7dbfd4b	[C++ API] RNN / GRU / LSTM layer refactoring (#34322 ) Summary: This PR refactors RNN / GRU / LSTM layers in C++ API to exactly match the implementation in Python API. BC-breaking changes: - Instead of returning `RNNOutput`, RNN / GRU forward method now returns `std::tuple<Tensor, Tensor>`, and LSTM forward method now returns `std::tuple<Tensor, std::tuple<Tensor, Tensor>>`, matching Python API. - RNN / LSTM / GRU forward method now accepts the same inputs (input tensor and optionally hidden state), matching Python API. - RNN / LSTM / GRU layers now have `forward_with_packed_input` method which accepts `PackedSequence` as input and optionally hidden state, matching the `forward(PackedSequence, ...)` variant in Python API. - RNN / LSTM / GRU layers no longer have these fields: `w_ih` / `w_hh` / `b_ih` / `b_hh`. Instead, to access the weights and biases of the gates, users should do e.g. `rnn->named_parameters()["weight_ih_l0"]`, which mirrors the Python API `rnn.weight_ih_l0`. - In `RNNOptions` - `tanh()` / `relu()` / `activation` are removed. Instead, `nonlinearity` is added which takes either `torch::kTanh` or `torch::kReLU` - `layers` -> `num_layers` - `with_bias` -> `bias` - In `LSTMOptions` - `layers` -> `num_layers` - `with_bias` -> `bias` - In `GRUOptions` - `layers` -> `num_layers` - `with_bias` -> `bias` The majority of the changes in this PR focused on refactoring the implementations in `torch/csrc/api/src/nn/modules/rnn.cpp` to match the Python API. RNN tests are then changed to reflected the revised API design. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34322 Differential Revision: D20458302 Pulled By: yf225 fbshipit-source-id: ffff2ae1ddb1c742c966956f6ad4d7fba03dc54d	2020-03-15 17:48:29 -07:00
Martin Yuan	d4f182d06b	Add overloaded name to prim operators (#34280 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34280 To have prim ops searchable for lite interpreter, overloaded names need to be added for the operators with the same name but different schema. For example, aten::add in register_prim_ops.cpp. The difference is a combination of args and output type. `"aten::add(str a, str b) ->str"` `"aten::add(int a, int b) ->int"` `"aten::add(float a, float b) ->float"` `"aten::add(int a, float b) ->float"` `"aten::add(float a, int b) ->float"` `"aten::add(Scalar a, Scalar b) ->Scalar"` Solution: Use the argument type and/or output type (the same to the existing overloaded names). The overloaded name should be minimum as long as the operators can be differentiated. For other operators please look into the source code change for details. `"aten::add.str(str a, str b) ->str"` `"aten::add.int(int a, int b) ->int"` `"aten::add.float(float a, float b) ->float"` `"aten::add.int_float(int a, float b) ->float"` `"aten::add.float_int(float a, int b) ->float"` `"aten::add.Scalar_Scalar(Scalar a, Scalar b) ->Scalar"` Test Plan: Imported from OSS Differential Revision: D20456997 Pulled By: iseeyuan fbshipit-source-id: 2c3dc324b4a4e045559f62c6cc2a10fbb9a72dcf	2020-03-15 17:05:54 -07:00
Will Feng	6c555e1508	Revert D20311699: [pytorch][PR] [C++ API] RNN / GRU / LSTM layer refactoring Test Plan: revert-hammer Differential Revision: D20311699 Original commit changeset: e2b60fc7bac6 fbshipit-source-id: 72f4a762189490998d6b716857eeac053a11742d	2020-03-14 16:18:48 -07:00
Will Feng	e23a9dc140	[C++ API] RNN / GRU / LSTM layer refactoring (#34322 ) Summary: This PR refactors RNN / GRU / LSTM layers in C++ API to exactly match the implementation in Python API. BC-breaking changes: - Instead of returning `RNNOutput`, RNN / GRU forward method now returns `std::tuple<Tensor, Tensor>`, and LSTM forward method now returns `std::tuple<Tensor, std::tuple<Tensor, Tensor>>`, matching Python API. - RNN / LSTM / GRU forward method now accepts the same inputs (input tensor and optionally hidden state), matching Python API. - RNN / LSTM / GRU now has `forward_with_packed_input` method which accepts `PackedSequence` as input and optionally hidden state, matching the `forward(PackedSequence, ...)` variant in Python API. - In `RNNOptions` - `tanh()` / `relu()` / `activation` are removed. Instead, `nonlinearity` is added which takes either `torch::kTanh` or `torch::kReLU` - `layers` -> `num_layers` - `with_bias` -> `bias` - In `LSTMOptions` - `layers` -> `num_layers` - `with_bias` -> `bias` - In `GRUOptions` - `layers` -> `num_layers` - `with_bias` -> `bias` The majority of the changes in this PR focused on refactoring the implementations in `torch/csrc/api/src/nn/modules/rnn.cpp` to match the Python API. RNN tests are then changed to reflected the revised API design. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34322 Differential Revision: D20311699 Pulled By: yf225 fbshipit-source-id: e2b60fc7bac64367a8434647d74c08568a7b28f7	2020-03-14 12:09:04 -07:00
James Reed	fb20621b3b	Move torchbind out of jit namespace (#34745 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34745 Test Plan: Imported from OSS Differential Revision: D20450239 Pulled By: jamesr66a fbshipit-source-id: 3f5597626f21d7b5e329b57da358c76b531bf806	2020-03-13 23:03:14 -07:00
Will Feng	d041d0784e	[C++ API] RNNCell / LSTMCell / GRUCell layers (#34400 ) Summary: This PR adds `RNNCell` / `LSTMCell` / `GRUCell` layers to the C++ frontend, with implementations exactly matching the Python API equivalent. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34400 Differential Revision: D20316859 Pulled By: yf225 fbshipit-source-id: bb7cee092622334043c0d0fd0fcb4e75e707699c	2020-03-13 21:52:24 -07:00
James Reed	ab76a8206f	[JIT][mobile] Support built-in Function call in lite interpreter (#34676 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34676 Test Plan: Imported from OSS Differential Revision: D20427938 Pulled By: jamesr66a fbshipit-source-id: 79eebfa858776f26da55ffd49d3f78fa7ae0df9b	2020-03-13 18:24:18 -07:00
eellison	44256199a9	[JIT] remove specialized list ops (#34520 ) Summary: Now that lists are no longer specialized, we can register only one operator for list ops that are generic to their element type. This PR reorgs lists into three sets of ops: - CREATE_GENERIC_LIST_OPS - CREATE_SPECIALIZED_LIST_OPS - CREATE_COMPARATOR_LIST_OPS_SPECIALIZED (we didn't bind certain specialized ops to Tensor) This is important to land quickly because mobile is finalizing its bytecode soon, after which we could not remove these ops. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34520 Reviewed By: iseeyuan Differential Revision: D20429775 Pulled By: eellison fbshipit-source-id: ae6519f9b0f731eaa2bf4ac20736317d0a66b8a0	2020-03-12 17:49:23 -07:00
Elias Ellison	787c307e63	Revert D20368543: [pytorch][PR] [JIT] remove specialized list ops Test Plan: revert-hammer Differential Revision: D20368543 Original commit changeset: ad0c6d70d2a6 fbshipit-source-id: b8b1a64ac830d5f544567714b940c57274194d3f	2020-03-12 12:55:49 -07:00
Jeremy Lilley	fff6fe83a7	[pytorch-rpc] WireSerializer should check has_storage() (#34626 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34626 We need to check has_storage() before looking at it in cloneSparseTensors(), to avoid gratuitously throwing. Ideally, we'd add a test for this (I wrote one up but had to disable it), but won't work until JIT Pickler supports sparse tensors. ghstack-source-id: 100018077 Test Plan: buck test mode/dev-nosan caffe2/torch/fb/distributed/thriftRpcAgent/... Differential Revision: D20399971 fbshipit-source-id: 5debfa8140eb1f949d37336330223962cc320abc	2020-03-12 11:35:21 -07:00
eellison	f9f8424386	[JIT] remove specialized list ops (#34520 ) Summary: Now that lists are no longer specialized, we can register only one operator for list ops that are generic to their element type. This PR reorgs lists into three sets of ops: - CREATE_GENERIC_LIST_OPS - CREATE_SPECIALIZED_LIST_OPS - CREATE_COMPARATOR_LIST_OPS_SPECIALIZED (we didn't bind certain specialized ops to Tensor) This is important to land quickly because mobile is finalizing its bytecode soon, after which we could not remove these ops. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34520 Differential Revision: D20368543 Pulled By: eellison fbshipit-source-id: ad0c6d70d2a6be6ff0e948d6786052167fc43e27	2020-03-12 10:48:14 -07:00
Will Feng	a54416d208	[C++ API] Remove deprecated torch::nn::BatchNorm / FeatureDropout / modules_ordered_dict and torch::nn::init::Nonlinearity / FanMode (#34508 ) Summary: This PR is BC-breaking in the following way: - The deprecated `torch::nn::BatchNorm` is removed in favor of `torch::nn::BatchNorm{1,2,3}d` - The deprecated `torch::nn::FeatureDropout` is removed in favor of `torch::nn::Dropout{2,3}d` - The deprecated `torch::nn::modules_ordered_dict` is removed. User should do `Sequential sequential({{"m1", MyModule(1)}, {"m2", MyModule(2)}})` instead. - The deprecated `torch::nn::init::Nonlinearity` is removed, in favor of the following enums: - `torch::kLinear` - `torch::kConv1D` - `torch::kConv2D` - `torch::kConv3D` - `torch::kConvTranspose1D` - `torch::kConvTranspose2D` - `torch::kConvTranspose3D` - `torch::kSigmoid` - `torch::kTanh` - `torch::kReLU` - `torch::kLeakyReLU` - The deprecated `torch::nn::init::FanMode` is removed, in favor of the following enums: - `torch::kFanIn` - `torch::kFanOut` Pull Request resolved: https://github.com/pytorch/pytorch/pull/34508 Differential Revision: D20351601 Pulled By: yf225 fbshipit-source-id: cca0cd112f29a31bb023e348ca8f82780e42bea3	2020-03-12 10:09:58 -07:00
Mansoor	e95657b87e	[C++ API] AdaptiveLogSoftmaxWithLoss (#29076 ) Summary: Implemented AdaptiveLogSoftmaxWithLoss and some tests for modules. Reference https://github.com/pytorch/pytorch/issues/25883 Pull Request resolved: https://github.com/pytorch/pytorch/pull/29076 Differential Revision: D20404588 Pulled By: yf225 fbshipit-source-id: edbadf432b8173cbcc6caf83c9c03dd92dc31a37	2020-03-12 09:53:58 -07:00
Michael Suo	c235be42dd	[jit] kill script namespace (#34515 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34515 Once upon a time we thought this was necessary. In reality it is not, so removing it. For backcompat, our public interface (defined in `api/`) still has typedefs to the old `script::` names. There was only one collision: `Pass` as a `Stmt` and `Pass` as a graph transform. I renamed one of them. Test Plan: Imported from OSS Differential Revision: D20353503 Pulled By: suo fbshipit-source-id: 48bb911ce75120a8c9e0c6fb65262ef775dfba93	2020-03-11 23:32:48 -07:00
Edward Yang	cf8b728255	Delete OperatorOptions, absorb AliasAnalysisKind into FunctionSchema. (#34588 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34588 I constructed the patch by deleting OperatorOptions and then rerouting all queries for AliasAnalysisKind to FunctionSchema. Some of the behavior is kind of bogus: we really shouldn't be mutating FunctionSchema after the fact, but that won't get fixed until we actually switch to true schema merging. Reland of https://github.com/pytorch/pytorch/pull/34160 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Differential Revision: D20387079 Pulled By: ezyang fbshipit-source-id: d189f7a6ad8cd186b88b6fbfa3f189994eea14e8	2020-03-11 20:59:46 -07:00
Edward Yang	6f8a8e4e47	Revert D20282846: Delete OperatorOptions, absorb AliasAnalysisKind into FunctionSchema. Test Plan: revert-hammer Differential Revision: D20282846 Original commit changeset: ba7bca6e8adc fbshipit-source-id: b9e15d2b2c3d1dbc6e971ab3c0bdf380e769dcf1	2020-03-11 07:50:29 -07:00
Edward Yang	9d42177a31	Delete OperatorOptions, absorb AliasAnalysisKind into FunctionSchema. (#34160 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34160 I constructed the patch by deleting OperatorOptions and then rerouting all queries for AliasAnalysisKind to FunctionSchema. Some of the behavior is kind of bogus: we really shouldn't be mutating FunctionSchema after the fact, but that won't get fixed until we actually switch to true schema merging. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Differential Revision: D20282846 Pulled By: ezyang fbshipit-source-id: ba7bca6e8adc3365789639b88e54c4e881b1692e	2020-03-11 07:15:18 -07:00
davidriazati	23b2fba79a	[jit] Add type tags to lists/dicts in pickle (#33255 ) Summary: Stacked PRs * #33474 - [jit] Remove list specializations from pickler * #33255 - [jit] Add type tags to lists/dicts in pickle This adds a global call to `torch.jit._pickle.restore_type_tags` for lists and dicts so that we can preserve their types after serialization. ](https://our.intern.facebook.com/intern/diff/20346780/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/33255 Pulled By: driazati Differential Revision: D20346780 fbshipit-source-id: c8534954ef4adb2e3c880401acbee30cd284f3db	2020-03-10 19:17:01 -07:00
Meghan Lele	903ad90325	[JIT] Introduce a fake Tensor creation node for IR unit tests (#34334 ) Summary: Summary There is often a need to create a Tensor when writing IR by hand for JIT optimisation pass unit tests. The only options for this today are real Tensor creation functions like `aten::ones`. Any test that uses these functions must also use the same default arguments as the Python/C++ API, which means that all of the tests have to be updated when the API is updated. This commit introduces a new primitive, `prim::MakeTestTensor` with schema `() -> Tensor` that should be used in unit tests instead of real Tensor creation functions. This new primitive has no public-facing API, so the maintenance burden is much lower. Testing This commit updates the alias analysis and DCE tests to use `prim::MakeTestTensor` instead of `aten::rand`, `aten::ones`, and `aten::zeros`. ``` $ ./bin/test_jit CUDA not available. Disabling CUDA and MultiCUDA tests Note: Google Test filter = -_CUDA:_MultiCUDA [==========] Running 75 tests from 1 test case. [----------] Global test environment set-up. [----------] 75 tests from JitTest [ RUN ] JitTest.ADFormulas [ OK ] JitTest.ADFormulas (82 ms) [ RUN ] JitTest.Attributes [ OK ] JitTest.Attributes (0 ms) ... ... ... [ RUN ] JitTest.LiteInterpreterPrim [ OK ] JitTest.LiteInterpreterPrim (0 ms) [ RUN ] JitTest.LiteInterpreterLoadOrigJit [ OK ] JitTest.LiteInterpreterLoadOrigJit (2 ms) [----------] 75 tests from JitTest (150 ms total) [----------] Global test environment tear-down [==========] 75 tests from 1 test case ran. (150 ms total) [ PASSED ] 75 tests. ``` Fixes* This pull request fixes https://github.com/pytorch/pytorch/issues/33500. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34334 Differential Revision: D20296437 Pulled By: SplitInfinity fbshipit-source-id: df4e7b0881ae4913424e5a409bfa171a61c3e568	2020-03-10 16:12:45 -07:00
Lingyi Liu	09296c34a4	Add the build for runtime dispatch for AVX, AVX2 instruction set (#26125 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26125 We already had some optimization implementation using AVX2 for improve the quantized kernel performance. In this diff, we want to enable the runtime dispatch. Test Plan: Sandcastle build and test Also test with a python binary calling into vectorized op. torch.__config__.show() PyTorch built with: - GCC 4.2 - clang 8.0.20181009 - Intel(R) Math Kernel Library Version 2017.0.3 Product Build 20170413 for Intel(R) 64 architecture applications - Intel(R) MKL-DNN v0.18.1 (Git Hash N/A) - OpenMP 1 - CPU capability usage: AVX2 - Build settings: Reviewed By: jamesr66a Differential Revision: D17337251 fbshipit-source-id: 8e22d10011a12a4eaf54cea3485353eb1811d828	2020-03-10 15:32:57 -07:00
anjali411	2d24005d18	[C++ API Parity] rmsprop optimizer update (#33450 ) Summary: This PR is BC-breaking in the following way: In RMSpropOptions: 1. learning_rate is renamed to lr. Test plan before 1.5 release: Test that in 1.5 we can load a C++ RMSprop optimizer that was serialized in 1.4, and their states are the same. Pull Request resolved: https://github.com/pytorch/pytorch/pull/33450 Differential Revision: D20366623 Pulled By: anjali411 fbshipit-source-id: 83250be9b583a766927e0e22a4de8b0765379451	2020-03-10 13:30:56 -07:00
Michael Suo	965146b818	[jit] delete netdef converter (#33807 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33807 afaik this is unused, so removing it from the source tree. RIP :( Test Plan: Imported from OSS Differential Revision: D20122118 Pulled By: suo fbshipit-source-id: cb45943f5b9f969482301a2f9fe540326dbc78f2	2020-03-09 22:25:16 -07:00
Will Feng	baeb359e7a	Remove `using namespace torch::autograd` from header files (#34423 ) Summary: This PR prevents leaking symbols from `torch::autograd` namespace to the root namespace. Fixes https://github.com/pytorch/pytorch/issues/34371. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34423 Differential Revision: D20338404 Pulled By: yf225 fbshipit-source-id: e7ff3348193667a0cee5d38f9a003ae36cc704ca	2020-03-09 10:31:21 -07:00
Will Feng	739d4609c3	[C++ API] Fix ModuleList compile error: error: 'begin' was not declared in this scope (#34463 ) Summary: One example in the current docs for `torch::nn::ModuleList` doesn't compile, and this PR fixes it. Fixes https://github.com/pytorch/pytorch/issues/32414. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34463 Test Plan: Imported from GitHub, without a `Test Plan:` line. Differential Revision: D20331120 Pulled By: yf225 fbshipit-source-id: 50bb078fe1a900c9114d5434e92dc40ee13b52bf	2020-03-09 08:15:50 -07:00
James Reed	45a504dd2d	[JIT] Introduce BuiltinOpFunction and integrate into torchbind (#34098 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34098 * #33900 [JIT] Move stuff out of class_type.cpp Test Plan: Imported from OSS Differential Revision: D20229166 Pulled By: jamesr66a fbshipit-source-id: d658a63a5d6e372e675f35b8456adc8de82b49f3	2020-03-07 10:03:56 -08:00
Leah Dickstein	c5e822b7bb	Back out "[jit] Add type tags to lists/dicts in pickle" (#34406 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34406 Pull Request resolved: https://github.com/pytorch/pytorch/pull/34405 Original commit changeset: 2f1826e6679a Test Plan: reverting, see S197156 Reviewed By: akyrola, volkhin Differential Revision: D20317456 fbshipit-source-id: 89298a9c022edba1d54bcdc7541804cb919e33f5	2020-03-06 20:02:16 -08:00
Jeremy Lilley	5f641f93f1	[aten] Don't deadlock in IValue::Future impl, tests. (#34099 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34099 This change effectively applies into IValue's future impl a few fixes we discovered when using the torch::utils::Future<T> impl. The parallel impls should probably eventually be merged, but until then: - Don't hold the lock when invoking the callbacks. This makes it effectively impossible (deadlocks) to call value() to get the value from inside the callback. - We discovered that it was slightly cleaner in practice to notify condition variables prior to invoking callbacks (best to unblock paused threads ASAP, before spawning new work). - Fix some var naming inconsistency. - Add a some caffe2 cpp test coverage. ghstack-source-id: 99336569 Test Plan: ``` buck test mode/dev //caffe2/test/cpp/jit:jit -- 'JitTest\.IValueFuture' ``` Differential Revision: D20203278 fbshipit-source-id: 6e805ba547899dab9aab458e4b23049db31f930e	2020-03-06 12:34:50 -08:00
Will Feng	415595ace4	[C++ API] Remove init-list form of at::indexing::Slice (#34255 ) Summary: The init-list form of `at::indexing::Slice` (i.e. `tensor.index({{1, None, 2}, ...})` instead of `tensor.index({Slice(1, None, 2), ...})`) in C++ API can be easily confused with the list-form indexing in Python API (e.g. `tensor[[1, 3, 2], ...]`), which is not good from readability perspective. This PR removes the init-list form of `at::indexing::Slice` to make the API less confusing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/34255 Test Plan: Imported from GitHub, without a `Test Plan:` line. Differential Revision: D20290166 Pulled By: yf225 fbshipit-source-id: abbcbeca0b179219e5e1f196a33ef8aec87ebb76	2020-03-06 05:51:53 -08:00
meganset	b8fd88319a	C++ make torch::nn::Sequential push_back(AnyModule) methods public (#34208 ) Summary: Issue https://github.com/pytorch/pytorch/issues/33192 Moves Sequential::push_back methods with AnyModule from private -> public Allows adding an existing AnyModule via something like: ``` torch::nn::Sequential q; auto a=torch::nn::AnyModule(torch::nn::Linear(1,2)); q->push_back(a); q->push_back("fc",a); ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/34208 Differential Revision: D20300278 Pulled By: yf225 fbshipit-source-id: 4525319bb7fb6667e43a006c9f446a2193781005	2020-03-06 05:47:14 -08:00
Ilia Cherniavskii	b50825e011	Make RecordFunction more robust for async use cases (#34122 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34122 Earlier work added support for async rpc cases when RecordFunction's end callbacks might be called in a different thread; in addition some extra care was needed to handle pointer to parent function; This PR makes RecordFunction aware of potentially multiple threads in use, as well as removes unused parent() call and restricts current() RecordFunction to scope-based record functions (RECORD_FUNCTION macro) Test Plan: unit tests Differential Revision: D20297709 Pulled By: ilia-cher fbshipit-source-id: 46a59e1b2eea0bbd8a59630385e193b38d30f9d1	2020-03-05 22:28:53 -08:00
anjali411	76035f050b	[C++ API Parity] Adam: updated step and class design (#33730 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33730 Differential Revision: D20292073 Pulled By: anjali411 fbshipit-source-id: a7b4a70f29027ab355aebb91873ea55d5cb51783	2020-03-05 19:15:24 -08:00
Meghan Lele	5500c3de0a	Revert D20150304: [pytorch][PR] [JIT] Introduce a fake Tensor creation node for IR unit tests Test Plan: revert-hammer Differential Revision: D20150304 Original commit changeset: c88f5289055a fbshipit-source-id: 14ac0e46145e9fb4f200c6318b63edd541380aeb	2020-03-05 16:25:08 -08:00
Martin Yuan	ccf4d69b75	[Lite Interpreter] Enable __setstate__ (#33294 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33294 1. Serialize bytecode of __setstate__ and run it when loading the model. 2. One use case is quantization. To test this use case a few operators are registered temporarily for lite interpreter. The "_" prefix registration will be removed when the operators are all migrated to mobile. Test Plan: Imported from OSS Differential Revision: D20162898 Pulled By: iseeyuan fbshipit-source-id: 7a3180807bf38fbce594d86993896861f12bb58c	2020-03-05 15:24:21 -08:00
Meghan Lele	9ce833879f	[JIT] Introduce a fake Tensor creation node for IR unit tests (#33914 ) Summary: Summary There is often a need to create a Tensor when writing IR by hand for JIT optimisation pass unit tests. The only options for this today are real Tensor creation functions like `aten::ones`. Any test that uses these functions must also use the same default arguments as the Python/C++ API, which means that all of the tests have to be updated when the API is updated. This commit introduces a new primitive, `prim::MakeTestTensor` with schema `() -> Tensor` that should be used in unit tests instead of real Tensor creation functions. This new primitive has no public-facing API, so the maintenance burden is much lower. Testing This commit updates the alias analysis and DCE tests to use `prim::MakeTestTensor` instead of `aten::rand`, `aten::ones`, and `aten::zeros`. ``` $ ./bin/test_jit CUDA not available. Disabling CUDA and MultiCUDA tests Note: Google Test filter = -_CUDA:_MultiCUDA [==========] Running 75 tests from 1 test case. [----------] Global test environment set-up. [----------] 75 tests from JitTest [ RUN ] JitTest.ADFormulas [ OK ] JitTest.ADFormulas (82 ms) [ RUN ] JitTest.Attributes [ OK ] JitTest.Attributes (0 ms) ... ... ... [ RUN ] JitTest.LiteInterpreterPrim [ OK ] JitTest.LiteInterpreterPrim (0 ms) [ RUN ] JitTest.LiteInterpreterLoadOrigJit [ OK ] JitTest.LiteInterpreterLoadOrigJit (2 ms) [----------] 75 tests from JitTest (150 ms total) [----------] Global test environment tear-down [==========] 75 tests from 1 test case ran. (150 ms total) [ PASSED ] 75 tests. ``` Fixes* This pull request fixes https://github.com/pytorch/pytorch/issues/33500. Pull Request resolved: https://github.com/pytorch/pytorch/pull/33914 Differential Revision: D20150304 Pulled By: SplitInfinity fbshipit-source-id: c88f5289055a02dc20b7a5dcdf87469f9816d020	2020-03-05 12:42:42 -08:00
Martin Yuan	f097ca503d	Add and test training in lite interpreter. (#32359 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32359 Test Plan: Imported from OSS Differential Revision: D19450614 Pulled By: iseeyuan fbshipit-source-id: 6bafff39d7880a5b7fb9cd70c33a4e584812be12	2020-03-03 23:33:43 -08:00
davidriazati	99e211e661	[jit] Add type tags to lists/dicts in pickle (#33255 ) Summary: Stacked PRs * #33474 - [jit] Remove list specializations from pickler * #33255 - [jit] Add type tags to lists/dicts in pickle This adds a global call to `torch.jit._pickle.restore_type_tags` for lists and dicts so that we can preserve their types after serialization. ](https://our.intern.facebook.com/intern/diff/19868637/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/33255 Pulled By: driazati Reviewed By: xman1979, Tianshu-Bao Differential Revision: D19868637 fbshipit-source-id: 2f1826e6679a786ca209198690269f399a542c04	2020-03-03 16:48:21 -08:00
Zachary DeVito	358450e02b	improved TorchScript traceback (#33834 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33834 This changes how we report Tracebacks to make them more clear when there are both serialized and non-serialized ranges. It now looks like: ``` Traceback (most recent call last): File "foo.py", line 25, in <module> s2(a, b) File "/scratch/zdevito/pytorch/torch/nn/modules/module.py", line 550, in __call__ result = self.forward(input, kwargs) RuntimeError: The following operation failed in the TorchScript interpreter. Traceback of TorchScript, serialized code (most recent call last): File "code/__torch__.py", line 7, in forward x: Tensor, y: Tensor) -> Tensor: return (self).bar(x, y, ) ~~~~~~~~~ <--- HERE def bar(self: __torch__.Moo, x: Tensor, File "code/__torch__.py", line 11, in bar x: Tensor, y: Tensor) -> Tensor: _0 = (self).baz(x, y, ) ~~~~~~~~~ <--- HERE _1 = torch.ones([3], dtype=None, layout=None, device=None, pin_memory=None) return torch.add(_0, _1, alpha=1) File "code/__torch__.py", line 17, in baz x: Tensor, y: Tensor) -> Tensor: return torch.add(x, y, alpha=1) ~~~~~~~~~ <--- HERE Traceback of TorchScript, original code (most recent call last): File "foo.py", line 11, in forward def forward(self, x, y): return self.bar(x, y) ~~~~~~~~ <--- HERE File "foo.py", line 9, in bar def bar(self, x, y): return self.baz(x, y) + torch.ones(3) ~~~~~~~~ <--- HERE File "foo.py", line 7, in baz def baz(self, x, y): return x + y ~~~~~ <--- HERE RuntimeError: The size of tensor a (4) must match the size of tensor b (5) at non-singleton dimension 1 ``` It follows Python convension of having the most important information last and reading from the bottom up. Changes: Moved the error message to the end, to copy Python * Report original traceback separate from serialized traceback * Make sure root functions have names in the interpreter trace. Test Plan: Imported from OSS Differential Revision: D20126136 Pulled By: zdevito fbshipit-source-id: fd01f9985e5d74e04c4d064c02e8bc320f4fac13	2020-03-03 12:27:38 -08:00
Wanchao Liang	f909b5535e	[autograd] fix allow_unused checking for C++ API (#34035 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34035 Bug for the conditon check in https://github.com/pytorch/pytorch/pull/24342, realized we don't have tests in either python or cpp to catch this, so added testes for both python and cpp. Thanks hczhu on capturing it! Test Plan: Imported from OSS Differential Revision: D20198837 Pulled By: wanchaol fbshipit-source-id: 33846a14c0a8e7aac2e8328189d10c38a0d7e6ee	2020-03-02 17:57:15 -08:00
Will Feng	1494005cfd	C++ tensor indexing: more indexing tests (#30427 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30427 Test Plan: Imported from OSS Differential Revision: D18695899 Pulled By: yf225 fbshipit-source-id: 74455fe52ef922556fabe65aefca9ec93fe2346d	2020-02-28 22:07:41 -08:00
Martin Yuan	55b44f6746	Throw an exception when method cannot be found from mobile module. (#33972 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33972 Test Plan: Imported from OSS Differential Revision: D20168965 Pulled By: iseeyuan fbshipit-source-id: 2efe5dcb1fb80407cd88a47c50cb382ecd8aa275	2020-02-28 18:28:09 -08:00
Meghan Lele	5eacdfb21f	Revert D20127441: [pytorch][PR] [JIT] Introduce a fake Tensor creation node for IR unit tests Test Plan: revert-hammer Differential Revision: D20127441 Original commit changeset: 56da4f23ac46 fbshipit-source-id: 7d4602e5011bec6f6871eab16af05a3198694e5d	2020-02-27 13:48:31 -08:00
Meghan Lele	390d4d6df3	[JIT] Introduce a fake Tensor creation node for IR unit tests (#33595 ) Summary: Summary There is often a need to create a Tensor when writing IR by hand for JIT optimisation pass unit tests. The only options for this today are real Tensor creation functions like `aten::ones`. Any test that uses these functions must also use the same default arguments as the Python/C++ API, which means that all of the tests have to be updated when the API is updated. This commit introduces a new primitive, `prim::MakeTestTensor` with schema `() -> Tensor` that should be used in unit tests instead of real Tensor creation functions. This new primitive has no public-facing API, so the maintenance burden is much lower. Testing This commit updates the alias analysis and DCE tests to use `prim::MakeTestTensor` instead of `aten::rand`, `aten::ones`, and `aten::zeros`. ``` $ ./bin/test_jit CUDA not available. Disabling CUDA and MultiCUDA tests Note: Google Test filter = -_CUDA:_MultiCUDA [==========] Running 75 tests from 1 test case. [----------] Global test environment set-up. [----------] 75 tests from JitTest [ RUN ] JitTest.ADFormulas [ OK ] JitTest.ADFormulas (82 ms) [ RUN ] JitTest.Attributes [ OK ] JitTest.Attributes (0 ms) ... ... ... [ RUN ] JitTest.LiteInterpreterPrim [ OK ] JitTest.LiteInterpreterPrim (0 ms) [ RUN ] JitTest.LiteInterpreterLoadOrigJit [ OK ] JitTest.LiteInterpreterLoadOrigJit (2 ms) [----------] 75 tests from JitTest (150 ms total) [----------] Global test environment tear-down [==========] 75 tests from 1 test case ran. (150 ms total) [ PASSED ] 75 tests. ``` Fixes* This pull request fixes https://github.com/pytorch/pytorch/issues/33500. Pull Request resolved: https://github.com/pytorch/pytorch/pull/33595 Differential Revision: D20127441 Pulled By: SplitInfinity fbshipit-source-id: 56da4f23ac46335227254f606c6481718108f378	2020-02-27 13:10:20 -08:00
Michael Suo	dbe850af5b	[jit] do the code reorg (#33851 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33851 Rationale and context described in #33828. Script to reproduce the move: https://gist.github.com/suo/16cbefaaeb67ca5a7c6caffd49b7f6e9 ghstack-source-id: 99079645 Test Plan: Make sure CI passes Reviewed By: jamesr66a Differential Revision: D20133869 fbshipit-source-id: 390e9241a9c85366d9005c492ac31f10aa96488e	2020-02-27 13:02:51 -08:00
Omkar Salpekar	24dd800e6a	[Dist Autograd] Functional API for Dist Autograd and Dist Optimizer (#33711 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33711 Fixed #33480 This makes `dist_autograd.backward` and `dist_optimizer.step` functional by making the user explicitly pass in the `context_id` as opposed to relying on the confusing thread_local context_id. This diff incorporates these API changes and all places where these functions are called. More concretely, this code: ``` with dist_autograd.context(): # Forward pass. dist_autograd.backward([loss.sum()]) dist_optim.step() ``` should now be written as follows: ``` with dist_autograd.context() as context_id: # Forward pass. dist_autograd.backward(context_id, [loss.sum()]) dist_optim.step(context_id) ``` Test Plan: Ensuring all existing dist_autograd and dist_optimizer tests pass with the new API. Also added a new test case for input checking. Differential Revision: D20011710 fbshipit-source-id: 216e12207934a2a79c7223332b97c558d89d4d65	2020-02-26 19:08:28 -08:00
Will Feng	5c33d98b0d	Add assert_tensor_equal and assert_tensor_not_equal to test/cpp/api/support.h (#30426 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30426 This PR adds `assert_tensor_equal` and `assert_tensor_not_equal` to `test/cpp/api/support.h`, as better functions for testing whether two tensors are equal / not equal. Test Plan: Imported from OSS Differential Revision: D18695900 Pulled By: yf225 fbshipit-source-id: c19b9bc4c4e84d9f444015023649d27618fcbdf5	2020-02-26 13:25:25 -08:00
Will Feng	0dded4026e	[C++ API] Add PackedSequence / pack_padded_sequence / pad_packed_sequence / pack_sequence (#33652 ) Summary: Most of the function implementation and test code are translated from the Python version. Pull Request resolved: https://github.com/pytorch/pytorch/pull/33652 Differential Revision: D20052211 Pulled By: yf225 fbshipit-source-id: ce6767db54364f91ef4f06674239a12278c2752a	2020-02-25 12:53:41 -08:00

1 2 3 4 5 ...

756 commits