pytorch

mirror of https://github.com/saymrwulf/pytorch.git synced 2026-05-15 21:00:47 +00:00

Author	SHA1	Message	Date
Jerry Zhang	a597c0ca05	Add inplace FeedTensor for python frontend (#14512 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14512 att Reviewed By: dzhulgakov Differential Revision: D13243278 fbshipit-source-id: 78af417d0fcd9b9791ee839d62095903e49205cb	2018-12-04 12:45:11 -08:00
Michael Antonov	773f4d8081	Implements Gather operator for arbitrary axis, sharing the code with BatchGather. (#13756 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13756 This implements general Gather operator for arbitrary axis, sharing the code with BatchGather. - CPU gather & batch gather logic is now shared through caffe2::gather_helper, for any axis. - Shared CUDA kernel moved to gather_op.cuh, for any axis. - Gradients of axis > 0 delegate to BatchGatherGradientOp which now has axis argument. - BatchGatherOp doc strings updated to have correct rank (q + (r -1)) and output. - Added tests for axis == 2. GatherOp supports index wrapping for axis == 0 by default, which was earlier for ONNX. This diff also extends it to work in Cuda kernel. Added "wrap_indices" argument which specifies wheather this wrapping should be done; set it to true if you'd like wrapping for any axis. TBD: Update gradients to support negative indices (separate diff). TBD: Once we have operator versioning, we'd like to update GatherOp to NOT support axis 0 wrapping by default, but rather do it only if wrap_indices is set. Reviewed By: dzhulgakov Differential Revision: D12983815 fbshipit-source-id: 8add9d67b47fe8c5ba7a335f581ca0530b205cd7	2018-12-04 11:54:28 -08:00
Lu Fang	44894915d6	Automatic update of fbcode/onnx to 6b34743d2e361bbc0acb29dd73536478cb92562e (#14637 ) Summary: Previous import was f461f7aad9987635b4aff108620ed7918f002d19 Included changes: - [6b34743](https://github.com/onnx/onnx/commit/6b34743): fix the const map initializatoin (#1662) <Lu Fang> - [ae80999](https://github.com/onnx/onnx/commit/ae80999): Fuse Pad into Conv optimizer (#1580) <vloncar> Pull Request resolved: https://github.com/pytorch/pytorch/pull/14637 Differential Revision: D13281338 Pulled By: houseroad fbshipit-source-id: c31429914bf5954fdc85e0c02168836ef47d635c	2018-12-03 20:11:17 -08:00
Yan Zhu	aeb38cfcea	cuda implementation for PackSegment to support presence mask (#14635 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14635 as title Reviewed By: enosair Differential Revision: D13254097 fbshipit-source-id: b9f40109e2889907c925f9a4df9da14f67f45f38	2018-11-30 16:54:10 -08:00
Lu Fang	2752ad8045	Automatic update of fbcode/onnx to f461f7aad9987635b4aff108620ed7918f002d19 (#14568 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14568 Previous import was 882c5283c54345d131e8fe5c859e4844dcf7ca8e Included changes: - [f461f7a](https://github.com/onnx/onnx/commit/f461f7a): Show the op's type and name when the shape inference is failed. (#1623) <Jerry> - [ab8aaf9](https://github.com/onnx/onnx/commit/ab8aaf9): Add scan test case (#1586) <G. Ramalingam> - [c95357e](https://github.com/onnx/onnx/commit/c95357e): link the tutorial (#1650) <Lu Fang> - [d7e2420](https://github.com/onnx/onnx/commit/d7e2420): Upgrade label encoder to support more input types (#1596) <Wei-Sheng Chin> - [6425108](https://github.com/onnx/onnx/commit/6425108): Add Doc about Adding New Operator into ONNX (#1647) <Lu Fang> - [295889c](https://github.com/onnx/onnx/commit/295889c): use an empty initializer to create map (#1643) <Lu Fang> - [e38f3ec](https://github.com/onnx/onnx/commit/e38f3ec): Remove redundant const (#1639) <daquexian> - [ea694bf](https://github.com/onnx/onnx/commit/ea694bf): implement fuse reduce->unsqueeze + fix assumption in nop_dropout pass (#1565) <Armen> - [6db386e](https://github.com/onnx/onnx/commit/6db386e): make output shape clear enough for Softmax family (#1634) <Lu Fang> - [2b67c6e](https://github.com/onnx/onnx/commit/2b67c6e): fix batchnorm doc (#1633) <Lu Fang> - [c901784](https://github.com/onnx/onnx/commit/c901784): remove inappropriate consts (#1632) <Lu Fang> - [de82119](https://github.com/onnx/onnx/commit/de82119): Shape inference fix for broadcast, concat and scan (#1594) <KeDengMS> - [d7ffe3b](https://github.com/onnx/onnx/commit/d7ffe3b): Update Optimizer Docs (#1607) <Armen> - [d09d139](https://github.com/onnx/onnx/commit/d09d139): mark PROTOBUF_INCLUDE_DIRS as BUILD_INTERFACE (#1466) <Yuta Okamoto> - [eb4b7c2](https://github.com/onnx/onnx/commit/eb4b7c2): allow variadic parameters of different types (#1615) <G. Ramalingam> - [4166246](https://github.com/onnx/onnx/commit/4166246): Fix onnxifi test (#1617) <Yinghai Lu> - [6706a4d](https://github.com/onnx/onnx/commit/6706a4d): Fix a bug in vector address access (#1598) <Raymond Yang> - [ae39866](https://github.com/onnx/onnx/commit/ae39866): Separate types of inputs 1 and 2 in OneHot op. (#1610) <Spandan Tiwari> - [45ba661](https://github.com/onnx/onnx/commit/45ba661): Handle new types in the switch. (#1608) <Dmitri Smirnov> - [14853b6](https://github.com/onnx/onnx/commit/14853b6): Bump docker image version to 230 used in CircleCI (#1606) <bddppq> - [e0993b8](https://github.com/onnx/onnx/commit/e0993b8): [onnxifi] Make sure that backend handles run async. (#1599) <Roman Dzhabarov> - [e6965cc](https://github.com/onnx/onnx/commit/e6965cc): Introduce SparseTensor ML proto (#1554) <Dmitri Smirnov> - [75b782f](https://github.com/onnx/onnx/commit/75b782f): In driver test check the return status of onnxGetBackendIDs (#1597) <bddppq> - [c05b364](https://github.com/onnx/onnx/commit/c05b364): Make CI log less verbose (#1595) <bddppq> - [fa568e4](https://github.com/onnx/onnx/commit/fa568e4): Loop type shape inferencing (#1591) <Scott McKay> - [937e64c](https://github.com/onnx/onnx/commit/937e64c): add uint8 (#1590) <Lu Fang> - [f86e951](https://github.com/onnx/onnx/commit/f86e951): Add domain as an optional parameter for make_node function (#1588) <Young Kim> - [ff45588](https://github.com/onnx/onnx/commit/ff45588): Remove unreachable code in shape_inference.h (#1585) <Changming Sun> - [f7dcad0](https://github.com/onnx/onnx/commit/f7dcad0): Add several hyperbolic function ops. (#1499) <Sergii Dymchenko> - [a60ac7d](https://github.com/onnx/onnx/commit/a60ac7d): Add OneHot op to ONNX. (#1567) <Spandan Tiwari> - [f6c3a7e](https://github.com/onnx/onnx/commit/f6c3a7e): [compiler flag] Issue a warning if class has virtual method but missing virtual dtor. (#1583) <Roman Dzhabarov> - [88d1784](https://github.com/onnx/onnx/commit/88d1784): Fix MaxUnpool shape inference when output_shape is provided as input (#1578) <Spandan Tiwari> - [20041b7](https://github.com/onnx/onnx/commit/20041b7): Add type shape inferencing for the If operator (#1571) <Scott McKay> - [d6c4c75](https://github.com/onnx/onnx/commit/d6c4c75): Add a virtual destructor to GraphInferencer (#1574) <Changming Sun> - [a339598](https://github.com/onnx/onnx/commit/a339598): fix ConvTranspose spec (#1566) <Wenhao Hu> Reviewed By: zrphercule Differential Revision: D13263831 fbshipit-source-id: a2ff22c6454e2430429e5a7d18d21661a7ffb0cb	2018-11-29 16:31:56 -08:00
rohithkrn	0d663cec30	Unify cuda and hip device types in Caffe2 python front end (#14221 ) Summary: Goal of this PR is to unify cuda and hip device types in caffe2 python front end. Pull Request resolved: https://github.com/pytorch/pytorch/pull/14221 Differential Revision: D13148564 Pulled By: bddppq fbshipit-source-id: ef9bd2c7d238200165f217097ac5727e686d887b	2018-11-29 14:00:16 -08:00
Dmytro Dzhulgakov	0cfbbceac3	Change Tensor::CopyFrom to a simple double dispatch (#14268 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14268 Removes the need for Context in Tensor by doing simple dispatch for CopyBytes. It'd eventually be subsumed by Roy Li's changes of proper copy_ op, but before that is done, let's get a clear logic of how copies are implemented and clean up some craft in CopyFrom implementation. Note, that with these changes, one can probably can get rid of Context::CopyFromCPU/CopyToCPU, but it's a matter for follow up diffs. This diff doesn't change the API of Tensor yet, but relies on the fact that passing `Context` to CopyFrom makes copy async if the device is CUDA and doesn't have any effect otherwise (that's how Context methods are implemented). This doesn't change semantics of copy async implementation - as before it blindly calls cudaMemcpyAsync which probably means that it can be misused if invoked separately outside of operator body. I'll leave it for the follow up copy_ unification. For Extend() we always do async copy - it makes sense as it's an in-place device-device operation and only any further op would be observable. Note: there are now three ways of invoking copy in C2 code - templated CopyBytes, virtual CopyFromCPU/etc, and double-dispatch free method here. Hopefully we can get rid of the second one. Also, please advise whether it's c10-worthy :) Reviewed By: ezyang Differential Revision: D13117987 fbshipit-source-id: a6772d6dcf3effaf06717da3a656fc9873b310b5	2018-11-28 15:45:37 -08:00
Jiyan Yang	a2fcd4dee5	Ensure FP16 rowwise Adagrad can be run Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/12317 Reviewed By: hyuen Differential Revision: D10190778 fbshipit-source-id: 720a9aaa4e6b1736023d8c6326a613e4ea592b31	2018-11-28 02:15:36 -08:00
Jiyan Yang	0199d59d3a	Resubmit: Set the correct engine name for position weighted pooling when fp16 is used for training Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13768 Reviewed By: xianjiec Differential Revision: D12996103 fbshipit-source-id: 5ca4cda4210f68ece2b5d6eced8cf52ee91fb36f	2018-11-27 14:51:56 -08:00
Hassan Eslami	e392d428b1	Allowing TaskGroups to carry remote nets (#14342 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14342 Sometimes, when we are creating a TaskGroup, we are in fact creating a TaskGroup for a distributed job. In some cases, we may want to register a few nets as "remote" to a TaskGroup. The remote net should have sufficient attributes on where they should be executed later on. This diff adds the remote net attribute to the TaskGroup class. It exposes two minimal functionalities: adding a remote net, and getting all remote nets added to a TaskGroup. Reviewed By: d4l3k Differential Revision: D13188320 fbshipit-source-id: efe947aec30817e9512a5e18be985713b9356bdc	2018-11-27 13:34:11 -08:00
Kevin Chen	b18063b39a	Fix caffe2 => onnx exporter for ConvTranspose (#14143 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14143 ConvTranspose has a per-operator attribute rename, which meant that the global attribute rename for kernels => kernel_shape was not applied. Changing the behavior so that the global renames always apply, but per-op renames can override those for specific attributes. Note: The python frontend path isn't actually used for ConvTranspose, but I thought it would be good to make it consistent. Reviewed By: yinghai Differential Revision: D13113395 fbshipit-source-id: cd3f124b4b5c753a506d297138b7d002b51bfb38	2018-11-26 15:51:42 -08:00
Jerry Zhang	735cd06536	FeedTensor returns a Tensor (#14196 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14196 Pull Request resolved: https://github.com/pytorch/pytorch/pull/13641 FeedTensor function used to take a pointer to Tensor and feed the content using Resize and mutable_data, but since Tensor is a pointer now, we can just return a Tensor instead. Reviewed By: dzhulgakov Differential Revision: D13091163 fbshipit-source-id: 9abf2fd320baca76e050530c500dd29f8e2d0211	2018-11-26 13:05:44 -08:00
Huan Gui	60e7d04961	Add Recency Weighted into SparseLookup (#14291 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14291 Add RecencyWeighted into SparseLookup. Reviewed By: Wakeupbuddy Differential Revision: D13147738 fbshipit-source-id: de5dc3aaee8ce7d41c6d30d2ff47e9786a7fa4da	2018-11-24 02:43:31 -08:00
Yinghai Lu	f79fb58744	Make sure we bind input/output of Onnxifi op positionally (#14214 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14214 This is to pick up the residual task of T36325466 to make sure that input/output binding of c2 Onnxifi op is positional. Reviewed By: dzhulgakov Differential Revision: D13134470 fbshipit-source-id: d1b916dade65c79133b86507cd54ea5166fa6810	2018-11-22 00:31:01 -08:00
Gu, Jinghui	60963c2ecb	Add "axis" and "axis_w" arguments in FC to support customized axix to reduce dim. (#12971 ) Summary: Add "axis" and "axis_w" arguments in FC to support customized axix to reduce dim. Pull Request resolved: https://github.com/pytorch/pytorch/pull/12971 Reviewed By: bddppq Differential Revision: D12850675 Pulled By: yinghai fbshipit-source-id: f1cde163201bd7add53b8475329db1f038a73019	2018-11-21 15:44:50 -08:00
Hui Wu	acd7811e33	Add sigmoid op based on MKL-DNN Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13097 Differential Revision: D13105366 Pulled By: yinghai fbshipit-source-id: d156e8fd519baeecf61c25dcd8fa2c2fa7351ef4	2018-11-19 22:56:35 -08:00
zrphercule	03a02b6fd5	Fix a bug in test case of onnx::If Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14209 Differential Revision: D13132607 Pulled By: zrphercule fbshipit-source-id: b7f7ccc6a6cbdeb57a7f88a1971d15dd81e6fc81	2018-11-19 18:46:21 -08:00
Junjie Bai	0d7a986da1	Change hip filename extension to .hip (#14036 ) Summary: xw285cornell - To make hip files to have unique filename extension we change hip files from _hip.cc to .hip (it's the only blessing option other than .cu in hipcc `3d51a1fb01/bin/hipcc (L552)`). - Change to use host compiler to compile .cc\|.cpp files. Previously we use hcc to compile them which is unnecessary - Change the hipify script to not replace "gpu" with "hip" in the filename of the generated hipified files. Previously we do this because hcc has a bug when linking files that have same filename. We have now changed to use host linker to do linking so this is unnecessary anymore. Pull Request resolved: https://github.com/pytorch/pytorch/pull/14036 Reviewed By: xw285cornell Differential Revision: D13091813 Pulled By: bddppq fbshipit-source-id: ea3d887751d8abb39d75f5d5104aa66ce66b9ee0	2018-11-16 11:55:59 -08:00
Duc Ngo	c7a247facf	nomnigraph - support subgraph visualization (#13795 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13795 Add ability for dot string generation for a single subgraph and python bindings (which is pretty useful for model exploration in Python) Restructure DotGenerator class a bit to make it easy to implement this feature Reviewed By: bwasti Differential Revision: D13010512 fbshipit-source-id: 825665438394b7e6968ab6da167b477af82a7b62	2018-11-16 08:19:20 -08:00
Duc Ngo	d7b95dda51	nomnigraph - easy - expose hasProduce(NodeRef) to python (#14075 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14075 Expose hasProduce(NodeRef) to python Reviewed By: bwasti Differential Revision: D13092930 fbshipit-source-id: f1ec06e73e0f5f6a16ad0cbb7d2e3e499a861d8e	2018-11-16 08:19:18 -08:00
Duc Ngo	e7f5fceb99	nomnigraph - easy - expose inducesEdges and addNode to python's NNSubgraph (#14074 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14074 expose inducesEdges and addNode to python's NNSubgraph. This make it easy to manually construct a NNSubgraph in python Reviewed By: bwasti Differential Revision: D13092885 fbshipit-source-id: a94ed0b318162e27e3a4b5a4954eb6d169da7405	2018-11-16 08:19:16 -08:00
Parth Raichura	3808e9fad3	Caffe2: Fix for creating entries of external_input in predic_net (#12979 ) Summary: Currently after performing export it gives two entries of externel_input of input data in predict_net proto because it extends the externel_input twice once seperately using input blob and one it is extendind all the entries of external_input from proto in which input blob is already included Signed-off-by: Parth Raichura <parth.raichura@softnautics.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/12979 Differential Revision: D12916349 Pulled By: soumith fbshipit-source-id: 4d4a1c68c0936f8de3f4e380aea1393fe193cd2d	2018-11-15 22:33:50 -08:00
Matthew Brandyberry	c5afad5579	Fix skip logic in caffe_translator_test.py (#13627 ) Summary: Avoid false failure by checking for the presence of the test data in setup. Pull Request resolved: https://github.com/pytorch/pytorch/pull/13627 Differential Revision: D13090324 Pulled By: ezyang fbshipit-source-id: e85571943d168c0007212d7b1a5b99ffa0c39235	2018-11-15 16:45:49 -08:00
Ilia Cherniavskii	0e93500841	Remove async_polling (#13825 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13825 async_polling was an intermediate step towards async_scheduling and is not used Reviewed By: yinghai Differential Revision: D13019059 fbshipit-source-id: eee6ba53e7f476ddb481afba3bf1768303864d32	2018-11-15 16:23:15 -08:00
Edward Yang	3fbb753512	Revert D12873145: [pt1][tensor][refactor] FeedTensor returns a Tensor Differential Revision: D12873145 Original commit changeset: 653735c20d61 fbshipit-source-id: aa6e40a6a24c6f90acbe87b32b3be0020e2584f8	2018-11-15 14:52:46 -08:00
Yan Zhu	2356c8d542	device inference for Adam (#13990 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13990 to make sure ITER blob lives on CPU. Reviewed By: xianjiec Differential Revision: D13056070 fbshipit-source-id: 148edbf745e50e886da3eb99d4e485d11c1924e2	2018-11-14 17:21:08 -08:00
Ashish	f4e502a8c5	Added MIOpen conv transpose op (#13938 ) Summary: This pull request contains changes for: 1. Removing ConvTranspose related changes from caffe2/operators/hip/conv_op_miopen.cc 2. Adding the file caffe2/operators/hip/conv_transpose_op_miopen.cc 3. Modifying the tests to run convTranspose op using MIOpen engine Differential Revision: D13055099 Pulled By: bddppq fbshipit-source-id: ca284f8f9a073005b22013c375cc958257815865	2018-11-13 21:01:52 -08:00
Shuting Wang	23e19ebfa7	add non expotential emphasis loss to Lambdarank Summary: Currently Lambdarank applies exponential emphasis on relevance, i.e., g=2^rel when calculating dcg, this diff adds options that supports g=rel in the loss function. Reviewed By: itomatik Differential Revision: D9891514 fbshipit-source-id: 64730d467a665670edd37e6dc1c077987991d1a8	2018-11-13 14:54:04 -08:00
Jerry Zhang	266bb8bf30	FeedTensor returns a Tensor (#13641 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13641 FeedTensor function used to take a pointer to Tensor and feed the content using Resize and mutable_data, but since Tensor is a pointer now, we can just return a Tensor instead. Reviewed By: ezyang Differential Revision: D12873145 fbshipit-source-id: 653735c20d611ff6ac9e380d8b3c721cb396a28f	2018-11-13 10:50:32 -08:00
Yinghai Lu	a7eee0a1e9	Add Reshape if there is add_axis when exporting C2 concat (#13798 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13798 The semantics of C2 and ONNX Concat is a bit different. C2 concat accepts "add_axis" arg and will raise the dim if so. It's equivalent of attaching a Reshape after plain concat in ONNX. Reviewed By: rdzhabarov Differential Revision: D13012867 fbshipit-source-id: da23e555bae709fd2a373b04dcb9db4e984ae315	2018-11-12 22:27:49 -08:00
Jesse Hellemn	1600649792	Fix for nightly builds (#13779 ) Summary: Being tested on nightlies manually. Pull Request resolved: https://github.com/pytorch/pytorch/pull/13779 Reviewed By: yinghai Differential Revision: D13001930 Pulled By: pjh5 fbshipit-source-id: 954eaabe052914b7b23c74e922666bf9dbfb630a	2018-11-12 16:38:14 -08:00
Bram Wasti	b052fe6c2f	Upgrade DLPack Summary: Needed to use TVM Reviewed By: ajtulloch Differential Revision: D12994038 fbshipit-source-id: f0b6c48a43a87fac37fcef73b78026d8384cd022	2018-11-12 15:59:46 -08:00
Bram Wasti	8480fe0105	Fix up creation of unique data nodes Summary: There was a bug in the uniqueness check that only made the first run unique Reviewed By: duc0 Differential Revision: D13013504 fbshipit-source-id: ecf7526d0fafd7968f1301734123f93968efef46	2018-11-12 15:37:08 -08:00
Yan Zhu	003f97cefa	fc layer accept axis argument (#13822 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13822 as title Reviewed By: xianjiec Differential Revision: D12996338 fbshipit-source-id: 1aa61e71e2d79535325ea7034c82e1cb6bf3a9f6	2018-11-11 13:44:57 -08:00
Yinghai Lu	d97ac82bf5	Back out "Revert D12967258: Support more data types in ONNXIFI transform" (#13812 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13812 Original commit changeset: 2cf95bdc5ed8 Looks like in iOS, `uint64_t` is not the same as `size_t`. :( Fixed it here. Reviewed By: houseroad Differential Revision: D13017390 fbshipit-source-id: d33854ce341225aba372fb945c3704edc14f9411	2018-11-10 20:00:34 -08:00
Soumith Chintala	7c02f285dc	Revert D12967258: Support more data types in ONNXIFI transform Differential Revision: D12967258 Original commit changeset: 688076e6f504 fbshipit-source-id: 2cf95bdc5ed8f1e13646bc5cf8139bdc516861d7	2018-11-10 12:34:31 -08:00
Yinghai Lu	5923d76f96	Support more data types in ONNXIFI transform (#13745 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13745 We need to support types beside `int64` and `float`. Reviewed By: bddppq, rdzhabarov Differential Revision: D12967258 fbshipit-source-id: 688076e6f504b2bf24bba89714df87a678c5638a	2018-11-10 10:41:01 -08:00
Yan Shang	c85463fc74	Allow Gather to handle empty data (#13781 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13781 allow Gather Op to handle empty data. Reviewed By: intermilan Differential Revision: D13001267 fbshipit-source-id: 633c8471b637c56be8f6574f9bf9430785073977	2018-11-10 10:00:47 -08:00
Ansha Yu	e3e6ca1102	operator serialized test coverage summary document (#13703 ) Summary: Add a markdown document summarizing the coverage of serialized operator tests. This currently only takes into account what has been covered by the tests with respect to the entire registry of c2 operators. Next, we will break down the coverage by which operators have unit tests associated with them, which have hypothesis tests, and which have tests more specifically calling assertReferenceChecks. Pull Request resolved: https://github.com/pytorch/pytorch/pull/13703 Reviewed By: dzhulgakov Differential Revision: D12970810 Pulled By: ajyu fbshipit-source-id: 4f0cd057b1cf734371333e24d26cbab630a170e1	2018-11-09 15:04:08 -08:00
Gu, Jinghui	d01cb70497	build with mkl-dnn by default (#13303 ) Summary: build with mkl-dnn by default Pull Request resolved: https://github.com/pytorch/pytorch/pull/13303 Reviewed By: yinghai Differential Revision: D12979633 Pulled By: orionr fbshipit-source-id: 00d23fa27c0d13e82f7e5acb3ebd00ed7ba1d5dc	2018-11-08 11:18:27 -08:00
Yinghai Lu	8581d3ec67	Allow blacklist ops in onnxifi transform Differential Revision: D12945523 fbshipit-source-id: cf5055652591bd1dd8d4be92b7fd6a40a0764536	2018-11-08 09:59:03 -08:00
Xiaoqiang Zheng	de41d1ae0b	Enable junk fill for the default CPU allocator (#13377 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13377 * Enable junk fill for the default CPU allocator. The first diff only enables this for the tests. A second diff will change the default of zero-fill to false. * Fix tests to use 64-bit counters that IterOp and LearningRateOp demands. * Fix kernels that uses uninitialized memory. Reviewed By: salexspb Differential Revision: D10866512 fbshipit-source-id: 17860e77e63a203edf46d0da0335608f77884821	2018-11-08 00:02:37 -08:00
François Garillot	edd2e38023	Clean up a couple of items in the C2 test scaffolding (WIP) (#7847 ) Summary: - Py3 compatibility - utility functions refactoring Pull Request resolved: https://github.com/pytorch/pytorch/pull/7847 Reviewed By: pietern Differential Revision: D9355096 Pulled By: huitseeker fbshipit-source-id: 8e78faa937488c5299714f78075d7cadb1b2490c	2018-11-07 09:16:13 -08:00
Jerry Zhang	508f676c50	Rename ndim() -> dim() - 5/6 Summary: Codemod generated with clangr shard mode, 50 files per diff, clangr code(ndim()->dim()): diffusion/FBS/browse/master/fbcode/caffe2/caffe2/fb/codemods/TensorMethodRename.cpp Reviewed By: salexspb Differential Revision: D12935787 fbshipit-source-id: 303d71d3eb050789af2ab9575e5dcc48f6037086	2018-11-06 16:38:35 -08:00
Pradeep Dorairaj	76c1b5cd79	Fix overflow error in stats_put_ops Summary: I was hitting this error: caffe2/caffe2/operators/stats_put_ops.h:66:25: runtime error: 9.22337e+18 is outside the range of representable values of type 'long' So, the assignment from int64_t to float loses some precision and because of that we overflow. Reproduced this issue with this diff D12945013 Reviewed By: mlappelbaum, jdshi-fb Differential Revision: D12927086 fbshipit-source-id: 7eae7fe25ab49d5ac15279335bd5b1fa89d6e683	2018-11-06 15:41:51 -08:00
Junjie Bai	95ca66763d	Add math functions overloaded over different numeric types for cuda and hip (#13602 ) Summary: petrex ashishfarmer rohithkrn iotamudelta Pull Request resolved: https://github.com/pytorch/pytorch/pull/13602 Reviewed By: dzhulgakov Differential Revision: D12935797 Pulled By: bddppq fbshipit-source-id: a49ec66fb60bfd947c63dd2133d431884df62235	2018-11-06 01:40:31 -08:00
Hong Li	d03c6ba50d	Adding Fetching Real number representation Summary: Adding Fetching Real number representation for int8 tensor in workpace.py Reviewed By: harouwu Differential Revision: D12936556 fbshipit-source-id: f8756a37bce21c93d44d52faf5da9c9bd6473f4a	2018-11-05 23:35:24 -08:00
zrphercule	02d3787a19	Support new upsample in symbolic, caffe2 backend & caffe2 frontend (#13272 ) Summary: We updated the description of upsample_op in onnx: https://github.com/onnx/onnx/pull/1467 Therefore, we need to support the new upsample_op in caffe2-onnx backend as well. Pull Request resolved: https://github.com/pytorch/pytorch/pull/13272 Reviewed By: houseroad Differential Revision: D12833656 Pulled By: zrphercule fbshipit-source-id: 21af5282abaae12d2d044e4018a2b152aff79917	2018-11-05 19:13:57 -08:00
Jongsoo Park	54e8623d26	3D Conv in NHWC layout (#12733 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/12733 Conv in NHWC layout only works for 2D images. This has been a pain point when implementing quantized 3D convolution because we need NHWC layout for best performance (note that NHWC layout in general gives better performance in CPU not just for quantized operators). For example, our quantized ops have a functionality to measure quantized error operator by operator but this needs running a shadow fp32 operator, but this is not easy when there's no 3D conv in NHWC layout is available (currently we're doing layout conversion on the fly for the shadow fp32 operator which is error prone). Some of Caffe2 frameworks like brew generates error when we try to create a 3D conv op in NHWC layout. This was also a blocker for using aibench because aibench is using brew. i-am-not-moving-c2-to-c10 Reviewed By: houseroad Differential Revision: D10333829 fbshipit-source-id: 2d203ee1db833cd3f9d39353219e3894b46c4389	2018-11-04 21:50:09 -08:00
Jongsoo Park	8be0efaa8c	omit group conv NHWC test for HIP (#13554 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13554 D10233252 broke ROCM test. We don't have group conv in NHWC for hip yet and this diff omits related tests. Reviewed By: hyuen Differential Revision: D12917880 fbshipit-source-id: 9baf36a8cb061ee8cf393b2c438a2d1460ce5cd8	2018-11-03 21:18:23 -07:00

1 2 3 4 5 ...

2194 commits