Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10651
EnsureCPUOutputOp will copy the input from another Context to CPU, but currently there is no guarantee that the Copy will be executed.
Differential Revision: D9390046
fbshipit-source-id: af3ff19cf46560264cb77d2ab8821f0cc5be74f6
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10551
Renaming "subtree" -> "subgraph" to improve the clarity of the subgraph matcher APIs, since the matcher now supports DAGs.
This is a pure rename; no functionality changes.
Reviewed By: bwasti
Differential Revision: D9348311
fbshipit-source-id: 4b9267845950f3029dfe385ce3257d3abb8bdad4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10549
Support DAG matching in nomnigraph. This is done by maintaining a map from nodes in the MatchGraph to nodes in the input graph, and additionally enforcing that the same node in the MatchGraph must always match the same node in the input graph (with the exception of multiplicity, i.e. when count != 1 on the MatchGraph node).
In a follow-up diff, I'll rename the API's references to "subtree" to "subgraph" to improve clarity.
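The correspondence-map invariant described above can be sketched in pure Python (hypothetical names; the actual matcher is C++ in nomnigraph):

```python
def try_bind(mapping, match_node, input_node):
    """Bind match_node -> input_node in the correspondence map.

    Succeeds if match_node is unbound or already bound to the same
    input-graph node; a conflicting re-bind is rejected, which is what
    lets the matcher handle DAGs (shared nodes) correctly.
    """
    bound = mapping.get(match_node)
    if bound is None:
        mapping[match_node] = input_node
        return True
    return bound == input_node

mapping = {}
print(try_bind(mapping, "m1", "a"))  # first binding succeeds: True
print(try_bind(mapping, "m1", "a"))  # consistent re-bind is fine: True
print(try_bind(mapping, "m1", "b"))  # conflicting bind is rejected: False
```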
Reviewed By: bwasti
Differential Revision: D9347322
fbshipit-source-id: 171491b98c76852240a253279c2654e96dd12632
Summary:
Some more `ATEN_API` additions for hidden visibility.
Running CI tests to see what fails to link.
cc Yangqing mingzhe09088 ezyang
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10624
Reviewed By: mingzhe09088
Differential Revision: D9392728
Pulled By: orionr
fbshipit-source-id: e0f0861496b12c9a4e40c10b6e0c9e0df18e8726
Summary:
Minor fix for the cuDNN cache. Previously, when an RNN function was called on GPU 0 and then called on GPU 1 in eval mode, we would skip re-initializing the event for GPU 1, causing an "incorrect resource handle" error when trying to record the event.
soumith
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10662
Reviewed By: soumith
Differential Revision: D9393629
Pulled By: apaszke
fbshipit-source-id: e64c1c1d2860e80f5a7ba727d0b01aeb5f762d90
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/9888
Limiter cannot be shared or copied; just pass it to the first reader.
Reviewed By: xianjiec
Differential Revision: D9008871
fbshipit-source-id: e20cd785b26b1844e156efc3833ca77cfc3ffe82
Summary:
Trigonometry functions were recently added to ONNX in https://github.com/onnx/onnx/pull/869
This PR makes PyTorch support exporting graphs with trigonometry functions.
This PR might need to wait until it is ready to change
```python
_onnx_opset_version = 6
```
to
```python
_onnx_opset_version = 7
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/7540
Differential Revision: D9395041
Pulled By: bddppq
fbshipit-source-id: bdf3e9d212b911c8c4eacf5a0753bb092e4748d2
Summary:
There is no reason a user should need an extra import to use DistributedSampler.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10671
Differential Revision: D9395189
Pulled By: SsnL
fbshipit-source-id: 8f41d93813c8fb52fe012f76980c6a261a8db9b2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10478
- Removed Backend constructor from Device, and fixed all
use-sites to use DeviceType::CPU instead of kCPU, or
use a new function backendToDeviceType to perform
the conversion.
- New method device_type() on Type; it gives you the
underlying device type, e.g., CPU for SparseCPU.
- We add backward compatibility for kCPU/kCUDA uses,
by introducing a new special type which is implicitly
convertible to both DeviceType and Backend. As long as
you don't define a function that's overloaded on both
DeviceType and Backend (but not on BackendOrDeviceType),
the implicit conversions will ensure that uses
of at::Device(at::kCPU) keep working. We fixed use-sites in
the library, but did NOT fix sites in the test code, so that
we can exercise this BC code.
Reviewed By: Yangqing
Differential Revision: D9301861
fbshipit-source-id: 9a9d88620500715c7b37e655b4fd761f6dd72716
Summary:
... to avoid slow at::chunk (it is slow due to tensor initialization). Picking up from #10026
This is done through the following:
1) Absorb starting chunks into FusionGroup as a part of the graph fuser
pass.
2) When compiling a kernel, emit a `std::vector<ConcatDesc>` that describes if an input (of the original graph) will be chunked.
3) When launching a kernel, use the `std::vector<ConcatDesc>` to chunk an
input tensor on the CPU. This chunk directly takes an at::Tensor and creates
four TensorInfo structs in-place in the argument list, bypassing the creation of intermediate Tensors.
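The key trick in steps 2)-3) is describing each chunk as an (offset, shape, strides) view rather than materializing intermediate tensors. A pure-Python sketch (hypothetical helper; the real ConcatDesc/TensorInfo structs are C++):

```python
def chunk_views(shape, strides, nchunks, dim, base_offset=0):
    """Describe `nchunks` equal views of a tensor along `dim` without
    allocating intermediate tensors; each view is (offset, shape, strides),
    with the offset measured in elements from the base data pointer."""
    assert shape[dim] % nchunks == 0, "sketch assumes an even split"
    split = shape[dim] // nchunks
    views = []
    for i in range(nchunks):
        new_shape = list(shape)
        new_shape[dim] = split
        # advancing `split` elements along `dim` moves the data pointer
        # by split * strides[dim] elements; strides are unchanged
        views.append((base_offset + i * split * strides[dim],
                      new_shape, list(strides)))
    return views

# a contiguous (4, 6) tensor chunked into 2 pieces along dim 1
print(chunk_views((4, 6), (6, 1), 2, dim=1))
# [(0, [4, 3], [6, 1]), (3, [4, 3], [6, 1])]
```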
- Expect test and correctness test to see if a single chunk is fused
by the graph fuser
- Correctness test for a variety of chunks (dimension = beginning,
middle, end) and tensors (contiguous, non-contiguous, edge case
(splitSize = 1) for both CPU/CUDA
- Expect test for multiple chunks fused into the same kernel and
correctness test.
cc zdevito apaszke
LSTM forward pass, 1 layer, 512 hidden size and input size, 100 seq length, requires_grad=False on all inputs and weights.
After changes:
```
thnn cudnn jit
8.8468 6.5797 9.3470
```
Before changes:
```
thnn cudnn jit
9.9221 6.6539 11.2550
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10178
Differential Revision: D9382661
Pulled By: zou3519
fbshipit-source-id: 1f8a749208fbdd45559775ce98cf4eb9558448f8
Summary:
Take 2 of #10543
The problem was that between the commit and the merge one more entry point was added: `tools/build_libtorch.py`.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10659
Differential Revision: D9393540
Pulled By: soumith
fbshipit-source-id: 8ebfed600fc735fd1cb0489b161ec80e3db062e0
Summary:
Fixes #10096
If the only thing preventing a simple mappable operator from being fused
into a fusion group is that its Tensor inputs are not of the same shape as the
output, then the graph fuser inserts explicit expand nodes for those
inputs.
This helps the graph fuser not miss out on any fusion opportunities
involving simple mappable operations that have Tensor inputs. This PR
doesn't do anything for the scalar case; that can be addressed later.
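The decision above — insert an explicit expand only when a Tensor input broadcasts to, but does not equal, the output shape — can be sketched in pure Python (hypothetical helpers, not the actual graph fuser code):

```python
def broadcastable_to(in_shape, out_shape):
    """True if in_shape broadcasts to out_shape under NumPy/PyTorch rules:
    trailing dimensions must match or be 1 on the input side."""
    for i, o in zip(reversed(in_shape), reversed(out_shape)):
        if i != o and i != 1:
            return False
    return len(in_shape) <= len(out_shape)

def needs_expand(in_shape, out_shape):
    """A fusable input gets an explicit expand node iff its shape
    broadcasts to, but does not already equal, the output shape."""
    return tuple(in_shape) != tuple(out_shape) and \
        broadcastable_to(in_shape, out_shape)

print(needs_expand((1, 4), (3, 4)))  # broadcast along dim 0 -> True
print(needs_expand((3, 4), (3, 4)))  # already the output shape -> False
print(needs_expand((2, 4), (3, 4)))  # not broadcastable at all -> False
```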
Test Plan
- Simple expect test case
- Added expect tests for a raw LSTMCell. The expands help speed up the
forwards pass by allowing more operations to be fused into the LSTMCell's single
FusionGroup.
cc apaszke zdevito
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10325
Differential Revision: D9379308
Pulled By: zou3519
fbshipit-source-id: 86d2202eb97e9bb16e511667b7fe177aeaf88245
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10630
`onnxTensorDescriptorV1.name` points into a string buffer. We use a vector of strings as the storage, which means we cannot let the vector reallocate, because that may invalidate the `onnxTensorDescriptorV1.name` pointers. The solution is to reserve a large enough vector up front so that it won't reallocate.
Reviewed By: bddppq, houseroad
Differential Revision: D9381838
fbshipit-source-id: f49c5719aafcc0829c79f95a2a39a175bcad7bfe
Summary:
This is on the way to resolving #9940.
Fixes #10501
This PR modifies graph fuser to fuse operations that have constant
scalar arguments. These constant scalar arguments are directly inlined
into the kernel body.
The context for this is that LSTM backward (in particular, sigmoid
backward) has many add(x, 1.) operations. This PR should be sufficient for
LSTM backward to get fused by the graph fuser.
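Inlining a constant scalar directly into the kernel body amounts to emitting it as a literal in the generated source instead of passing it as an argument. A minimal codegen sketch (hypothetical function; not the actual JIT fusion codegen):

```python
def emit_add_kernel(scalar):
    """Emit a pointwise-kernel body with the constant scalar inlined as a
    literal, so the fused kernel needs no extra scalar parameter."""
    return f"out[i] = x[i] + {float(scalar)!r};"

# sigmoid backward is full of add(x, 1.) nodes; with inlining each one
# compiles straight into the fused kernel body:
print(emit_add_kernel(1.0))  # out[i] = x[i] + 1.0;
```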
cc apaszke zdevito
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10511
Differential Revision: D9378896
Pulled By: zou3519
fbshipit-source-id: 6a7a2987f5b6e8edaaf4b599cd200df33361650f
Summary:
This is still not the final PR, but it removes all blockers for actually using the RNN functions directly in the JIT. Next patch should be final, and will actually remove the symbolic_override code, and change it to proper symbolics for those ATen functions. Turns out the symbolic code can be also cleaned up a bit, and I'll do that too.
zdevito ezyang
colesbury (for minor DispatchStub.h) changes
There was no way to handle those in the JIT for now, and they turned
out to be completely unnecessary. It should make the Python and C++
module code much simpler too, since all the logic is now centralized
in the native functions.
The downside is that RNN modules no longer own their dropout buffers,
which are shared per-device instead (with appropriate locking and
synchronization). This might appear as a perf regression at first, but
in reality it's highly unlikely that anyone will want to run cuDNN RNNs
on the same GPU in parallel.
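The "shared per-device, with locking" pattern can be sketched in pure Python (hypothetical names; the real dropout-state cache lives in the C++ cuDNN RNN bindings):

```python
import threading

_dropout_states = {}            # device index -> shared dropout state
_dropout_states_lock = threading.Lock()

def get_dropout_state(device):
    """Return the per-device dropout buffer, creating it lazily.

    Every RNN on a given device shares the same state object instead of
    owning its own buffer; the lock serializes creation and access."""
    with _dropout_states_lock:
        if device not in _dropout_states:
            _dropout_states[device] = {"device": device, "initialized": False}
        return _dropout_states[device]

a = get_dropout_state(0)
b = get_dropout_state(0)
print(a is b)                        # shared, not per-module: True
print(get_dropout_state(1) is a)     # distinct state per device: False
```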
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10581
Reviewed By: colesbury
Differential Revision: D9365541
Pulled By: apaszke
fbshipit-source-id: 3ef8677ee5481bae60c74a9117a2508665b476b5
Summary:
This PR is the first step in integrating the torch.nn library with the JIT. It adds tests for the nn functional interfaces in trace/script mode and tries to find the differences between the torch.nn.functional ops and the ATen ops, to see what work is needed to support the full set of nn functionals in script mode.
Some statistics in summary:
- 84 useful functions in torch.nn.functional in total (not counting helper and deprecated functions).
- 7 functions/ops do not support higher-order gradients, so they are excluded from the test entirely.
- 36 functions differ from their ATen counterparts for various reasons. Among those 36, roughly 10-15 differ only in naming or are simple transformations using other ops inside the function.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10409
Differential Revision: D9350694
Pulled By: wanchaol
fbshipit-source-id: 8fce6f30d8d25ace5a544a57b219fe61f5a092f8
Summary:
Inline if branches whose conditions are constant. If an if node gets inlined, the set of mutated variables returned by its ancestors may change. In the following example the block should
return a mutated set of (a), not (a, b).
```
if cond:
if True:
a = a - 1
else:
b = b - 1
```
To calculate this, we recursively update the mutated variables in if branches from the leaf nodes up.
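The bottom-up recomputation can be sketched over a tiny nested-if IR (hypothetical representation, not the actual JIT IR):

```python
def mutated_vars(node):
    """Bottom-up mutated-variable set for a toy nested-if IR.

    A node is either ('assign', name) or
    ('if', cond, then_block, else_block) where blocks are node lists.
    An `if` with a constant condition is treated as inlined: only the
    taken branch contributes mutations."""
    if node[0] == "assign":
        return {node[1]}
    _, cond, then_block, else_block = node
    if cond is True:        # constant-true: only the then-branch survives
        blocks = [then_block]
    elif cond is False:     # constant-false: only the else-branch survives
        blocks = [else_block]
    else:                   # non-constant: both branches may run
        blocks = [then_block, else_block]
    out = set()
    for block in blocks:
        for child in block:
            out |= mutated_vars(child)
    return out

# if cond: (if True: a = a - 1 else: b = b - 1)
prog = ("if", "cond",
        [("if", True, [("assign", "a")], [("assign", "b")])],
        [])
print(mutated_vars(prog))  # {'a'}
```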
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10084
Reviewed By: michaelsuo
Differential Revision: D9340429
Pulled By: eellison
fbshipit-source-id: b0dd638a5cace9fdec3130460428fca655ce4b98
Summary:
https://github.com/pytorch/pytorch/pull/10100 recently started taking external inputs/outputs into account in nomnigraph. This PR makes adjustments to:
0. Relax some of the conditions on external inputs.
1. Update NNModule inputs/outputs when pruning the input/output.
2. Avoid copying external inputs/outputs, as nomnigraph already takes care of it.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10598
Reviewed By: bwasti
Differential Revision: D9371730
Pulled By: yinghai
fbshipit-source-id: 9273be5041dc4cc8585587f47cb6721e518a06a8
Summary:
Custom Python installations that have no `python` or `python3` aliases can't be found by CMake's `findPythonInterp` without an extra CMake argument.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10543
Differential Revision: D9378844
Pulled By: ezyang
fbshipit-source-id: 022e20aab7e27a5a56b8eb91b6026151116193c7
Summary:
Fix "error LNK2019: unresolved external symbol" from `CAFFE_KNOWN_TYPE` in tests, where we should use dllexport instead of AT_CORE_API (= dllimport).
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10602
Differential Revision: D9377394
Pulled By: Yangqing
fbshipit-source-id: 993062a461ffce393f2321c5391db5afb9b4e7ba
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10282
This diff removes the unused/deprecated features from the code base.
Reviewed By: manojkris
Differential Revision: D9169859
fbshipit-source-id: d6447b7916a7c687b44b20da868112e6720ba245
Summary:
This is the last step in the custom operator implementation: providing a way to build from C++ and Python. For this I:
1. Created a `FindTorch.cmake` taken largely from ebetica with a CMake function to easily create simple custom op libraries
2. Created a `torch/op.h` header for easy inclusion of necessary headers,
3. Created a test directory `pytorch/test/custom_operator` which includes the basic setup for a custom op.
1. It defines an op in `op.{h,cpp}`
2. Registers it with the JIT using `RegisterOperators`
3. Builds it into a shared library via a `CMakeLists.txt`
4. Binds it into Python using a `setup.py`. This step makes use of our C++ extension setup that we already have. No work, yey!
The pure C++ and the Python builds are separate and not coupled in any way.
zdevito soumith dzhulgakov
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10226
Differential Revision: D9296839
Pulled By: goldsborough
fbshipit-source-id: 32f74cafb6e3d86cada8dfca8136d0dfb1f197a0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10599
Not spawning threads with spin-lock synchronization is bad because they will switch to a `condvar` wait, which increases wake-up latency the next time they are needed.
Reviewed By: ajtulloch
Differential Revision: D9366664
fbshipit-source-id: 3b9e4a502aeefaf0ddc4795303a855d98980b02e
Summary:
This commit adds the ``buffers()`` and ``named_buffers()`` methods as
analogues of ``parameters()`` and ``named_parameters()``.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10554
Reviewed By: SsnL
Differential Revision: D9367762
Pulled By: jma127
fbshipit-source-id: f2042e46a7e833dce40cb41681dbd80d7885c74e
Summary:
A continuation of https://github.com/pytorch/pytorch/pull/10504 for GPU, torch, etc. builds.
I was testing with
```
FULL_CAFFE2=1 python setup.py build_deps | tee ~/log.txt
cat ~/log.txt | egrep 'undefined refer' | sort | less
```
I'll rebase on master when Yangqing's changes in 10504 land, but putting up for some testing.
cc mingzhe09088 anderspapitto ezyang
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10507
Reviewed By: Yangqing
Differential Revision: D9359606
Pulled By: orionr
fbshipit-source-id: c2a3683b3ea5839689f5d2661da0bc9055a54cd2
Summary:
Resubmit #10416 with fixed tests. This removes the implicit GPU-to-CPU conversion when calling numpy, to keep the behavior matching other methods.
It requires users to move the tensor back to the CPU with `.cpu()` before calling numpy functions on it.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10553
Differential Revision: D9350212
Pulled By: ailzhang
fbshipit-source-id: 9317d8fea925d4b20ae3150e2c1b39ba5c9c9d0a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10494
Adding the AllreduceBcube routines, as they are now available in gloo.
Reviewed By: wesolwsk
Differential Revision: D8269473
fbshipit-source-id: 6a3a32291bbf1fbb328b3ced0f2a753dc5caf4e5
Summary:
The ONNXIFI backend will absorb the constant weight in Conv, so we should not add it as an input; this is just a test artifact. Note that the Onnxifi transformer will do the right thing when cutting the graph to absorb the weights.
rdzhabarov
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10575
Reviewed By: houseroad
Differential Revision: D9357339
Pulled By: yinghai
fbshipit-source-id: a613fa3acafa687295312f5211f8e9d7f77b39cd