pytorch

mirror of https://github.com/saymrwulf/pytorch.git synced 2026-05-15 21:00:47 +00:00

Author	SHA1	Message	Date
bddppq	479481b6cb	Remove linker and dlopen flags that allowed undefined symbols in rocm build (#15091 ) Summary: Previously the undefined symbols were caused by disabled_modules in tools/amd_build/disabled_features.json (now it's cleared). Pull Request resolved: https://github.com/pytorch/pytorch/pull/15091 Differential Revision: D13429595 Pulled By: bddppq fbshipit-source-id: b341e83f9e5a8d16440a364e837b045a8a4fd6e1	2018-12-11 23:23:47 -08:00
Daniel Ingram	5c2c40ad87	Add error type to raise statement Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/15039 Differential Revision: D13419566 Pulled By: zou3519 fbshipit-source-id: f67a3aebce937e3e640e91e81eb3e184cfdf269c	2018-12-11 17:41:44 -08:00
Zachary DeVito	92314c83fa	re-enable copy of python files, but be careful that the copy is only … (#14982 ) Summary: …done once This allow no-op build to work correctly even when BUILD_CAFFE2_OPS is on. Pull Request resolved: https://github.com/pytorch/pytorch/pull/14982 Differential Revision: D13413960 Pulled By: zdevito fbshipit-source-id: 6e5412a8c375af8a47c76f548cdd31cff15f3853	2018-12-11 16:54:08 -08:00
TerryTsao	c2a754c58b	Fix CMakeLists.txt for Int8 python bindings (#15047 ) Summary: Currently in caffe2, one cannot properly fetch the content of Int8 blobs. Upon digging the source code, it turns out that the relevant source code is not being compiled. Adding the source to CMakeLists.txt fixes this issue. First time ever doing a pull request. Please let me know if there's any rule I should follow. Thanks. Pull Request resolved: https://github.com/pytorch/pytorch/pull/15047 Differential Revision: D13417583 Pulled By: bddppq fbshipit-source-id: dd39575971a3012635edbf97a045d80e4b62a8eb	2018-12-11 10:48:47 -08:00
Jongsoo Park	cff509e2b1	share code between adagrad and rowwise adagrad tests (#14692 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14692 Remove some code duplication Reviewed By: chocjy Differential Revision: D13296731 fbshipit-source-id: 5924e037ca64fc4b89234be922bc5ca47fb8bd32	2018-12-10 22:10:39 -08:00
bddppq	45dfc6764e	Enable more caffe2 fp16 rocm tests (#15040 ) Summary: cc rohithkrn petrex Pull Request resolved: https://github.com/pytorch/pytorch/pull/15040 Reviewed By: houseroad Differential Revision: D13413068 Pulled By: bddppq fbshipit-source-id: b2967f16f8da0b9e80083138fb8632c14e9e9b63	2018-12-10 21:30:21 -08:00
Ilia Cherniavskii	e9cd781681	Back out "Revert D13043261: [caffe2] Task graph and task future abstractions in executor" Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/15030 Reviewed By: bddppq Differential Revision: D13408998 fbshipit-source-id: 9eb675e09fbc4829eab34df7aa660a0590816feb	2018-12-10 19:30:58 -08:00
rohithkrn	7e2b074219	Integrate rocBLAS fp16 api into Caffe2 (#14882 ) Summary: This PR integrates rocBLAS half and mixed precision APIs in to Caffe2. Pull Request resolved: https://github.com/pytorch/pytorch/pull/14882 Differential Revision: D13407840 Pulled By: bddppq fbshipit-source-id: 75cb0d74da066776fa66575f1d255e879d36121e	2018-12-10 17:54:06 -08:00
Junjie Bai	4a145cd95c	Revert D13043261: [caffe2] Task graph and task future abstractions in executor Differential Revision: D13043261 Original commit changeset: d89424354aea fbshipit-source-id: b307e3281c4d83b60ba2bfadcbcf69afb7a41412	2018-12-10 16:03:59 -08:00
Ilia Cherniavskii	029600813e	Task graph and task future abstractions in executor Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14116 Reviewed By: dmudiger Differential Revision: D13043261 fbshipit-source-id: d89424354aea14d1d14eb8320fb3aa34908a4e81	2018-12-10 14:28:56 -08:00
Jerry Zhang	a51fe386c8	caffe2/caffe2/contrib/script (#15007 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/15007 Pull Request resolved: https://github.com/pytorch/pytorch/pull/14979 att Reviewed By: dzhulgakov Differential Revision: D13286191 fbshipit-source-id: b8a6bc7aea44487aea4dcf7f44c858fd30c6293c	2018-12-10 14:23:31 -08:00
Yiming Wu	a1494efdfa	fix auto grad summing for IfOp where intermediate output needs renaming (#14772 ) Summary: fix auto grad summing for IfOp where intermediate output needs renaming. Bug before this diff: - we only renames the output of IfOp without changing the subnet ops output - this results in blob not found error the unittest provides an example this diff fix that for IfOp Pull Request resolved: https://github.com/pytorch/pytorch/pull/14772 Differential Revision: D13327090 Pulled By: harouwu fbshipit-source-id: ec40ee88526ace3619c54551e223dd71158a02f8	2018-12-09 08:26:46 -08:00
Your Name	5e06fa0baf	ONNX changes to use int32_t (instead of enum) to store data type Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14926 Reviewed By: houseroad Differential Revision: D13390642 Pulled By: bddppq fbshipit-source-id: c2314b24d9384f188fda2b9a5cc16465ad39581e	2018-12-08 01:06:08 -08:00
Lu Fang	5be28ade66	Automatic update of fbcode/onnx to aca8473a40cf43f01958c81b648efcee7f3a755a (#14865 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14865 Previous import was 42804705bdbf179d1a98394008417e1392013547 Included changes: - [aca8473](https://github.com/onnx/onnx/commit/aca8473): Add Erf operator for computing error function (#1675) <bddppq> - [3fc82ca](https://github.com/onnx/onnx/commit/3fc82ca): Add IsNaN operator. (#1656) <Pranav Sharma> - [0685f01](https://github.com/onnx/onnx/commit/0685f01): Add Sign Op (#1658) <Rui Zhu> - [2a8fae8](https://github.com/onnx/onnx/commit/2a8fae8): Fix unused var warning (#1669) <Yinghai Lu> - [e212833](https://github.com/onnx/onnx/commit/e212833): Update scan (#1653) <G. Ramalingam> Reviewed By: zrphercule Differential Revision: D13370727 fbshipit-source-id: 13a93d5acc8d4758f682278ea162ec9124ced22d	2018-12-07 17:37:42 -08:00
rohithkrn	11a9248d01	Enable fp16 for MIOPEN operators in Caffe2 (#14905 ) Summary: This PR enables fp16 MIOPEN operators in Caffe2. Pull Request resolved: https://github.com/pytorch/pytorch/pull/14905 Differential Revision: D13383439 Pulled By: bddppq fbshipit-source-id: 840afa8d08bef2952ca0039dee2423f1542bb330	2018-12-07 17:26:44 -08:00
Sergei Nikolaev	a0ee3a279c	USE_TENSORRT support and TensorRT 5 compatibility Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13945 Differential Revision: D13317525 Pulled By: yinghai fbshipit-source-id: 8630dfec1bbc5aac19539e344e7c38a7fd8b051d	2018-12-07 14:01:11 -08:00
Orion Reblitz-Richardson	febc7ff99f	Add __init__.py so files get picked up on install (#14898 ) Summary: This will let us install tests and other Caffe2 python code as a part of running Caffe2 tests in PyTorch. Broken out of https://github.com/pytorch/pytorch/pull/13733/ cc pjh5 yf225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/14898 Reviewed By: pjh5 Differential Revision: D13381123 Pulled By: orionr fbshipit-source-id: 0ec96629b0570f6cc2abb1d1d6fce084e7464dbe	2018-12-07 13:40:23 -08:00
PenghuiCheng	939877bf4b	Implementation of WeightedSum op for mkl-dnn and fix FC op output shape issue. Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14407 Reviewed By: yinghai Differential Revision: D13364364 Pulled By: wesolwsk fbshipit-source-id: e69bcd1bc52e35b2f0e45e5dc40184f1bd66605d	2018-12-07 12:35:19 -08:00
Yudong Guang	265b55d028	Revert D13205604: Move numa.{h, cc} to c10/util Differential Revision: D13205604 Original commit changeset: 54166492d318 fbshipit-source-id: 89b6833518c0b554668c88ae38d97fbc47e2de17	2018-12-07 10:01:25 -08:00
Jerry Zhang	1d111853ae	Move numa.{h, cc} to c10/util (#14393 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14393 att Reviewed By: ezyang Differential Revision: D13205604 fbshipit-source-id: 54166492d31827b0343ed070cc36a825dd86e2ed	2018-12-06 11:30:13 -08:00
lcskrishna	12addc64a6	Fixed MIOpen RNN Segfault issue and enabled RNN test (#14810 ) Summary: This pull request contains changes for: 1. Added MIOpen RNN API miopenGetRNNLayerBiasSize and miopenGetRNNLayerParamSize. 2. Fixed usage of API miopenGetRNNLayerParam. 3. Modifying the RNN test to run using MIOpen engine. Differential Revision: D13355699 Pulled By: bddppq fbshipit-source-id: 6f750657f8049c5446eca893880b397804120b69	2018-12-05 23:54:31 -08:00
Huan Gui	ba287eebca	Fix clip gradient with empty input (#14709 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14709 As titled Reviewed By: Wakeupbuddy Differential Revision: D13305554 fbshipit-source-id: 380062d4b0e4f9dc0207a27766cac7b8d05384d5	2018-12-05 22:53:25 -08:00
Jerry Zhang	a597c0ca05	Add inplace FeedTensor for python frontend (#14512 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14512 att Reviewed By: dzhulgakov Differential Revision: D13243278 fbshipit-source-id: 78af417d0fcd9b9791ee839d62095903e49205cb	2018-12-04 12:45:11 -08:00
Michael Antonov	773f4d8081	Implements Gather operator for arbitrary axis, sharing the code with BatchGather. (#13756 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13756 This implements general Gather operator for arbitrary axis, sharing the code with BatchGather. - CPU gather & batch gather logic is now shared through caffe2::gather_helper, for any axis. - Shared CUDA kernel moved to gather_op.cuh, for any axis. - Gradients of axis > 0 delegate to BatchGatherGradientOp which now has axis argument. - BatchGatherOp doc strings updated to have correct rank (q + (r -1)) and output. - Added tests for axis == 2. GatherOp supports index wrapping for axis == 0 by default, which was earlier for ONNX. This diff also extends it to work in Cuda kernel. Added "wrap_indices" argument which specifies wheather this wrapping should be done; set it to true if you'd like wrapping for any axis. TBD: Update gradients to support negative indices (separate diff). TBD: Once we have operator versioning, we'd like to update GatherOp to NOT support axis 0 wrapping by default, but rather do it only if wrap_indices is set. Reviewed By: dzhulgakov Differential Revision: D12983815 fbshipit-source-id: 8add9d67b47fe8c5ba7a335f581ca0530b205cd7	2018-12-04 11:54:28 -08:00
Lu Fang	44894915d6	Automatic update of fbcode/onnx to 6b34743d2e361bbc0acb29dd73536478cb92562e (#14637 ) Summary: Previous import was f461f7aad9987635b4aff108620ed7918f002d19 Included changes: - [6b34743](https://github.com/onnx/onnx/commit/6b34743): fix the const map initializatoin (#1662) <Lu Fang> - [ae80999](https://github.com/onnx/onnx/commit/ae80999): Fuse Pad into Conv optimizer (#1580) <vloncar> Pull Request resolved: https://github.com/pytorch/pytorch/pull/14637 Differential Revision: D13281338 Pulled By: houseroad fbshipit-source-id: c31429914bf5954fdc85e0c02168836ef47d635c	2018-12-03 20:11:17 -08:00
Yan Zhu	aeb38cfcea	cuda implementation for PackSegment to support presence mask (#14635 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14635 as title Reviewed By: enosair Differential Revision: D13254097 fbshipit-source-id: b9f40109e2889907c925f9a4df9da14f67f45f38	2018-11-30 16:54:10 -08:00
Lu Fang	2752ad8045	Automatic update of fbcode/onnx to f461f7aad9987635b4aff108620ed7918f002d19 (#14568 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14568 Previous import was 882c5283c54345d131e8fe5c859e4844dcf7ca8e Included changes: - [f461f7a](https://github.com/onnx/onnx/commit/f461f7a): Show the op's type and name when the shape inference is failed. (#1623) <Jerry> - [ab8aaf9](https://github.com/onnx/onnx/commit/ab8aaf9): Add scan test case (#1586) <G. Ramalingam> - [c95357e](https://github.com/onnx/onnx/commit/c95357e): link the tutorial (#1650) <Lu Fang> - [d7e2420](https://github.com/onnx/onnx/commit/d7e2420): Upgrade label encoder to support more input types (#1596) <Wei-Sheng Chin> - [6425108](https://github.com/onnx/onnx/commit/6425108): Add Doc about Adding New Operator into ONNX (#1647) <Lu Fang> - [295889c](https://github.com/onnx/onnx/commit/295889c): use an empty initializer to create map (#1643) <Lu Fang> - [e38f3ec](https://github.com/onnx/onnx/commit/e38f3ec): Remove redundant const (#1639) <daquexian> - [ea694bf](https://github.com/onnx/onnx/commit/ea694bf): implement fuse reduce->unsqueeze + fix assumption in nop_dropout pass (#1565) <Armen> - [6db386e](https://github.com/onnx/onnx/commit/6db386e): make output shape clear enough for Softmax family (#1634) <Lu Fang> - [2b67c6e](https://github.com/onnx/onnx/commit/2b67c6e): fix batchnorm doc (#1633) <Lu Fang> - [c901784](https://github.com/onnx/onnx/commit/c901784): remove inappropriate consts (#1632) <Lu Fang> - [de82119](https://github.com/onnx/onnx/commit/de82119): Shape inference fix for broadcast, concat and scan (#1594) <KeDengMS> - [d7ffe3b](https://github.com/onnx/onnx/commit/d7ffe3b): Update Optimizer Docs (#1607) <Armen> - [d09d139](https://github.com/onnx/onnx/commit/d09d139): mark PROTOBUF_INCLUDE_DIRS as BUILD_INTERFACE (#1466) <Yuta Okamoto> - [eb4b7c2](https://github.com/onnx/onnx/commit/eb4b7c2): allow variadic parameters of different types (#1615) <G. Ramalingam> - [4166246](https://github.com/onnx/onnx/commit/4166246): Fix onnxifi test (#1617) <Yinghai Lu> - [6706a4d](https://github.com/onnx/onnx/commit/6706a4d): Fix a bug in vector address access (#1598) <Raymond Yang> - [ae39866](https://github.com/onnx/onnx/commit/ae39866): Separate types of inputs 1 and 2 in OneHot op. (#1610) <Spandan Tiwari> - [45ba661](https://github.com/onnx/onnx/commit/45ba661): Handle new types in the switch. (#1608) <Dmitri Smirnov> - [14853b6](https://github.com/onnx/onnx/commit/14853b6): Bump docker image version to 230 used in CircleCI (#1606) <bddppq> - [e0993b8](https://github.com/onnx/onnx/commit/e0993b8): [onnxifi] Make sure that backend handles run async. (#1599) <Roman Dzhabarov> - [e6965cc](https://github.com/onnx/onnx/commit/e6965cc): Introduce SparseTensor ML proto (#1554) <Dmitri Smirnov> - [75b782f](https://github.com/onnx/onnx/commit/75b782f): In driver test check the return status of onnxGetBackendIDs (#1597) <bddppq> - [c05b364](https://github.com/onnx/onnx/commit/c05b364): Make CI log less verbose (#1595) <bddppq> - [fa568e4](https://github.com/onnx/onnx/commit/fa568e4): Loop type shape inferencing (#1591) <Scott McKay> - [937e64c](https://github.com/onnx/onnx/commit/937e64c): add uint8 (#1590) <Lu Fang> - [f86e951](https://github.com/onnx/onnx/commit/f86e951): Add domain as an optional parameter for make_node function (#1588) <Young Kim> - [ff45588](https://github.com/onnx/onnx/commit/ff45588): Remove unreachable code in shape_inference.h (#1585) <Changming Sun> - [f7dcad0](https://github.com/onnx/onnx/commit/f7dcad0): Add several hyperbolic function ops. (#1499) <Sergii Dymchenko> - [a60ac7d](https://github.com/onnx/onnx/commit/a60ac7d): Add OneHot op to ONNX. (#1567) <Spandan Tiwari> - [f6c3a7e](https://github.com/onnx/onnx/commit/f6c3a7e): [compiler flag] Issue a warning if class has virtual method but missing virtual dtor. (#1583) <Roman Dzhabarov> - [88d1784](https://github.com/onnx/onnx/commit/88d1784): Fix MaxUnpool shape inference when output_shape is provided as input (#1578) <Spandan Tiwari> - [20041b7](https://github.com/onnx/onnx/commit/20041b7): Add type shape inferencing for the If operator (#1571) <Scott McKay> - [d6c4c75](https://github.com/onnx/onnx/commit/d6c4c75): Add a virtual destructor to GraphInferencer (#1574) <Changming Sun> - [a339598](https://github.com/onnx/onnx/commit/a339598): fix ConvTranspose spec (#1566) <Wenhao Hu> Reviewed By: zrphercule Differential Revision: D13263831 fbshipit-source-id: a2ff22c6454e2430429e5a7d18d21661a7ffb0cb	2018-11-29 16:31:56 -08:00
rohithkrn	0d663cec30	Unify cuda and hip device types in Caffe2 python front end (#14221 ) Summary: Goal of this PR is to unify cuda and hip device types in caffe2 python front end. Pull Request resolved: https://github.com/pytorch/pytorch/pull/14221 Differential Revision: D13148564 Pulled By: bddppq fbshipit-source-id: ef9bd2c7d238200165f217097ac5727e686d887b	2018-11-29 14:00:16 -08:00
Dmytro Dzhulgakov	0cfbbceac3	Change Tensor::CopyFrom to a simple double dispatch (#14268 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14268 Removes the need for Context in Tensor by doing simple dispatch for CopyBytes. It'd eventually be subsumed by Roy Li's changes of proper copy_ op, but before that is done, let's get a clear logic of how copies are implemented and clean up some craft in CopyFrom implementation. Note, that with these changes, one can probably can get rid of Context::CopyFromCPU/CopyToCPU, but it's a matter for follow up diffs. This diff doesn't change the API of Tensor yet, but relies on the fact that passing `Context` to CopyFrom makes copy async if the device is CUDA and doesn't have any effect otherwise (that's how Context methods are implemented). This doesn't change semantics of copy async implementation - as before it blindly calls cudaMemcpyAsync which probably means that it can be misused if invoked separately outside of operator body. I'll leave it for the follow up copy_ unification. For Extend() we always do async copy - it makes sense as it's an in-place device-device operation and only any further op would be observable. Note: there are now three ways of invoking copy in C2 code - templated CopyBytes, virtual CopyFromCPU/etc, and double-dispatch free method here. Hopefully we can get rid of the second one. Also, please advise whether it's c10-worthy :) Reviewed By: ezyang Differential Revision: D13117987 fbshipit-source-id: a6772d6dcf3effaf06717da3a656fc9873b310b5	2018-11-28 15:45:37 -08:00
Jiyan Yang	a2fcd4dee5	Ensure FP16 rowwise Adagrad can be run Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/12317 Reviewed By: hyuen Differential Revision: D10190778 fbshipit-source-id: 720a9aaa4e6b1736023d8c6326a613e4ea592b31	2018-11-28 02:15:36 -08:00
Jiyan Yang	0199d59d3a	Resubmit: Set the correct engine name for position weighted pooling when fp16 is used for training Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13768 Reviewed By: xianjiec Differential Revision: D12996103 fbshipit-source-id: 5ca4cda4210f68ece2b5d6eced8cf52ee91fb36f	2018-11-27 14:51:56 -08:00
Hassan Eslami	e392d428b1	Allowing TaskGroups to carry remote nets (#14342 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14342 Sometimes, when we are creating a TaskGroup, we are in fact creating a TaskGroup for a distributed job. In some cases, we may want to register a few nets as "remote" to a TaskGroup. The remote net should have sufficient attributes on where they should be executed later on. This diff adds the remote net attribute to the TaskGroup class. It exposes two minimal functionalities: adding a remote net, and getting all remote nets added to a TaskGroup. Reviewed By: d4l3k Differential Revision: D13188320 fbshipit-source-id: efe947aec30817e9512a5e18be985713b9356bdc	2018-11-27 13:34:11 -08:00
Kevin Chen	b18063b39a	Fix caffe2 => onnx exporter for ConvTranspose (#14143 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14143 ConvTranspose has a per-operator attribute rename, which meant that the global attribute rename for kernels => kernel_shape was not applied. Changing the behavior so that the global renames always apply, but per-op renames can override those for specific attributes. Note: The python frontend path isn't actually used for ConvTranspose, but I thought it would be good to make it consistent. Reviewed By: yinghai Differential Revision: D13113395 fbshipit-source-id: cd3f124b4b5c753a506d297138b7d002b51bfb38	2018-11-26 15:51:42 -08:00
Jerry Zhang	735cd06536	FeedTensor returns a Tensor (#14196 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14196 Pull Request resolved: https://github.com/pytorch/pytorch/pull/13641 FeedTensor function used to take a pointer to Tensor and feed the content using Resize and mutable_data, but since Tensor is a pointer now, we can just return a Tensor instead. Reviewed By: dzhulgakov Differential Revision: D13091163 fbshipit-source-id: 9abf2fd320baca76e050530c500dd29f8e2d0211	2018-11-26 13:05:44 -08:00
Huan Gui	60e7d04961	Add Recency Weighted into SparseLookup (#14291 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14291 Add RecencyWeighted into SparseLookup. Reviewed By: Wakeupbuddy Differential Revision: D13147738 fbshipit-source-id: de5dc3aaee8ce7d41c6d30d2ff47e9786a7fa4da	2018-11-24 02:43:31 -08:00
Yinghai Lu	f79fb58744	Make sure we bind input/output of Onnxifi op positionally (#14214 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14214 This is to pick up the residual task of T36325466 to make sure that input/output binding of c2 Onnxifi op is positional. Reviewed By: dzhulgakov Differential Revision: D13134470 fbshipit-source-id: d1b916dade65c79133b86507cd54ea5166fa6810	2018-11-22 00:31:01 -08:00
Gu, Jinghui	60963c2ecb	Add "axis" and "axis_w" arguments in FC to support customized axis to reduce dim. (#12971 ) Summary: Add "axis" and "axis_w" arguments in FC to support customized axis to reduce dim. Pull Request resolved: https://github.com/pytorch/pytorch/pull/12971 Reviewed By: bddppq Differential Revision: D12850675 Pulled By: yinghai fbshipit-source-id: f1cde163201bd7add53b8475329db1f038a73019	2018-11-21 15:44:50 -08:00
Hui Wu	acd7811e33	Add sigmoid op based on MKL-DNN Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13097 Differential Revision: D13105366 Pulled By: yinghai fbshipit-source-id: d156e8fd519baeecf61c25dcd8fa2c2fa7351ef4	2018-11-19 22:56:35 -08:00
zrphercule	03a02b6fd5	Fix a bug in test case of onnx::If Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14209 Differential Revision: D13132607 Pulled By: zrphercule fbshipit-source-id: b7f7ccc6a6cbdeb57a7f88a1971d15dd81e6fc81	2018-11-19 18:46:21 -08:00
Junjie Bai	0d7a986da1	Change hip filename extension to .hip (#14036 ) Summary: xw285cornell - To make hip files to have unique filename extension we change hip files from _hip.cc to .hip (it's the only blessing option other than .cu in hipcc `3d51a1fb01/bin/hipcc (L552)`). - Change to use host compiler to compile .cc\|.cpp files. Previously we use hcc to compile them which is unnecessary - Change the hipify script to not replace "gpu" with "hip" in the filename of the generated hipified files. Previously we do this because hcc has a bug when linking files that have same filename. We have now changed to use host linker to do linking so this is unnecessary anymore. Pull Request resolved: https://github.com/pytorch/pytorch/pull/14036 Reviewed By: xw285cornell Differential Revision: D13091813 Pulled By: bddppq fbshipit-source-id: ea3d887751d8abb39d75f5d5104aa66ce66b9ee0	2018-11-16 11:55:59 -08:00
Duc Ngo	c7a247facf	nomnigraph - support subgraph visualization (#13795 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13795 Add ability for dot string generation for a single subgraph and python bindings (which is pretty useful for model exploration in Python) Restructure DotGenerator class a bit to make it easy to implement this feature Reviewed By: bwasti Differential Revision: D13010512 fbshipit-source-id: 825665438394b7e6968ab6da167b477af82a7b62	2018-11-16 08:19:20 -08:00
Duc Ngo	d7b95dda51	nomnigraph - easy - expose hasProduce(NodeRef) to python (#14075 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14075 Expose hasProduce(NodeRef) to python Reviewed By: bwasti Differential Revision: D13092930 fbshipit-source-id: f1ec06e73e0f5f6a16ad0cbb7d2e3e499a861d8e	2018-11-16 08:19:18 -08:00
Duc Ngo	e7f5fceb99	nomnigraph - easy - expose inducesEdges and addNode to python's NNSubgraph (#14074 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14074 expose inducesEdges and addNode to python's NNSubgraph. This make it easy to manually construct a NNSubgraph in python Reviewed By: bwasti Differential Revision: D13092885 fbshipit-source-id: a94ed0b318162e27e3a4b5a4954eb6d169da7405	2018-11-16 08:19:16 -08:00
Parth Raichura	3808e9fad3	Caffe2: Fix for creating entries of external_input in predict_net (#12979 ) Summary: Currently, after performing an export, the predict_net proto contains two external_input entries for the input data, because external_input is extended twice: once separately with the input blob, and once with all the external_input entries from the proto, which already include the input blob. Signed-off-by: Parth Raichura <parth.raichura@softnautics.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/12979 Differential Revision: D12916349 Pulled By: soumith fbshipit-source-id: 4d4a1c68c0936f8de3f4e380aea1393fe193cd2d	2018-11-15 22:33:50 -08:00
Matthew Brandyberry	c5afad5579	Fix skip logic in caffe_translator_test.py (#13627 ) Summary: Avoid false failure by checking for the presence of the test data in setup. Pull Request resolved: https://github.com/pytorch/pytorch/pull/13627 Differential Revision: D13090324 Pulled By: ezyang fbshipit-source-id: e85571943d168c0007212d7b1a5b99ffa0c39235	2018-11-15 16:45:49 -08:00
Ilia Cherniavskii	0e93500841	Remove async_polling (#13825 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13825 async_polling was an intermediate step towards async_scheduling and is not used Reviewed By: yinghai Differential Revision: D13019059 fbshipit-source-id: eee6ba53e7f476ddb481afba3bf1768303864d32	2018-11-15 16:23:15 -08:00
Edward Yang	3fbb753512	Revert D12873145: [pt1][tensor][refactor] FeedTensor returns a Tensor Differential Revision: D12873145 Original commit changeset: 653735c20d61 fbshipit-source-id: aa6e40a6a24c6f90acbe87b32b3be0020e2584f8	2018-11-15 14:52:46 -08:00
Yan Zhu	2356c8d542	device inference for Adam (#13990 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13990 to make sure ITER blob lives on CPU. Reviewed By: xianjiec Differential Revision: D13056070 fbshipit-source-id: 148edbf745e50e886da3eb99d4e485d11c1924e2	2018-11-14 17:21:08 -08:00
Ashish	f4e502a8c5	Added MIOpen conv transpose op (#13938 ) Summary: This pull request contains changes for: 1. Removing ConvTranspose related changes from caffe2/operators/hip/conv_op_miopen.cc 2. Adding the file caffe2/operators/hip/conv_transpose_op_miopen.cc 3. Modifying the tests to run convTranspose op using MIOpen engine Differential Revision: D13055099 Pulled By: bddppq fbshipit-source-id: ca284f8f9a073005b22013c375cc958257815865	2018-11-13 21:01:52 -08:00
Shuting Wang	23e19ebfa7	add non-exponential emphasis loss to Lambdarank Summary: Currently Lambdarank applies exponential emphasis on relevance, i.e., g=2^rel when calculating DCG; this diff adds an option that supports g=rel in the loss function. Reviewed By: itomatik Differential Revision: D9891514 fbshipit-source-id: 64730d467a665670edd37e6dc1c077987991d1a8	2018-11-13 14:54:04 -08:00


2216 commits