Summary: Following krp's suggestion, check if the shape parameter is empty.
Reviewed By: dzhulgakov
Differential Revision: D4686698
fbshipit-source-id: 3f9fb1e3215dd2a4a726442531201eeb18224bc6
Summary:
Created a new function with the specifics of the MI LSTM implementation in Caffe2.
See https://arxiv.org/pdf/1606.06630.pdf for details.
See D4478877 for the implementation of the same in tensorflow
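For reference, the multiplicative-integration (MI) building block from the paper replaces each additive pre-activation Wx + Uh + b with

```
\tilde{y} = \alpha \odot (W x_t) \odot (U h_{t-1}) + \beta_1 \odot (U h_{t-1}) + \beta_2 \odot (W x_t) + b
```

where \alpha, \beta_1, \beta_2 are learned vectors and \odot is element-wise multiplication; exactly how this is wired into each LSTM gate follows the diff and is not spelled out here.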
Reviewed By: jhcross
Differential Revision: D4669882
fbshipit-source-id: 095bbcf187dbdac2cd79558ff0c8f9f67d8af639
Summary: ReversePackedSegs operator for CUDA. The input "lengths" (static integers) is required to be in CPU memory.
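A minimal CPU sketch of how the op is invoked (the shapes and int32 lengths dtype are assumptions; on CUDA the same op would run under a GPU device option while "lengths" stays on CPU):

```python
import numpy as np
from caffe2.python import core, workspace

# Padded batch: (T=5 steps, N=2 sequences, D=3 features); per-sequence valid lengths.
workspace.FeedBlob("data", np.random.randn(5, 2, 3).astype(np.float32))
workspace.FeedBlob("lengths", np.array([5, 3], dtype=np.int32))

# Reverses the first lengths[i] steps of sequence i, leaving the padding in place.
op = core.CreateOperator("ReversePackedSegs", ["data", "lengths"], ["reversed"])
workspace.RunOperatorOnce(op)
print(workspace.FetchBlob("reversed").shape)  # (5, 2, 3)
```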
Differential Revision: D4661281
fbshipit-source-id: c800c316c34015ba8e732dcbcaa8c4edaffdfeab
Summary: Super rough implementation of recurrent attention. Planning to factor out the common code between the two functions, as well as between train and eval. I want to get this out and get eyes on it sooner rather than later.
Differential Revision: D4647837
fbshipit-source-id: 54bc4e8ed0df6f04c86c425926decbe89f73b068
Summary: Add gradient support for Caffe2 operator SumElements (for use in Translation RNN training pipeline).
Differential Revision: D4669036
fbshipit-source-id: 502760a2a624b20b3241e83a2f208f450b6ff36f
Summary: Renamed ElementwisePower to Pow for better discoverability. Added a CUDA version, a gradient, and tests.
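A quick sketch of the renamed op (the `exponent` argument is carried over from ElementwisePower; treat the exact call as an assumption):

```python
import numpy as np
from caffe2.python import core, workspace

workspace.FeedBlob("X", np.array([1.0, 2.0, 3.0], dtype=np.float32))
# Element-wise power: Y = X ** exponent.
op = core.CreateOperator("Pow", ["X"], ["Y"], exponent=2.0)
workspace.RunOperatorOnce(op)
print(workspace.FetchBlob("Y"))  # [1. 4. 9.]
```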
Reviewed By: kennyhorror
Differential Revision: D4665550
fbshipit-source-id: dd33d8ad3917d71504e363ab397af50d38a63b1f
Summary: Add a simple op to sum the elements, with optional averaging. This is basically a copy of AverageLossOp, which we should alias to this one, and maybe develop this towards a generic norm op.
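A hedged sketch of usage with the optional averaging flag (argument name `average` assumed from the description):

```python
import numpy as np
from caffe2.python import core, workspace

workspace.FeedBlob("X", np.arange(6, dtype=np.float32).reshape(2, 3))
# average=True divides the sum by the number of elements, like AverageLossOp.
op = core.CreateOperator("SumElements", ["X"], ["s"], average=True)
workspace.RunOperatorOnce(op)
print(workspace.FetchBlob("s"))  # 2.5
```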
Reviewed By: jhcross
Differential Revision: D4664591
fbshipit-source-id: 0e0c0efe9e415e2ad2feecfa42b03db2c83bee70
Summary: Due to popular demand, added an op to compute the element-wise square, plus a gradient for it (just for the fun of it).
Reviewed By: Yangqing
Differential Revision: D4664797
fbshipit-source-id: 0a29c7c249fdc72f51412bebd6ae352a7801cf05
Summary: Simple elementwise Max implementation for CUDA. Given N inputs, it will do N-1 pairwise maxes. I am not sure if it would be much better to iterate through all the inputs in the kernel, since this has better locality. We can also optimize later.
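Illustrative call with N inputs; internally this reduces to N-1 pairwise maxes as described:

```python
import numpy as np
from caffe2.python import core, workspace

for name, vals in [("A", [1, 5, 2]), ("B", [4, 0, 3]), ("C", [2, 2, 9])]:
    workspace.FeedBlob(name, np.array(vals, dtype=np.float32))

# Element-wise max over all inputs: M[i] = max(A[i], B[i], C[i]).
op = core.CreateOperator("Max", ["A", "B", "C"], ["M"])
workspace.RunOperatorOnce(op)
print(workspace.FetchBlob("M"))  # [4. 5. 9.]
```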
Reviewed By: Yangqing
Differential Revision: D4659953
fbshipit-source-id: 3a23b7fb3dbdf1d43bf3134ece03af4a791844dd
Summary:
To avoid the NumPy warning: "using a non-integer number instead of an integer will result in an error in the future".
Closes https://github.com/caffe2/caffe2/pull/64
Differential Revision: D4658348
Pulled By: Yangqing
fbshipit-source-id: 3a1b33cbb27849bc167b08147d078e8d487567f4
Summary:
The existing code uses vector<T> to store the given tensor and then copies it to the output.
If T = bool, vector<bool> stores the data as bits, so the copy does not work.
We use a TensorCPU to store it instead.
Also adds a unit test.
Reviewed By: kennyhorror
Differential Revision: D4622325
fbshipit-source-id: 95c27b5d1cfbc836d2419d01cacde5a3172f4d7e
Summary: The shape inference did not check for spatial mode.
Reviewed By: andrewwdye
Differential Revision: D4638218
fbshipit-source-id: f15419738587013dea39e04a3da086890938c4e2
Summary:
A bit too much stuff in one diff, sorry:
1. Add inference for gradient types by using the fact that x_grad is the gradient of x and must be of the same shape. It is kind of awkward to use string matching, but in addition I rely on the operator actually being a gradient op.
2. dzhulgakov was right: a scalar's shape is () and not (1). Sorry, my earlier claim was #fakenews.
3. Added inference functions for MakeTwoClass, MomentumSGDUpdate and the cross-entropy ops.
Reviewed By: dzhulgakov
Differential Revision: D4569758
fbshipit-source-id: 0db13f33819777fdddefe21d4b1ebf906fcaf98c
Summary:
Add cudnn v6 support, including testing support for dilated convolution.
Add a check to ensure that the versions of cuDNN used to compile Caffe2 and to run it are compatible.
Closes https://github.com/caffe2/caffe2/pull/85
Reviewed By: bwasti
Differential Revision: D4387690
Pulled By: Yangqing
fbshipit-source-id: 312960134398dd4afe6ee0c01cdc160046c904e8
Summary:
Previously the fp16 type was supported only in the SparseLengthsSum operator; now it
works in all other segment operators as well.
Reviewed By: dzhulgakov
Differential Revision: D4624312
fbshipit-source-id: c9d72110e3762167270bb088405eaf9c56e88493
Summary: Inference function for the Im2ColOp: caffe2/caffe2/operators/im2col_op.cc.
Differential Revision: D4608663
fbshipit-source-id: d26ffb403c2acb7a5ead5f58f044ee3340c8311a
Summary:
Reduce the test input size for the instance norm gradient check. The larger size is currently timing out on stress tests,
e.g.: "Timeout: Ran out of time before finding a satisfying example for test_instance_norm_gradients. Only found 2 examples in 125.39s."
Reviewed By: Yangqing
Differential Revision: D4608828
fbshipit-source-id: ce17a3ad28752d808efcbf79f1ea4238e63fb005
Summary: curandGenerateNormal can only generate arrays whose length is a multiple of 2. The MSRAFill and GaussianFill operators use the RandGaussian utility method, which in turn uses curandGenerateNormal. This is a test which runs the operators on both devices to generate odd-sized random arrays.
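Roughly what the test exercises (CPU shown here; running the same op with a CUDA device option goes through RandGaussian/curandGenerateNormal, which is why the odd length matters):

```python
from caffe2.python import core, workspace

# Odd-sized output: the GPU path must handle the odd length internally,
# since curandGenerateNormal only produces even-length arrays.
op = core.CreateOperator("GaussianFill", [], ["out"], shape=[7], mean=0.0, std=1.0)
workspace.RunOperatorOnce(op)
assert workspace.FetchBlob("out").shape == (7,)
```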
Differential Revision: D4602819
fbshipit-source-id: e65f5c731e925886cfa14afff482f7053bd020a0
Summary:
Implementation of ##LSTMWithAttention##
Still TBD:
1. There are problems with back propagation, because the gradient is not implemented for ops with broadcasting
2. I need to make initial_recurrent_state be of shape [dim] rather than [1, batch_size, dim], so one doesn't need to provide batch_size to LSTMWithAttention
Differential Revision: D4298735
fbshipit-source-id: 8903fcff4d6a66647ee6d45a6ef28803fc3091e5
Summary:
In-place gives a ~30% speedup, but needs a change to torch2caffe
or a graph rewrite on the client.
Differential Revision: D4577582
fbshipit-source-id: c31bf8ba97f4fa4cedf355cf2475eb7bab48b304
Summary: Another part of making DPER compatible with half-floats. This diff adds support for fp16 to the segment reduction operators used in DPER.
Reviewed By: dzhulgakov
Differential Revision: D4587560
fbshipit-source-id: 0ae10648a7286a820bffaee802464dd9464584bc
Summary: This fixes a bug in the Eigen implementation that calculates cross entropy.
Reviewed By: salexspb
Differential Revision: D4582078
fbshipit-source-id: 4c92047e9dbbe219fcbef618a45c584c2fbfaad5
Summary:
- Key-value store for counters.
- Counters are updated via macros that also export USDT probes.
- Counter values can be exported using caffe2 operators.
- Snapshot mechanism for tracking time-window counter values.
Reviewed By: dzhulgakov, pietern
Differential Revision: D4553761
fbshipit-source-id: 25a1a91a3168dcff2159c6fba7b357d3fd3aa9bf
Summary:
Remove the use of `NextName` in the layer model helper, so that the same function returns a `model_helper` that constructs an identical `Net` when under the same NameScope.
`NextScopedBlob` should only take effect when there is a real name conflict; otherwise it returns a ScopedBlobReference.
This is critical for parameter blobs. In the long run, we need to be able to specify parameter blobs more explicitly (kennyhorror is working on this). This solution works in the short term for, e.g., two-tower sparse nn models.
Reviewed By: kennyhorror
Differential Revision: D4555423
fbshipit-source-id: 2c4b99a61392e5d51aa878f7346466a8f14be187
Summary:
Pass through the h-value recurrent output unchanged at each LSTM step beyond the valid part of a sequence (computed based on seqLengths, allowing batching of sequences of different length). This enables using the final-step output of each sequence as the output when one vector is desired for the entire sequence. Gradient also passed back unchanged.
Also made some cosmetic changes to recurrent_network_test.py (corrected the seq_lengths offset, which should be in [1, T] rather than [0, T-1]).
Reviewed By: urikz
Differential Revision: D4540307
fbshipit-source-id: 73a9f6326069d713dcb0cdc8d17869317c6dbe96
Summary: This diff adds shape inference for the SoftmaxWithLoss Operator
Differential Revision: D4565835
fbshipit-source-id: 1c2db398524c765977ec4d8a22c9b986bf9faf82
Summary:
One can find the reason why I need a gradient for CopyOp in this post: https://fb.facebook.com/groups/1405155842844877/permalink/1639683782725414/
The gradient for CopyOp is trivial when the device is the same (CPU, or the same GPU), but gets a little harder when the copy was made across two different GPUs.
I introduce a new operator, CopyOnDeviceLike, which has an additional second input. The op copies the first input to the same device as the second one. The default implementation is exactly the same as CopyOp, but I specialize it for CUDAContext.
Please let me know if I'm doing anything wrong here! This is my first caffe2 diff related to operator definitions.
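A minimal sketch of the new op (devices elided; the point is only that the second input acts as a device hint):

```python
import numpy as np
from caffe2.python import core, workspace

workspace.FeedBlob("X", np.ones(4, dtype=np.float32))
workspace.FeedBlob("dst", np.zeros(1, dtype=np.float32))  # only its device matters

# Y gets the contents of X, placed on the same device as dst. With CUDA,
# X and dst would live on different GPUs and this performs the cross-GPU copy.
op = core.CreateOperator("CopyOnDeviceLike", ["X", "dst"], ["Y"])
workspace.RunOperatorOnce(op)
```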
Reviewed By: Yangqing
Differential Revision: D4557258
fbshipit-source-id: 9494be589cc1e5696bbbfe25b7622aaa4c9efe4a
Summary: As in headline. I had missed these originally.
Reviewed By: kennyhorror
Differential Revision: D4560255
fbshipit-source-id: e69458e8a2574b981e40e915d87c8e16dadee7d6
Summary:
(Caffe2) Modified RecurrentNetworkGradient operator so that training is possible with any of the output blob(s) receiving gradient during the backward pass. This is realized through a new argument for the RecurrentNetwork op, outputs_with_grads, which takes a list of the indices of the output blobs which will receive gradient. The default case (only receiving gradient from the first output blob) remains the default.
A new unit test covers the case where outputs_with_grads = [1, 2], using the Python LSTM wrapper.
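A hedged sketch of the unit-test setup; the exact signature of the wrapper in caffe2.python.recurrent is an assumption here:

```python
from caffe2.python import model_helper, recurrent

model = model_helper.ModelHelper(name="lstm_grads")
# Request gradient flow from output blobs 1 and 2 instead of only output 0.
outputs = recurrent.LSTM(
    model, input_blob="input", seq_lengths="seq_lengths",
    initial_states=("h_init", "c_init"), dim_in=8, dim_out=16,
    scope="lstm", outputs_with_grads=[1, 2],
)
```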
Reviewed By: urikz
Differential Revision: D4518516
fbshipit-source-id: 5c531582b20f3cf727d1aa91239b4d5a2b8a7c1f
Summary:
The existing op transforms the input in a general way: it needs M transform mappings to transform an NxM input tensor.
But for binary predictions X (an Nx2 tensor), we know that X[:, 0] = 1 - X[:, 1].
So we just need one mapping for X[:, 1]; after it is transformed, we can compute X[:, 0].
This diff handles that case.
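The identity being exploited, in NumPy form (the actual mapping applied to X[:, 1] is whatever the op is configured with; sqrt below is just a stand-in):

```python
import numpy as np

X = np.array([[0.8, 0.2], [0.3, 0.7]], dtype=np.float32)  # N x 2 binary predictions
t = np.sqrt(X[:, 1])                # one configured mapping, applied to column 1 only
Y = np.stack([1.0 - t, t], axis=1)  # column 0 recovered as 1 - transformed column 1
```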
Differential Revision: D4550441
fbshipit-source-id: 42d8c6e88d830c97628ee930b543740a32acf904
Summary: This is like `UniformIntFill`, but it is guaranteed to return unique elements in the output, excluding the optional avoid elements.
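A hedged sketch, assuming the op is registered as UniqueUniformFill with an optional `avoid` input (both the name and the argument set are assumptions):

```python
import numpy as np
from caffe2.python import core, workspace

workspace.FeedBlob("avoid", np.array([3, 7], dtype=np.int32))
# 10 unique values drawn from [0, 99], never 3 or 7.
op = core.CreateOperator(
    "UniqueUniformFill", ["avoid"], ["out"],
    min=0, max=99, shape=[10], dtype=core.DataType.INT32,
)
workspace.RunOperatorOnce(op)
```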
Reviewed By: xianjiec
Differential Revision: D4511814
fbshipit-source-id: 5dc98ee580616e60e46ee74ebb3f5ddd29a09965
Summary: These operators update the state of the instance and therefore should have the instance in the output list.
Reviewed By: xianjiec
Differential Revision: D4554773
fbshipit-source-id: 556d484fcf58878308aa6b0f7cd7ea2446d3f29e
Summary:
Shape inference allows Caffe2 to compute shapes of blobs without running a model. Update InferShapesAndTypes() to accept an optional blob:dimensions map so that external input blobs do not need to be part of the workspace.
InferShapesAndTypes() in workspace.py conditionally calls the ...from_workspace or ...from_map bindings. Note that I favored a small amount of code duplication here for the sake of readability. InferShapesAndTypes() in operator.cc has been refactored into mirrored entry points invoking a common helper.
Other minor changes to address linter warnings.
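A small sketch of the new entry point (keyword name `blob_dimensions` taken from the description; treat the exact Python signature as an assumption):

```python
from caffe2.python import core, workspace

net = core.Net("example")
net.FC(["X", "W", "b"], "Y")

# External inputs no longer need to exist in the workspace; pass their dims directly.
shapes, types = workspace.InferShapesAndTypes(
    [net], blob_dimensions={"X": [64, 128], "W": [32, 128], "b": [32]},
)
print(shapes["Y"])  # [64, 32]
```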
Reviewed By: dzhulgakov
Differential Revision: D4524873
fbshipit-source-id: 56f863b759c016d7f23523f06fda3aa5bba22357
Summary:
This is a bit of a large diff, sorry about that. It includes basic shape and type inference functionality, based on YQ's Schema scaffolding. I added some helper functions to make it easier to write simple translations.
A bigger refactoring was needed for ConvPoolBase so that we could use the shape inference already there in the schema.
I annotated enough operators to be able to infer forward-pass shapes for a basic convnet, and added a test for that. I intend to bootcamp some annotations and annotate enough to handle ResNets fully. Need to think about whether gradients could be annotated in an easier way.
Only shapes are exposed to Python for now; types will follow later. Also, the inference is not yet called anywhere but the unit test.
Also, I am not sure if everything is in the best location in the code, but it shouldn't be hard to move stuff around.
Reviewed By: dzhulgakov
Differential Revision: D4436818
fbshipit-source-id: eebee5937ccc9ac09c245465302388a1fae6933c
Summary: This allows saving the previous value of the counter and sending it upstream without losing counts.
Reviewed By: kennyhorror
Differential Revision: D4497854
fbshipit-source-id: 28a7ad0ff1020bde26f78b1f59614b094d1e1881
Summary:
I had forgotten to remove this one. The rest of the indexing
instead of string names is coming after D4446813 lands, as scratches
aren't inputs or outputs and thus can't be indexed.
Reviewed By: urikz
Differential Revision: D4465748
fbshipit-source-id: 2ccbedfb35541ef4a2231d1480eef59025bd5290
Summary: TestWarden was failing on some inputs.
Reviewed By: Yangqing
Differential Revision: D4487293
fbshipit-source-id: 3da4b310a619c2b57f033b2dd7727f71403bfd68