pytorch/caffe2/python
Aapo Kyrola 02937903cc add inference for gradient ops + a couple of missing shape inference functions + fix to scalars
Summary:
A bit too much stuff in one diff, so sorry:

1. Add inference for gradient types by using the fact that x_grad is gradient of x and must be of same shape. This is kind of awkward to use string matching, but in addition I rely on the operator being actually a gradient op.
2. dzhulgakov was write, scalar shape is () and not (1). Sorry, my claim easlier was #fakenews.
3. Added inference functions for MakeTwoClass, MomentumSGDUpdate and Cross entropy ops.

Reviewed By: dzhulgakov

Differential Revision: D4569758

fbshipit-source-id: 0db13f33819777fdddefe21d4b1ebf906fcaf98c
2017-02-28 23:33:32 -08:00
..
docs Documenation generation to wiki 2017-02-15 16:00:44 -08:00
examples ] Char RNN bug fix for batching 2017-02-22 17:34:30 -08:00
layers Half-floats support for the rest of segment ops 2017-02-28 11:19:15 -08:00
mint
models Added model downloader 2017-02-22 12:47:15 -08:00
operator_test add inference for gradient ops + a couple of missing shape inference functions + fix to scalars 2017-02-28 23:33:32 -08:00
_import_c_extension.py
attention.py LSTMWithAttention implementation in Caffe2 2017-02-23 04:08:34 -08:00
caffe_translator.py translator fix to solve Aaron's issue 2017-02-13 11:19:13 -08:00
caffe_translator_test.py
checkpoint.py Fix issues pickling jobs 2017-02-21 20:47:27 -08:00
checkpoint_test.py Fix issues pickling jobs 2017-02-21 20:47:27 -08:00
CMakeLists.txt
cnn.py Do not initialize BN params if init_params is false. 2017-02-27 20:19:03 -08:00
context.py Make ContextManager thread-safe 2017-02-13 19:45:35 -08:00
context_test.py Make ContextManager thread-safe 2017-02-13 19:45:35 -08:00
control.py
control_test.py
convnet_benchmarks.py Use ws_nbytes_limit (called cudnn_ws in args). 2017-02-22 12:19:16 -08:00
convnet_benchmarks_test.py
core.py Added editDistance helper to caffe2 operators 2017-02-28 13:31:56 -08:00
core_gradients_test.py add inference for gradient ops + a couple of missing shape inference functions + fix to scalars 2017-02-28 23:33:32 -08:00
core_test.py NextScopedBlob with well-defined behavior and respect namescope 2017-02-16 17:16:36 -08:00
data_parallel_model.py Use fbcollective 2017-02-07 10:48:33 -08:00
data_parallel_model_test.py lr_scale to loss_scale 2017-02-03 07:44:40 -08:00
data_workers.py Share queue + reduce logging 2017-02-27 19:38:45 -08:00
data_workers_test.py close blobs queues when stopping + test 2017-02-27 10:07:57 -08:00
dataio.py fix typo in TextFileReader 2017-02-21 14:02:48 -08:00
dataio_test.py NextScopedBlob with well-defined behavior and respect namescope 2017-02-16 17:16:36 -08:00
dataset.py
db_test.py
device_checker.py
dyndep.py
experiment_util.py
extension_loader.py
gradient_check_test.py
gradient_checker.py
hsm_util.py
hypothesis_test.py support vectors with different dims in for DotProductOp. 2017-02-23 11:09:07 -08:00
hypothesis_test_util.py LSTMWithAttention implementation in Caffe2 2017-02-23 04:08:34 -08:00
introspect_vis.py Dump data for DocNN visualization 2017-02-09 13:47:07 -08:00
layer_model_helper.py Add a way do describe layers in a more AdHoc manner. 2017-02-27 23:30:39 -08:00
layer_model_instantiator.py Add model graph to dper_example 2017-02-07 13:03:54 -08:00
layers_test.py Add a way do describe layers in a more AdHoc manner. 2017-02-27 23:30:39 -08:00
load_save_test.py Add name_overrides argument to SaveOp 2017-02-16 12:32:51 -08:00
lstm_benchmark.py LSTM benchmark (Caffe2 RNN based) 2017-02-28 23:17:26 -08:00
memonger.py Fixes to topological sort, canonical blob naming, sharing final blob 2017-01-25 15:14:26 -08:00
memonger_test.py
mkl_test_util.py
model_device_test.py Comment out NHWC Alexnet test for now 2017-01-23 13:59:29 -08:00
model_helper.py Added editDistance helper to caffe2 operators 2017-02-28 13:31:56 -08:00
mpi_python.cc
muji.py
muji_test.py
net_builder.py Improve "reporter net" design 2017-02-21 20:17:40 -08:00
net_builder_test.py
net_drawer.py Add model graph to dper_example 2017-02-07 13:03:54 -08:00
net_printer.py Improve "reporter net" design 2017-02-21 20:17:40 -08:00
net_printer_test.py Debug/Analysis tools for Jobs/ExecutionSteps 2017-02-06 17:31:20 -08:00
pipeline.py Better names for nets, steps and tasks 2017-02-09 16:33:54 -08:00
pybind_state.cc Added editDistance helper to caffe2 operators 2017-02-28 13:31:56 -08:00
pybind_state.h
pybind_state_gpu.cc Cudnn v6 2017-02-28 17:46:33 -08:00
pybind_state_mkl.cc
python_op_test.py
queue_util.py Better names for nets, steps and tasks 2017-02-09 16:33:54 -08:00
record_queue.py
recurrent.py LSTMWithAttention implementation in Caffe2 2017-02-23 04:08:34 -08:00
schema.py Add a way do describe layers in a more AdHoc manner. 2017-02-27 23:30:39 -08:00
schema_test.py schema.Struct.__add__ 2017-02-06 13:47:58 -08:00
scope.py
scope_test.py
session.py Fix issues pickling jobs 2017-02-21 20:47:27 -08:00
session_test.py NextScopedBlob with well-defined behavior and respect namescope 2017-02-16 17:16:36 -08:00
sgd.py Move build_ftrl to open source directory 2017-02-16 18:02:15 -08:00
sgd_test.py Move build_ftrl to open source directory 2017-02-16 18:02:15 -08:00
sgd_test_util.py Move tests of build_sgd, build_adagrad, and build_adam to pyton directory 2017-02-14 15:32:14 -08:00
sparse_to_dense_mask_test.py
task.py Gather perf counters for distributed jobs 2017-02-21 22:06:25 -08:00
test_util.py MKL convolution operator 2017-01-23 09:59:30 -08:00
text_file_reader.py fix typo in TextFileReader 2017-02-21 14:02:48 -08:00
timeout_guard.py better killing after timeout, cleanup 2017-02-23 13:16:19 -08:00
toy_regression_test.py
tt_core.py
tt_core_test.py
utils.py Add a create your own dataset tutorial 2017-02-22 03:31:47 -08:00
visualize.py
workspace.py backup functions for non-cuda cases 2017-02-28 22:07:54 -08:00
workspace_test.py