pytorch/caffe2/python
Pieter Noordhuis 9e6fd02c28 Use Gloo ops in data_parallel_model
Summary:
No longer need GPU to CPU copies. The allreduce operator no longer
uses 'local allreduce - global allreduce - local broadcast' sequence
when Gloo is used, but passes all input blobs directly.

Depends on D4708860.

Differential Revision: D4709897

fbshipit-source-id: 4d745d5d8bac9c2fcca081dd5d812c902808c3b6
2017-03-14 22:34:51 -07:00
..
docs Documenation generation to wiki 2017-02-15 16:00:44 -08:00
examples Adding UNK to vocab | Changing default params 2017-03-13 22:17:48 -07:00
layers Allow scalar output in functional layer 2017-03-14 15:32:47 -07:00
mint
models Added model downloader 2017-02-22 12:47:15 -08:00
operator_test Allow test discovery in caffe2/python/ 2017-03-14 18:16:41 -07:00
_import_c_extension.py
attention.py Implement recurrent attention in C2 2017-03-08 11:21:28 -08:00
caffe_translator.py translator fix to solve Aaron's issue 2017-02-13 11:19:13 -08:00
caffe_translator_test.py Allow test discovery in caffe2/python/ 2017-03-14 18:16:41 -07:00
checkpoint.py Fix issues pickling jobs 2017-02-21 20:47:27 -08:00
checkpoint_test.py Fix issues pickling jobs 2017-02-21 20:47:27 -08:00
CMakeLists.txt
cnn.py Quantized Training API 2017-03-13 22:17:58 -07:00
context.py Make ContextManager thread-safe 2017-02-13 19:45:35 -08:00
context_test.py Make ContextManager thread-safe 2017-02-13 19:45:35 -08:00
control.py
control_test.py
convnet_benchmarks.py Convnet benchmark cudnn_ws 2017-03-02 15:32:37 -08:00
convnet_benchmarks_test.py
core.py fixes to make data parallel model work for RecurrentNet + test case 2017-03-14 15:48:07 -07:00
core_gradients_test.py add inference for gradient ops + a couple of missing shape inference functions + fix to scalars 2017-02-28 23:33:32 -08:00
core_test.py NextScopedBlob with well-defined behavior and respect namescope 2017-02-16 17:16:36 -08:00
data_parallel_model.py Use Gloo ops in data_parallel_model 2017-03-14 22:34:51 -07:00
data_parallel_model_test.py fixes to make data parallel model work for RecurrentNet + test case 2017-03-14 15:48:07 -07:00
data_workers.py Remove use of logging module and np.random.randint() due to deadlocks with forks 2017-03-01 03:32:56 -08:00
data_workers_test.py close blobs queues when stopping + test 2017-02-27 10:07:57 -08:00
dataio.py Stop multi_reader if we run out of data before max_examples 2017-03-10 18:03:57 -08:00
dataio_test.py Stop multi_reader if we run out of data before max_examples 2017-03-10 18:03:57 -08:00
dataset.py
db_test.py
device_checker.py
dyndep.py
experiment_util.py XRay mobile quantized model 2017-03-14 22:18:40 -07:00
extension_loader.py
gradient_check_test.py
gradient_checker.py
hsm_util.py
hypothesis_test.py add AccumulateHistogramOp 2017-03-08 19:37:32 -08:00
hypothesis_test_util.py Allow use of ReversePackedSegs operator in CUDA context 2017-03-09 15:03:55 -08:00
introspect_vis.py User input (Conv out, etc.) 2017-03-08 13:49:45 -08:00
layer_model_helper.py Use new metric intefaces in trainer workflows. 2017-03-07 12:46:52 -08:00
layer_model_instantiator.py Migrate realtime training workflows to use new metrics. 2017-03-08 23:49:41 -08:00
layers_test.py Allow scalar output in functional layer 2017-03-14 15:32:47 -07:00
load_save_test.py Improve error message from LogFileDB on missing file 2017-03-10 23:31:28 -08:00
lstm_benchmark.py LSTM benchmark (Caffe2 RNN based) 2017-02-28 23:17:26 -08:00
memonger.py Fixes to topological sort, canonical blob naming, sharing final blob 2017-01-25 15:14:26 -08:00
memonger_test.py
mkl_test_util.py
model_device_test.py Comment out NHWC Alexnet test for now 2017-01-23 13:59:29 -08:00
model_helper.py fixes to make data parallel model work for RecurrentNet + test case 2017-03-14 15:48:07 -07:00
mpi_python.cc
muji.py
muji_test.py
net_builder.py Improve "reporter net" design 2017-02-21 20:17:40 -08:00
net_builder_test.py Allow test discovery in caffe2/python/ 2017-03-14 18:16:41 -07:00
net_drawer.py Add model graph to dper_example 2017-02-07 13:03:54 -08:00
net_printer.py Add task outputs and stop signals to net_printer 2017-03-07 01:21:40 -08:00
net_printer_test.py Debug/Analysis tools for Jobs/ExecutionSteps 2017-02-06 17:31:20 -08:00
optimizer.py refactor and modulize optimizers 2017-03-07 18:46:47 -08:00
optimizer_test.py Allow test discovery in caffe2/python/ 2017-03-14 18:16:41 -07:00
optimizer_test_util.py Allow test discovery in caffe2/python/ 2017-03-14 18:16:41 -07:00
pipeline.py Better names for nets, steps and tasks 2017-02-09 16:33:54 -08:00
pybind_state.cc Make ModelExporter.load_from_db() load to specific workspace 2017-03-08 09:31:42 -08:00
pybind_state.h
pybind_state_gpu.cc Cudnn v6 2017-02-28 17:46:33 -08:00
pybind_state_mkl.cc
python_op_test.py
queue_util.py Better names for nets, steps and tasks 2017-02-09 16:33:54 -08:00
record_queue.py
recurrent.py fixes to make data parallel model work for RecurrentNet + test case 2017-03-14 15:48:07 -07:00
schema.py Enable use of Print for LayerModelHelper 2017-03-10 15:26:16 -08:00
schema_test.py schema.Struct.__add__ 2017-02-06 13:47:58 -08:00
scope.py
scope_test.py
session.py Default LocalSession to current workspace. 2017-03-01 16:03:18 -08:00
session_test.py NextScopedBlob with well-defined behavior and respect namescope 2017-02-16 17:16:36 -08:00
sparse_to_dense_mask_test.py
task.py Gather perf counters for distributed jobs 2017-02-21 22:06:25 -08:00
test_util.py
text_file_reader.py fix typo in TextFileReader 2017-02-21 14:02:48 -08:00
timeout_guard.py Euthanize a process with timeout 2017-03-01 11:38:11 -08:00
toy_regression_test.py
tt_core.py
tt_core_test.py
utils.py Add a create your own dataset tutorial 2017-02-22 03:31:47 -08:00
visualize.py
workspace.py backup functions for non-cuda cases 2017-02-28 22:07:54 -08:00
workspace_test.py