pytorch/caffe2/python/examples
Yury Zemlyanskiy 0c47d345df Multi-gpu training for OSS seq2seq
Summary:
Use data_parallel_model for seq2seq multi-gpu training. The main reason for complexity here is that GatherOp hasn't yet been implemented on GPU.

This diff also adds better cliping procedure - clip by global norm rather than by absolute value.

Differential Revision: D4778691

fbshipit-source-id: bff184dae02ecc227413fef51f48a4726e5d3825
2017-03-27 17:32:39 -07:00
..
char_rnn.py ] Char RNN bug fix for batching 2017-02-22 17:34:30 -08:00
lmdb_create_example.py LMDB example 2016-12-05 11:53:26 -08:00
resnet50_trainer.py Update resnet50 example 2017-03-15 08:18:50 -07:00
seq2seq.py Multi-gpu training for OSS seq2seq 2017-03-27 17:32:39 -07:00
seq2seq_util.py Multi-gpu training for OSS seq2seq 2017-03-27 17:32:39 -07:00