pytorch/caffe2/python
James Cross f27c9eea20 dropout for C2 multilayer
Summary:
Incorporate arbitrary dropout for encoder and decoder layers for Caffe2 NMT models using current configuration. This involves separate output processing (_prepare_output() and _prepare_output_sequence()) for the final layer in a MultiRNNCell.

Switching to using the newly introduced forward_only switch for RNN cells revealed an unrelated bug in our NetGradientChecker test, which urikz is investigating.

Reviewed By: salexspb

Differential Revision: D5031964

fbshipit-source-id: 19b49607d551aa3e2140041ef4e585f128c8f178
2017-05-17 11:32:47 -07:00
..
docs fixed document generator for github 2017-05-15 11:40:46 -07:00
examples rnn with brew 2017-05-16 13:33:44 -07:00
helpers rnn with brew 2017-05-16 13:33:44 -07:00
layers Support un-equal weight training for mtml models 2017-05-15 12:56:11 -07:00
mint
mkl MKL related files with review comments incorporated 2017-04-25 00:31:29 -07:00
models new resnet building with brew 2017-05-15 22:47:24 -07:00
operator_test fix mkl_sparse and migrate sparsity experiments 2017-05-16 18:33:51 -07:00
predictor Re-apply #266 2017-04-25 21:17:04 -07:00
_import_c_extension.py
attention.py rnn with brew 2017-05-16 13:33:44 -07:00
brew.py arg scope in ModelHelper 2017-05-12 11:18:59 -07:00
brew_test.py arg scope in ModelHelper 2017-05-12 11:18:59 -07:00
caffe_translator.py adding 3d operator translators 2017-05-11 23:01:44 -07:00
caffe_translator_test.py
checkpoint.py
checkpoint_test.py
CMakeLists.txt
cnn.py arg scope in ModelHelper 2017-05-12 11:18:59 -07:00
context.py
context_test.py
control.py
control_test.py
convnet_benchmarks.py
convnet_benchmarks_test.py
core.py Relax requirement on token uniqueness 2017-05-09 19:36:00 -07:00
core_gradients_test.py
core_test.py fix enternalinputs handling in AppendNet v2 2017-05-05 21:50:57 -07:00
crf.py Integrate CRF in DeepText + New caffe2 operator for viterbi decode 2017-05-01 20:39:41 -07:00
data_parallel_model.py Error status for Gloo ops, and handling in elastic dpm 2017-05-17 00:16:52 -07:00
data_parallel_model_test.py Initial implementation of Blockwise Model Update Filtering (BMUF) 2017-05-15 18:18:15 -07:00
data_workers.py Add direct enqueuing to enable RNN input, allow specify batch columns 2017-05-03 14:49:44 -07:00
data_workers_test.py Add direct enqueuing to enable RNN input, allow specify batch columns 2017-05-03 14:49:44 -07:00
dataio.py
dataio_test.py
dataset.py
db_test.py
device_checker.py
dyndep.py
empty.so Adding video data layer for caffe2 2017-05-05 14:16:38 -07:00
experiment_util.py Port Xray OC workflow to elastic_data_parallel_model 2017-05-01 00:32:47 -07:00
extension_loader.py
gradient_check_test.py
gradient_checker.py MultiRNNCell (Caffe2) 2017-05-03 10:02:31 -07:00
hsm_util.py
hypothesis_test.py cuda unique op 2017-05-11 21:08:10 -07:00
hypothesis_test_util.py Add option to control the size of lengths tensor 2017-04-20 09:53:22 -07:00
layer_model_helper.py ttsn model fails to set optimizer for FC layer 2017-05-16 11:26:02 -07:00
layer_model_instantiator.py JoinContext 2017-05-02 17:32:26 -07:00
layer_test_util.py Re-apply #266 2017-04-25 21:17:04 -07:00
layers_test.py LastNWindowCollector 2017-05-04 17:32:09 -07:00
load_save_test.py Allow Load operator to load into overriden names 2017-04-27 01:18:12 -07:00
lstm_benchmark.py lstm_benchmark use rnn_cell.LSTM multicell + assertion 2017-05-04 17:02:32 -07:00
memonger.py share forward activation blobs + pass unused free blobs down all branches + use shape infernece 2017-04-25 14:23:25 -07:00
memonger_test.py share forward activation blobs + pass unused free blobs down all branches + use shape infernece 2017-04-25 14:23:25 -07:00
mkl_test_util.py
model_device_test.py
model_helper.py arg scope in ModelHelper 2017-05-12 11:18:59 -07:00
mpi_python.cc Fix pybind11 module name for MPI helpers 2017-05-02 23:18:50 -07:00
muji.py
muji_test.py
net_builder.py
net_builder_test.py
net_drawer.py net_drawer: --input is required 2017-05-04 11:45:57 -07:00
net_printer.py
net_printer_test.py
optimizer.py Refactor Optimizer to Allow scale_learning_rate 2017-05-09 13:16:21 -07:00
optimizer_test.py Returns auxiliary parameters in the optimizers. 2017-04-17 10:16:32 -07:00
optimizer_test_util.py create_net: explicitly specify if one wants to overwrite the network. 2017-04-17 21:46:53 -07:00
pipeline.py
predictor_constants.py Re-apply #266 2017-04-25 21:17:04 -07:00
pybind_state.cc Error status for Gloo ops, and handling in elastic dpm 2017-05-17 00:16:52 -07:00
pybind_state.h
pybind_state_gpu.cc
pybind_state_mkl.cc
python_op_test.py
queue_util.py
record_queue.py
recurrent.py fix recompute_blobs_on_backward 2017-05-05 14:01:34 -07:00
rnn_cell.py dropout for C2 multilayer 2017-05-17 11:32:47 -07:00
schema.py Prevent false overwriting of a field 2017-05-02 23:18:49 -07:00
schema_test.py Prevent false overwriting of a field 2017-05-02 23:18:49 -07:00
scope.py Fix corruption of NameScope when exception is thrown 2017-04-24 22:46:27 -07:00
scope_test.py Fix corruption of NameScope when exception is thrown 2017-04-24 22:46:27 -07:00
session.py
session_test.py Warn on setting blob on Scalar 2017-05-01 20:18:30 -07:00
sparse_to_dense_mask_test.py Caffe2: SparseToDenseMask: return key presence 2017-05-16 20:22:03 -07:00
task.py Remove application-specific comment. 2017-05-16 12:17:03 -07:00
test_util.py
text_file_reader.py
timeout_guard.py
toy_regression_test.py
tt_core.py
tt_core_test.py
utils.py Free scratch blobs when data workers exits, add utility function to reset blobs 2017-04-26 13:40:13 -07:00
visualize.py
workspace.py Error status for Gloo ops, and handling in elastic dpm 2017-05-17 00:16:52 -07:00
workspace_test.py create_net: explicitly specify if one wants to overwrite the network. 2017-04-17 21:46:53 -07:00