pytorch

mirror of https://github.com/saymrwulf/pytorch.git synced 2026-05-15 21:00:47 +00:00

Author	SHA1	Message	Date
Jacqueline Xu	8cedf35d55	Adding Random Fourier Features to SparseNN Model and Flow Summary: - Integrated RFF into the preprocessing workflow for dense features - Developed Flow interface to input RFF parameters - Created unit test for using RFF with sparseNN Reviewed By: chocjy Differential Revision: D5367534 fbshipit-source-id: 07307259c501a614d9ee68a731f0cc8ecd17db68	2017-07-07 09:39:32 -07:00
Jacqueline Xu	25bd5dda27	Implementing random fourier features layer Summary: - Created the random fourier features layer - Generated a unit test to test the random fourier features layer is built correctly - Inspired by the paper [[ https://people.eecs.berkeley.edu/~brecht/papers/07.rah.rec.nips.pdf \| Random Features for Large-Scale Kernel Machines]] Reviewed By: chocjy Differential Revision: D5318105 fbshipit-source-id: c3885cb5ad1358853d4fc13c780fec3141609176	2017-07-04 23:48:42 -07:00
Thomas Dudziak	5355634dac	Dict fixes/improvements and unittest targets for Python 3 in caffe2 core Summary: As title Reviewed By: salexspb Differential Revision: D5316104 fbshipit-source-id: aee43819d817842e5ce6ba3d045a55b1a2491c30	2017-06-29 17:05:41 -07:00
Jiyan Yang	8260002941	Partial eval layers Summary: In some cases we don't want to compute the full FC during eval. These layers allow us to compute dot product between X and W[idx,:] where idx is an input, e.g., label. Reviewed By: kittipatv Differential Revision: D5305364 fbshipit-source-id: 0b6a1b61cc8fcb26c8def8bcd037a4a35d223078	2017-06-28 00:36:40 -07:00
Yiming Wu	a1fcbb8be1	offline_all_gpu_experiment Summary: similar to sparse_nn all gpu, this is our first step towards offline full gpu experiment. Compare Run cat(128, 32)512-512 : GPU 21138598 https://fburl.com/jpeod1pi CPU 21138787 https://fburl.com/vma7225l Reviewed By: dzhulgakov Differential Revision: D5308789 fbshipit-source-id: 413819bf9c5fff125d6967ed48faa5c7b3d6fa85	2017-06-27 23:09:54 -07:00
Yiming Wu	1fce3eac4e	single trainer hybrid device Summary: First try of single trainer hybrid device training for sparsenn Comparison results with CPU training: https://our.intern.facebook.com/intern/fblearner/run/compare/?compare_to[0]=20016969&compare_to[1]=19660293&baseline_run=19660293&all_runs[0]=20016969&all_runs[1]=19660293 Reviewed By: dzhulgakov Differential Revision: D5205723 fbshipit-source-id: 4a024324ac2efc3248dd470d4c533cf2ecec2e92	2017-06-27 22:06:30 -07:00
Yan Shang	cf4ac83a91	Make List.__getitem__() works with output of List.field_names() Summary: As described in T19378176 by kittipatv, in this diff, we fix the issue of __getitem__() of schema.List. For example, given Map(int32, float) (Map is a special List), field_names() will return "lengths", "values:keys", & "values:values". "values:keys" and "values:values" are not accessible via __getitem__(). __getitem__() bypasses the values prefix and directly access the fields in the map. Other APIs (e.g., _SchemaNode & dataset_ops) expect "values:keys" and "values:values" as it simplifies traversal logic. Therefore, we should keep field_names() as is and fix __getitem__(). Reviewed By: kittipatv Differential Revision: D5251657 fbshipit-source-id: 1acfb8d6e53e286eb866cf5ddab01d2dce97e1d2	2017-06-21 14:06:05 -07:00
Tao Wu	4be5337cca	add support for weight in batch_softmax_loss Summary: weighted batch_softmax_loss when weight exists in input_record Reviewed By: kittipatv Differential Revision: D5291646 fbshipit-source-id: f1bcd386ad1fc0e95e0a0315ec1c36531c792495	2017-06-21 10:32:15 -07:00
Bokai Cao	d9087edb07	add rekey in feature_processor Differential Revision: D5270972 fbshipit-source-id: 8805c0e947f4752d2c575e2a7b8986cd804601dc	2017-06-20 23:19:09 -07:00
Jacqueline Xu	5957218cf0	Adding Dropout Layer to SparseNN Model and Flow Summary: - Incorporated dropout layer to the sparseNN training and testing pipeline - Integrated an advanced model options feature on Flow UI for users to specify dropout rate - Created an end-to-end unit test to build and run a model with dropout Reviewed By: chocjy Differential Revision: D5273478 fbshipit-source-id: f7ae7bf4de1172b6e320f5933eaaebca3fd8749e	2017-06-20 15:46:55 -07:00
Bokai Cao	d2b1cb22a4	rekey layer Differential Revision: D5210095 fbshipit-source-id: dc66a10d95842e0f10cb53a5afb7ddcc3fcac0de	2017-06-19 18:47:28 -07:00
Jacqueline Xu	6150d9bef2	Building dropout as layer Summary: Dropout layer and unittest for DPer2 Reviewed By: chocjy Differential Revision: D5254866 fbshipit-source-id: 5eaea81808ddf8e0c7a7d76209ea44cda2ee28aa	2017-06-19 14:46:52 -07:00
haracejacob	2ec294a8bb	Fix a few typos and grammars in comment Summary: Fix a few typos and grammars in comment by using language-check, python library spell_checker source code is here : https://github.com/17-1-SKKU-OSS/011A/blob/master/spell_checker/spell_checker.py here is the text file which indicates what things should be fixed : https://github.com/17-1-SKKU-OSS/011A/tree/master/spell_checker/fix/caffe2 Closes https://github.com/caffe2/caffe2/pull/719 Differential Revision: D5165118 Pulled By: aaronmarkham fbshipit-source-id: 7fb8ef7a99d03cd5fd2f9ebdb01b9865e90fc37b	2017-06-14 18:22:39 -07:00
Xianjie Chen	795e7e64e8	add truncation for sparse feature Summary: truncate id list using the max length computed in compute meta, so that it has determined length, which is useful for position weighted pooling method. Reviewed By: sunwael Differential Revision: D5233739 fbshipit-source-id: f73deec1bb50144ba14c4f8cfa545e1ced5071ce	2017-06-13 10:46:53 -07:00
Jiyan Yang	c822e89956	Rename SparseToDense layer Summary: The SparseToDense layer is essentially calling the SparseToDenseMask op. This makes it impossible to call the functional layer with the true SparseToDense op. This diff is to rename the layer. Please let me know if I missed anything or you have a better name suggestion. Differential Revision: D5169353 fbshipit-source-id: 724d3c6dba81448a6db054f044176ffc7f708bdb	2017-06-09 12:48:27 -07:00
Wael Abdelghani	c291c97494	Add integration test for pos_w Summary: Title Reviewed By: azzolini Differential Revision: D5197307 fbshipit-source-id: 425bf8e7c5068ea544e5b2709b6bb27eef140bf3	2017-06-08 18:04:53 -07:00
Fedor Borisyuk	686470a6b8	Feature importance in dper 2.0: build network representation Summary: Changes to enable feature importance. Reviewed By: kennyhorror Differential Revision: D5075252 fbshipit-source-id: e5d46e129bcd5cbef77932c63b5a288dd57775d1	2017-06-05 18:03:34 -07:00
Wael Abdelghani	ebecafbcca	Support for position weighted in distributed PS Summary: Title Reviewed By: azzolini Differential Revision: D5081871 fbshipit-source-id: 68a97c2112522fbcbcdfd9e0f717b8bce60fe028	2017-06-05 17:04:42 -07:00
Wael Abdelghani	5447f5c0d7	Move position weighted to separate layer Reviewed By: kennyhorror Differential Revision: D5063086 fbshipit-source-id: 212c08946728437bcc8b6049438ae82235137ec6	2017-06-05 15:49:22 -07:00
Tao Wu	3bd6195891	removed Sum from simple_operator_layers.py; passed unit tests Summary: removed softmax, sigmoid, tanh, relu from simple_operator_layers.py; passed all unit tests Reviewed By: kittipatv Differential Revision: D5150271 fbshipit-source-id: abe611bf6c5de5caba189181e9e41d705d8c5c54	2017-06-02 15:03:16 -07:00
Jiyan Yang	6aff754dbc	Add batch normalization layer Summary: As desc. Reviewed By: xianjiec Differential Revision: D5077230 fbshipit-source-id: f73cdedac6d9a3542f8ef829b54fb4c713dcafd0	2017-05-26 16:46:52 -07:00
Andrey Malevich	6c12df3003	Fix export of SparseToDense layer. Summary: If there're 2 SparseToDense layers that are densifying same IdList feature it'll result in the situation, where we might export invalid input for the prediction in input specs. This diff is changing the behavior to support to use Alias to a new blob instead of passing things directly. Reviewed By: dzhulgakov Differential Revision: D5093754 fbshipit-source-id: ef4fa4ac3722331d6e72716bd0c6363b3a629cf7	2017-05-25 21:46:28 -07:00
Jiyan Yang	9bf1f16255	Add bias to cosine distance for two tower models Summary: Currently using two tower models with cosine distance results in bad calibration. Adding bias to the output of cosine term solves the problem. Reviewed By: xianjiec Differential Revision: D5132606 fbshipit-source-id: eb4fa75acf908db89954eeee67627b4a00572f61	2017-05-25 19:50:20 -07:00
Huazhong Ning	e394b60a9c	Support un-equal weight training for mtml models Reviewed By: queqichao Differential Revision: D5047939 fbshipit-source-id: 857d0d77e0413939e5774fa37d21b92a00d34bf0	2017-05-15 12:56:11 -07:00
Huazhong Ning	942f53b5a6	gradient impact of task layers on shared is configurable Reviewed By: chocjy Differential Revision: D4943948 fbshipit-source-id: 2e26dfb30c6893b60985f693a823646ed3d3e0e3	2017-05-11 20:34:04 -07:00
Xiaolong Wang	11bcdbc3f0	Load Parameters from Model Summary: In Dper utility, add a function `load_parameters_from_model_init_options` to allow init parameters from pretrained models Reviewed By: xianjiec Differential Revision: D4926075 fbshipit-source-id: 5ab563140b5b072c9ed076bbba1aca43e71c6ac5	2017-05-10 10:33:04 -07:00
Xianjie Chen	8a7f00d61b	fix mean pooling Summary: Segment based Ops requires increasing seg id, and without gap. Lengths based Ops does not have this requirements. Otherpooling methods, e.g., LogExpMean does not have Lengths based Ops available yet. Differential Revision: D5019165 fbshipit-source-id: ab01a220e10d4ed9fa2162939579d346607f905e	2017-05-08 01:09:07 -07:00
Kittipat Virochsiri	211eae127c	LastNWindowCollector Summary: Layer for LastNWindowCollector op. We need this since it's an in-place operator. Reviewed By: chocjy Differential Revision: D4981772 fbshipit-source-id: ec85dbf247d0944db422ad396771fa9308650883	2017-05-04 17:32:09 -07:00
Kittipat Virochsiri	22d4eaeb9e	JoinContext Summary: Layer to allow model to follow different paths for each instantiation context and join later. Together with tagging system cleanup (this is a separate issue), this should reduce the need to write a layer to differentiate between context. Re: tagging system clean up, we should make exclusion more explicit: EXCLUDE_FROM_<CONTEXT>. This would simplify instation code. TRAIN_ONLY should become a set of all EXCLUDE_FROM_*, except EXCLUDE_FROM_TRAIN. Reviewed By: kennyhorror Differential Revision: D4964949 fbshipit-source-id: ba6453b0deb92d1989404efb9d86e1ed25297202	2017-05-02 17:32:26 -07:00
Chonglin Sun	e8e93066e7	add workflow for user complicated embedding Summary: Correctly propagate request_only tag to all layer. Reviewed By: kennyhorror Differential Revision: D4751496 fbshipit-source-id: e65fd8cfe56d2989213d44e684a528ede691d316	2017-05-02 10:46:52 -07:00
Jiyan Yang	a458aa4b2a	Fix tags to be based on EXCLUDE_FROM_{CONTEXT} Summary: Cleaning up the tagging system. Introducing tags EXCLUDE_FROM_{CONTEXT}. Reviewed By: kennyhorror Differential Revision: D4974842 fbshipit-source-id: b0fa6772299bb70afa2192c39e45191c9f41336a	2017-05-02 09:32:27 -07:00
Kittipat Virochsiri	ffc6bad116	Concat axis=0 Summary: Previously, the code below would go out of bound. Reviewed By: xianjiec Differential Revision: D4968037 fbshipit-source-id: 3760e2cddc919c45d85ac644ac3fabf72dbaf666	2017-05-01 12:19:34 -07:00
Jiyan Yang	795dc1c326	Remove loss ops from eval net Summary: Current eval nets contain loss operators; see example: https://fburl.com/6otbe0n7, which is unnecessary. This diff is to remove them from the eval net. Differential Revision: D4934589 fbshipit-source-id: 1ba96c20a3a7ef720414acb4124002fb54cabfc7	2017-04-26 12:46:25 -07:00
Yangqing Jia	deb1327b6e	Re-apply #266 Summary: Closes https://github.com/caffe2/caffe2/pull/404 Differential Revision: D4943280 Pulled By: Yangqing fbshipit-source-id: c0988598d8ccb8329feac88382686324b90d4d46	2017-04-25 21:17:04 -07:00
Jiyan Yang	ef2701a57e	MapToRange layer Summary: A layer that takes raw ids as inputs and outputs the indices which can be used as labels. The mapping will be stored with the model. Reviewed By: kittipatv Differential Revision: D4902556 fbshipit-source-id: 647db47b0362142cdba997effa2ef7a5294c84ee	2017-04-25 16:03:58 -07:00
Yangqing Jia	a48062b1a2	temporarily fix sync script bugs changes by reverting partially https://github.com/caffe2/caffe2/pull/266/files	2017-04-24 15:49:22 -07:00
Huazhong Ning	f950a1b70f	create bucket-based calibration - model manipulation Summary: added a new context to layers.py Reviewed By: kennyhorror Differential Revision: D4817124 fbshipit-source-id: 36f08964b86092e81df24c1b9d4b167293a7ffb8	2017-04-18 22:01:23 -07:00
Huazhong Ning	ad6b53e401	allow to specify output dtypes for functional layers Summary: Currently, the functional layer infers the output types and shapes by running the operator once. But in cases where special input data are needed to run the operator, the inferrence may fail. This diff allows the caller to manually specify the output types and shapes if the auto infererence may fail. Reviewed By: kennyhorror Differential Revision: D4864003 fbshipit-source-id: ba242586ea384f76d745b29a450497135717bdcc	2017-04-18 16:34:52 -07:00
Kittipat Virochsiri	0a726af42e	Coerce input of FunctionalLayer to record Summary: Having to pack the input to schema doesn't make much sense since the structure is not recognized by operators anyway. Differential Revision: D4895686 fbshipit-source-id: df78884ed331f7bd0c69db4f86c682c52829ec76	2017-04-17 19:26:06 -07:00
Aaron Markham	b93a7b134a	doxygen configs and updated python files to inc. doxygen tags (#266 ) * updated ubuntu instructions * updated ubuntu notes and troubleshooting * updated tutorials using local files * added doxygen python blocks for docs generation * doxygen related files for generating docs	2017-04-14 16:30:33 -07:00
Kittipat Virochsiri	baf33161d4	GatherRecord layer Summary: Perform gather on the whole record. This will be used for negative random sampling. Reviewed By: kennyhorror Differential Revision: D4882430 fbshipit-source-id: 19e20f7307064755dc4140afb5ba47a699260289	2017-04-13 15:02:44 -07:00
Huazhong Ning	15c6f637d6	create bucket-based calibration - layer Summary: The basic idea of bucket-based calibration: 1. given a model and a calibration data set 2. apply the model to the calibration data set and sort the prediction scores 3. bucketize the prediction scores 4. for the samples in each bucket, compute the proportion of positive samples 5. build a set of piecewise linear functions that map from the bucket range to the proportion 6. appends an operator of piecewise linear transform to the prediction net that is supposed to calibrate the raw predictions. 7. to support calibration in realtime training, we create a new type of Net -- bucket calibration net. This needs a new Context to add_calibration_ops(), to export and load the new Net. This includes a series of diffs. This diff implements a layer that adds different operators for train/cali/eval for bucket based calibration. Reviewed By: dragonxlwang Differential Revision: D4817119 fbshipit-source-id: 44f8fcad2a94f40f7439cc1ad47e7bae5e17397d	2017-04-11 12:30:26 -07:00
Kittipat Virochsiri	5c32c82a6d	Add option to subtract log odd from sampled trained prediction. Summary: Useful for sampled softmax training Differential Revision: D4782673 fbshipit-source-id: 88195de60070a0bc16f5e06b9aad4dffd0484546	2017-04-03 17:50:58 -07:00
Kittipat Virochsiri	3b4c950862	Add option to use id_score_list_features column Summary: Somehow, feed-non-ranking training data usually have this type of column. Add option to support it. Reviewed By: xianjiec, kennyhorror Differential Revision: D4773960 fbshipit-source-id: 5a7ef4618a070e04f3cd8ddfcbf2b7441c00d92d	2017-04-03 17:03:09 -07:00
Xianjie Chen	9fc56793dd	fix trunk for push and small cleanup Summary: multiple places broken, blocking the push :( - fix the weighted training for ads and feeds - fix the publishing if no exporter model is selected - fix the feeds retrieval evaluation - added the default config for retrieval workflows. plan to use for flow test (in next diff) - clean up not used code - smaller hash size for faster canary test Reviewed By: chocjy Differential Revision: D4817829 fbshipit-source-id: e3d407314268b6487c22b1ee91f158532dda8807	2017-04-02 23:35:49 -07:00
Jiyan Yang	b401cb48fe	Make optimization methods configurable and allow flexible optimization settings Summary: This diff does the followings: 1. Add optimization options to model options in the UI for all workflows. 2. Allow different parameters to use different optimizers (or same optimizer with different settings, eg, learning rate). 3. Remove the default values for the `sparseDedupAggregator` field in the thrift file as the default value for that should just be `None` instead of 'sum'. 4. `fb/dper/layer_models/mlp_sparse.py` is deprecated. 5. Add calibration to two tower workflows. Reviewed By: kittipatv Differential Revision: D4767004 fbshipit-source-id: de92ea63fb0ff33f8581b1693479b723a68cd2d1	2017-04-01 23:02:21 -07:00
Ou Jin	cd4160c894	distributed training for dper2 Summary: Add distributed training to dper2 and keep the dper1 working. * Created a ModelDelegator to wrap ModelHelper and LayerModelHelper to mitigate the difference. * To get the average length for sparse feature, I extracted some information in feature_processor. There should be some better way to do it after we have new compute_meta. * metric right now only runs on the first trainer. * The model is saved correctly for evaluation. But I'm still not sure how to handle the weights for adagrad. Reviewed By: kennyhorror Differential Revision: D4767745 fbshipit-source-id: 0559d264827a7fd9327071e8367d1e84a936bea9	2017-03-30 19:04:50 -07:00
Kittipat Virochsiri	e1d64ea4d5	support multilabel in generic preprocessor Summary: Adding support for multilabel in multiclass workflow. `input_feature_schema` and `trainer_extra_schema` are now a function taking in the preprocessor option and output the schema. This allows dynamic schema definition based on the option. Changing default value will be in the next diff. Reviewed By: xianjiec Differential Revision: D4750064 fbshipit-source-id: 896143f432e963bc1723c0153749efeb39a83bec	2017-03-29 15:20:54 -07:00
Kittipat Virochsiri	3eb3507367	uniform_sampling layer Summary: This layer will be used to sample negative labels for sampled softmax. Differential Revision: D4773444 fbshipit-source-id: 605a979c09d07531293dd9472da9d2fa7439c619	2017-03-29 14:36:12 -07:00
Aaron Markham	58f7f2b441	doxygen python block added Summary: Closes https://github.com/caffe2/caffe2/pull/226 Differential Revision: D4793550 Pulled By: JoelMarcey fbshipit-source-id: cc33e58186304fa8dcac2ee9115dcc271d785b1e	2017-03-29 06:46:16 -07:00

1 2

83 commits