onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-23 22:13:38 +00:00

Author	SHA1	Message	Date
Tianlei Wu	268d2283c0	Export GPT-2 ONNX model without position_ids and attention_mask inputs (#4852) * Export GPT-2 ONNX model without position_ids and attention_mask inputs * allow benchmark_gpt2 on user's model * refactor: get_dummy_inputs returns a data class.	2020-08-24 13:05:25 -07:00
Tianlei Wu	316d1a9e69	Update benchmark for large model or model name with non-alphanumeric. (#4743) * Export model > 2GB using external data format	2020-08-10 12:58:01 -07:00
Tianlei Wu	3588c5b545	Add GPT-2 test generation to convert_to_onnx.py (#4670) * add gpt2 tester * add an option to include output latency.	2020-07-30 21:03:53 -07:00
Tianlei Wu	ace41b8064	Force return_tuple=True to handle transformers breaking change of output format. (#4599)	2020-07-23 11:35:41 -07:00
Tianlei Wu	b42fe49f56	Add a script to convert gpt2 to onnx (#4541) * add a script convert_to_onnx.py and refactor benchmark	2020-07-17 23:14:42 -07:00
Tianlei Wu	25885cf7d0	Add option --torchscript in benchmark_gpt2.py (#4500) * support TorchScript * change onnx filename format * change output name prediction_scores to logits	2020-07-14 11:53:23 -07:00
Yufeng Li	3d4ac85124	Add quantization benchmark for transformer based model (#4482) * add support of quantization benchmark	2020-07-13 15:46:23 -07:00
Tianlei Wu	55f25a4bbf	Update Attention op to support attention mask for GPT-2 (#4330) * Support two more formats of the mask_index input: 2D attention mask, or 1D mask index with end and start positions. * Update dynamic axes of gpt2 with past state * Update script to fuse model with attention mask	2020-06-29 23:26:23 -07:00
Tianlei Wu	466511c1c3	Update gpt2 benchmark with position_ids and fp16 (#4275) * support position_ids input * support fp16 conversion for gpt2 past state * output results to csv file * Remove the useless check that output of matmul is in cuda	2020-06-19 14:01:37 -07:00
Cecilia Liu	0b5bbb16b8	Benchmark With IO Binding (#4206) * add io binding to benchmark.py	2020-06-15 10:06:33 -07:00
Tianlei Wu	2605faef88	Add past state support in Attention Op for GPT-2 (#4107) Update Attention op to allow past state input and output. Add fusion script and tests	2020-06-11 14:19:55 -07:00
Cecilia Liu	8813d205cc	Update GPT2 Model Benchmark Script to Support IO Binding (#4088) GPT2 benchmark support io binding	2020-06-01 15:07:48 -07:00
Tianlei Wu	60fa4b1f90	Update benchmark of gpt2 model with past state (#4043) * update benchmark_gpt2 to use past state only * update dynamic axes of input/output tensors * Remove --use_openmp option since it is default for onnxruntime 1.3 cpu. * Use same option names as benchmark.py	2020-05-28 13:55:43 -07:00
Tianlei Wu	782c6c24b2	Rename bert to transformers (#3946) * rename folder bert to transformers * rename bert_model_optimization.py to optimizer.py * update URL links in notebooks	2020-05-14 15:32:59 -07:00

14 commits