14 commits

Author SHA1 Message Date
Tianlei Wu
268d2283c0
Export GPT-2 ONNX model without position_ids and attention_mask inputs (#4852)
* Export GPT-2 ONNX model without position_ids and attention_mask inputs
* allow benchmark_gpt2 on user's model
* refactor: get_dummy_inputs returns a data class.
2020-08-24 13:05:25 -07:00
Tianlei Wu
316d1a9e69
Update benchmark for large models or model names with non-alphanumeric characters. (#4743)
* Export model > 2GB using external data format
2020-08-10 12:58:01 -07:00
Tianlei Wu
3588c5b545
Add GPT-2 test generation to convert_to_onnx.py (#4670)
* add gpt2 tester
* add an option to include output latency.
2020-07-30 21:03:53 -07:00
Tianlei Wu
ace41b8064
Force return_tuple=True to handle transformers breaking change of output format. (#4599)
2020-07-23 11:35:41 -07:00
Tianlei Wu
b42fe49f56
Add a script to convert gpt2 to onnx (#4541)
* add a script convert_to_onnx.py and refactor benchmark
2020-07-17 23:14:42 -07:00
Tianlei Wu
25885cf7d0
Add option --torchscript in benchmark_gpt2.py (#4500)
* support TorchScript
* change onnx filename format
* change output name prediction_scores to logits
2020-07-14 11:53:23 -07:00
Yufeng Li
3d4ac85124
Add quantization benchmark for transformer based model (#4482)
* add support of quantization benchmark
2020-07-13 15:46:23 -07:00
Tianlei Wu
55f25a4bbf
Update Attention op to support attention mask for GPT-2 (#4330)
* Support two more formats of mask_index input: a 2D attention mask, or a 1D mask index with end and start positions.
* Update dynamic axes of gpt2 with past state
* Update script to fuse model with attention mask
2020-06-29 23:26:23 -07:00
Tianlei Wu
466511c1c3
Update gpt2 benchmark with position_ids and fp16 (#4275)
* support position_ids input
* support fp16 conversion for gpt2 past state
* output results to csv file
* Remove the useless check that output of matmul is in cuda
2020-06-19 14:01:37 -07:00
Cecilia Liu
0b5bbb16b8
Benchmark With IO Binding (#4206)
* add io binding to benchmark.py
2020-06-15 10:06:33 -07:00
Tianlei Wu
2605faef88
Add past state support in Attention Op for GPT-2 (#4107)
* Update Attention op to allow past state input and output.
* Add fusion script and tests
2020-06-11 14:19:55 -07:00
Cecilia Liu
8813d205cc
Update GPT2 Model Benchmark Script to Support IO Binding (#4088)
* GPT2 benchmark supports IO binding
2020-06-01 15:07:48 -07:00
Tianlei Wu
60fa4b1f90
Update benchmark of gpt2 model with past state (#4043)
* update benchmark_gpt2 to use past state only
* update dynamic axes of input/output tensors
* Remove --use_openmp option since it is default for onnxruntime 1.3 cpu.
* Use same option names as benchmark.py
2020-05-28 13:55:43 -07:00
Tianlei Wu
782c6c24b2
Rename bert to transformers (#3946)
* rename folder bert to transformers
* rename bert_model_optimization.py to optimizer.py
* update URL links in notebooks
2020-05-14 15:32:59 -07:00
Renamed from onnxruntime/python/tools/bert/benchmark_gpt2.py