onnxruntime

saymrwulf/onnxruntime

Fork 0

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-01 03:45:06 +00:00

Commit graph

Author	SHA1	Message	Date
Tianlei Wu	61fa5476d5	Update PyTorch Bert notebooks (#4239 ) update PyTorch Bert SquAD notebooks to use onnxruntim-tools and update usage of intra_op_num_threads. rename python files according to coding style Fix change_input_to_int32. update keras notebook to copy script from rel-1.3.0 branch (Will update them later)	2020-06-16 09:36:51 -07:00
Tianlei Wu	2605faef88	Add past state support in Attention Op for GPT-2 (#4107 ) Update Attention op to allow past state input and output. Add fusion script and tests	2020-06-11 14:19:55 -07:00
Tianlei Wu	7f750b65ce	support model > 2GB in transformer optimizer (#4038 ) * Enable optimizer on models with external data (>2GB) * Refactoring optimizer: move fusion to separate file * Update benchmark: (1) output datatime to csv (2) Add option --onnx_dir to benchmark.py for onnx model directory path (3) add gpt2-large (4) loose thrsholds for fp16 validation * update optimizer (1) Add attribute of ConstantOfShape in fp16 conversion (2) Use OnnxRuntime level 1 optimization * update bert_perf_test.py: rename --input_ids to --input_ids_name	2020-05-28 01:16:41 -07:00

Author

SHA1

Message

Date

Tianlei Wu

61fa5476d5

Update PyTorch Bert notebooks (#4239 )

update PyTorch Bert SquAD notebooks to use onnxruntim-tools and update usage of intra_op_num_threads.
rename python files according to coding style
Fix change_input_to_int32.
update keras notebook to copy script from rel-1.3.0 branch (Will update them later)

2020-06-16 09:36:51 -07:00

Tianlei Wu

2605faef88

Add past state support in Attention Op for GPT-2 (#4107 )

Update Attention op to allow past state input and output.
Add fusion script and tests

2020-06-11 14:19:55 -07:00

Tianlei Wu

7f750b65ce

support model > 2GB in transformer optimizer (#4038 )

* Enable optimizer on models with external data (>2GB)
* Refactoring optimizer: move fusion to separate file
* Update benchmark: (1) output datatime to csv (2) Add option --onnx_dir to benchmark.py for onnx model directory path (3) add gpt2-large (4) loose thrsholds for fp16 validation
* update optimizer (1) Add attribute of ConstantOfShape in fp16 conversion (2) Use OnnxRuntime level 1 optimization
* update bert_perf_test.py: rename --input_ids to --input_ids_name

2020-05-28 01:16:41 -07:00

3 commits