Commit graph

5334 commits

Author SHA1 Message Date
Zhang Lei
76dfe8108b
Optimize quantized LSTM (#8634)
* optimize some lstm gate computation. Remove unneeded string constructions.

* change gcc optimization flags for computation-bound logic in rnn_helpers

* better qgemm for M=1

* Some improvements on avx512

* add condition to limit GCC related macros

* Correct QGemm assembly for M=1 AVX2 optimization to pass mlas_test.

* Fix rnn_helper build issue for wasm.

* better asm code here according to feedback.

* Remove customized vectorize and unroll options for GCC.
Use restrict on some functions to help GCC vectorize them correctly.
Rewrite clip_add_bias() to let GCC vectorize it correctly.

* Better restrict semantics for merge_lstm_gates_to_memory() by adding in_place().
Add MSC __restrict for the clip_add_bias() method to vectorize correctly.

* Force CI restart, as it was stuck on the onnxruntime-python-checks-ci-pipeline, which cannot be restarted.
2021-08-11 22:02:18 -07:00
Adrian Tsai
caacf249c5
Disable candy_opset9 WinML model test on Qualcomm Adreno (#8647)
Bug #31652854 also repros on Qualcomm Adreno (down to the exact same pixel). This change disables this model test for Qualcomm, in addition to the existing disablement for Intel.
2021-08-11 17:57:12 -07:00
liqun Fu
bec24ca4c1
create packaging pipeline to support cuda11.4 (#8663) 2021-08-11 17:44:57 -07:00
Zhang Lei
c6ef6b5bc8
Subgraph support for quantization tools (#8012)
By default, do not enable subgraph quantization, to keep it consistent with existing behavior.
It should be OK to enable it in quantize_dynamic mode with extra_options.
2021-08-11 16:35:52 -07:00
Changming Sun
c5c5d3499b
Rewrite dockerfiles/Dockerfile.arm32v7 (#8686) 2021-08-11 15:25:04 -07:00
Tang, Cheng
de2a53e46d
[eager mode] fix build and support customize shared provider entry point (#8680)
* fix build break

* support customizing the name of the shared provider lib's entry point

* fix non-training build

* check error code

* check return code
2021-08-11 15:10:35 -07:00
Tianlei Wu
f661c18654
Fix attention perf regression (#8682)
* undo change in attention cpu
* fix perf regression
* disable persistent softmax by default
2021-08-11 12:07:18 -07:00
harshithapv
c24335246b
Support bool type for Pad Op and fix Unsqueeze in Tile grad for Opset 13 (#8602)
* changes

* tile grad unsqueeze fix for opset 13

* clean up

* remove bool support for opset 2 to 12 for Pad as it is not supported.

* Copy OperatorKernels.md from artifacts of Windows CI build.
2021-08-11 11:21:02 -07:00
Guoyu Wang
a13daf550b
iOS CocoaPods spec fix (#8678) 2021-08-11 10:11:44 -07:00
ytaous
0725f80d2d
Revert "Fix Windows Store build (#8481)" (#8679)
This reverts commit 53e7831b53.
2021-08-11 00:37:36 -07:00
Tiago Koji Castro Shibata
53e7831b53
Fix Windows Store build (#8481)
* Remove APIs unavailable in Store in #8349, #8178, #8065

* Add UWP stubs of C runtime functions

* Remove UWP incompatible tests from UWP build

* Remove incompatible tests from Store

* Use UWP stubs in store only

* Skip partition check outside of Windows

* Remove unused WRL include

* Workaround Windows header not including what it uses

* Fix precompiled header name clash

* Workaround SDK bugs

* DXCore workaround in Win7

* Fix warning

* Fix more warnings

* Bump WinML to target Windows 8

* Fix more warnings

* Remove unnecessary workarounds
2021-08-10 15:19:30 -07:00
Xavier Dupré
064a385b59
Support int8 for operator Split (#8615)
* Support int8 for operator Split
2021-08-10 23:04:16 +02:00
Nick Kreeger
3a742f2910
Ensure that the BiasGelu op test runs on CPU. (#8671) 2021-08-10 14:59:41 -05:00
Sunghoon
e74f86059c
[js/web] enable SharedArrayBuffer feature for WebAssembly testing (#8651)
* Enable cross-origin isolation for WebAssembly multi-threading tests

* merging master
2021-08-10 11:22:07 -07:00
Changming Sun
ed17ca3595
Remove onnxruntime/core/protobuf (#8617)
* remove onnxruntime/core/protobuf

* Update How_To_Update_ONNX_Dev_Notes.md
2021-08-10 09:36:27 -07:00
Tracy Sharpe
f0073308d0
Fix CONCURRENCY_VISUALIZER build break (#8664) 2021-08-09 19:32:08 -07:00
liqun Fu
1a2b41dbbc
packaging pipeline produces -cpu- named packages due to a logical error (#8665) 2021-08-09 16:49:59 -07:00
Mike Roberts
cadb43a715
Fix 'SyntaxWarning: "is" with a literal' issues in Python transformers (#8658) 2021-08-09 15:03:52 -07:00
Edward Chen
20f006c580
Remove flake8 check from CMake build. (#8662) 2021-08-09 14:10:36 -07:00
Guoyu Wang
52a212e4f1
Bump ORT master version to 1.8.2 (#8646) 2021-08-09 11:10:29 -07:00
Tianlei Wu
3166a9b8e9
refine API of transformer optimizer (#8633)
* rename BertOptimizationOptions to FusionOptions
* remove disable_onnxruntime, and use opt_level to control whether onnxruntime graph optimization is used.
* Change default opt_level for backward compatibility. When opt_level is not specified, the default value is based on model type.
2021-08-09 10:55:49 -07:00
Edward Chen
baf8c39a8d
Add Python checks pipeline (#7032)
This change adds a new pipeline for checking Python code. Currently this pipeline only runs flake8.
flake8 is also run as part of the CMake project builds, but we can switch over completely to the new pipeline later.
The .flake8 config file was also updated to make it easier to run standalone (flake8 --config ./.flake8) and some Python formatting issues were addressed in files that were not previously scanned.
2021-08-09 10:37:05 -07:00
Dmitri Smirnov
aaeb781132
Revert "Add netstandard2.0 target to OnnxRuntime.Managed (#8600)" (#8648)
This reverts commit e673d2e050.
2021-08-09 10:08:07 -07:00
Suffian Khan
6dd59a1117
revert onnx version (#8643) 2021-08-09 05:53:40 -07:00
mindest
a56e325eb8
constrain inputs for min/max grad UT (#8632)
* fix inputs for min/max grad UT

* use random inputs (truncated)
2021-08-07 18:29:06 +08:00
Yulong Wang
1ce4328846
[js/node] fix js node install (#8650) 2021-08-06 23:36:53 -07:00
Yulong Wang
c86b813519
[js] resolve CodeQL warnings for force strict mode (#8645) 2021-08-06 19:35:43 -07:00
liqun Fu
419fd5cc6e
reformat build suffix so that the latest is always correct (#8267) 2021-08-06 16:44:51 -07:00
Tianlei Wu
44ff80e816
re-enable gpt2 fusion tests (#8566)
Re-enable tests that were disabled in PR 8530
Update import of test_optimizer.py so that the test can run in the source directory.
Add a parameter to disable symbolic shape inference in fp16 conversion since it throws an exception for some models.
2021-08-06 16:16:17 -07:00
Yulong Wang
1b902d0227
doc: add ort-web related instructions to update onnx doc (#8500)
* doc: update instructions for ort web docs

* revise readme
2021-08-06 15:09:11 -07:00
Tang, Cheng
6d3c2c85ef
Integrate eager mode source code into onnxruntime repo (#8584)
* integrate eager mode source code; build with cmake and integrate the python test

* Adding the python path for importing libraries in the Eager mode

* fix clang break; check if training and python are enabled

* handling the linking of torch libraries across multiple platforms

* merge and fix the naming

* add build instruction

Co-authored-by: Abhishek Jindal <abjindal@OrtTrainingDev0.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
Co-authored-by: ajindal1 <abjindal@microsoft.com>
2021-08-06 08:30:27 -07:00
Hariharan Seshadri
484e9de55c
Optimize sequence type usage on CUDA [1/n] (#8598) 2021-08-05 23:25:52 -07:00
Hariharan Seshadri
e791faeca5
Fix bug in CPU force fallback logic (#8597) 2021-08-05 21:36:28 -07:00
Yulong Wang
f3a1aebb33
[js/web] support override wasm file path (#8610) 2021-08-05 18:01:03 -07:00
liqun Fu
eab6c51413
to create a training cpu package for torch-ort documentation (#7845) 2021-08-05 16:43:37 -07:00
Changming Sun
0458821944
Delete linux-ort-srv-ci-pipeline.yml (#8628) 2021-08-05 15:06:07 -07:00
Tianlei Wu
24b14c650b
Add parity test for LayerNormalization (#8622) 2021-08-05 10:11:19 -07:00
Edward Chen
dda9f53bed
Build script logging updates (#8618)
Log build.py command line arguments.
Update subprocess logging to format arguments in a way that is easier to copy.
2021-08-05 09:41:17 -07:00
Ashwini Khade
96eb9810ba
Update onnx (#8458)
* updates for picking onnx commit

* add tests filter to c# tests

* plus test fixes

* fix versioning for contrib ops

* fix tests

* test filter for optional ops

* more versioning related updates

* fix test

* fix layernorm spec

* more updates

* update docs

* add more test filters

* more filters

* update binary size threshold

* update docs

* plus more fixes

* updates per review

* update to release commit

* add filters for optional type tests

* plus updates
2021-08-05 09:21:44 -07:00
Chun-Wei Chen
9d88b1de78
correct supported ONNX version (#8590) 2021-08-05 06:49:50 -07:00
Georg Jung
e673d2e050
Add netstandard2.0 target to OnnxRuntime.Managed (#8600) 2021-08-05 18:39:25 +10:00
Du Li
fa722d208b
[js/web] adding webgl pointwise conv kernel (#8418) 2021-08-04 20:46:08 -07:00
Edward Chen
1041fa34f4
Specify timeout for iOS packaging pipeline (#8616) 2021-08-04 11:17:50 -07:00
Edward Chen
f6da9ac6d6
Add more detail to model IR version mismatch error message. (#8607) 2021-08-04 11:16:37 -07:00
Changming Sun
6c69baf78e
Disable Training Windows GPU Debug build because it is failing (#8608) 2021-08-04 09:24:27 -07:00
austinpagan
1003db0058
Fixing mlas unittest failures in POWER (#8498) 2021-08-04 09:23:40 -07:00
Tianlei Wu
7b289a7927
Add test to evaluate Gelu and Fastgelu precision (#8592)
* test gelu and fastgelu precision
2021-08-03 15:35:19 -07:00
Changming Sun
375e86f0a0
Make DNNL EP not depend on onnx (#8588) 2021-08-03 14:11:36 -07:00
Edward Chen
e09321f4db
Update ORT format model conversion utility to optionally fail fast on model conversion failure. (#8589) 2021-08-03 11:12:56 -07:00
Weixing Zhang
deab284e4c
fix build failure with --cmake_extra_defines onnxruntime_DEBUG_NODE_INPUTS_OUTPUTS=1 (#8587)
* fix build failure with --cmake_extra_defines onnxruntime_DEBUG_NODE_INPUTS_OUTPUTS=1

* another compile error and add onnxruntime_USE_ROCM

* braces alignment

Co-authored-by: suffian khan <sukha@microsoft.com>
2021-08-03 09:02:49 -07:00