Gary Miguel
e2b1852eec
Build: respect onnxruntime_PREFER_SYSTEM_LIB for more things ( #9181 )
...
This is based on a patch applied locally by
https://github.com/conda-forge/onnxruntime-feedstock . Having this in
master seems useful.
2021-10-06 13:49:28 -07:00
Thiago Crepaldi
52d067402a
Fix all-or-nothing fallback for bad ORTModule init ( #9277 )
...
* Fix all-or-nothing fallback for bad ORTModule init
* Address comments
2021-10-06 15:12:27 -04:00
Suffian Khan
510b58c877
Increase AMD CI pipeline timeout to 120 min ( #9280 )
...
* increase timeout
* add timeout
* add timeout
* rename
2021-10-06 10:43:09 -07:00
Changming Sun
334980e016
Delete nocontribops pipelines
2021-10-06 10:30:32 -07:00
baijumeswani
bcdb411c8d
Implement FusedAdam for ORT adapted from DeepSpeed ( #9266 )
2021-10-05 20:50:34 -07:00
Guoyu Wang
a4d53c4ab5
fix training distributed ci failure ( #9273 )
2021-10-05 15:36:44 -07:00
ashbhandare
35c2102cfa
Fixes for GatherND, Multinomial ( #9143 )
...
* register gathernd kernel, aten multinomial
* fix CI, add test
* review comments
2021-10-05 14:51:58 -07:00
G. Ramalingam
0b77c9ca7c
Cleanup function definitions of contrib ops ( #9265 )
...
* Simplify function definitions
* Simplify fast-gelu function definition
* Simplify training function op body definitions
Signed-off-by: Ganesan Ramalingam <grama@microsoft.com>
* Eliminate redundant function
Signed-off-by: Ganesan Ramalingam <grama@microsoft.com>
* Formatting changes
Signed-off-by: Ganesan Ramalingam <grama@microsoft.com>
* Minor formatting changes
Signed-off-by: Ganesan Ramalingam <grama@microsoft.com>
* Add comment
Signed-off-by: Ganesan Ramalingam <grama@microsoft.com>
* Specify int64 type for constant 1
Signed-off-by: Ganesan Ramalingam <grama@microsoft.com>
2021-10-05 11:38:42 -07:00
Thiago Crepaldi
6e2f66ee9c
Allow custom exporter args + bug fix ( #9242 )
2021-10-04 11:32:42 -04:00
Jingqiao Fu
67ff339df7
fixed a profiler.py bug ( #9231 )
2021-10-03 20:28:20 -07:00
ashari4
113edbda64
Add bf16 specialization for IsDataType ( #9254 )
...
* Add bf16 specialization
* Fixed indent
2021-10-02 07:15:06 -07:00
Sheil Kumar
8f6fd014e4
Force Windows AI NuGet pipeline to use Windows SDK 19041 ( #9255 )
...
* Force Windows AI Nuget pipeline to use 19041 Windows SDK as 22000 casues a downlevel regression by importing LoadLibraryW
* move into quotes
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2021-10-01 21:46:14 -07:00
Faith Xu
9fe09cb72a
Update dockerfile readme ( #9241 )
...
* Update dockerfiles page
* Delete Dockerfile.server
* Delete Dockerfile.training
2021-10-01 17:28:26 -07:00
Tiago Koji Castro Shibata
11a391a88f
Port ARM64x support ( #9230 )
2021-10-01 13:06:43 -07:00
Guoyu Wang
60bbdf1403
Remove unused NodeArgs in Graph::Resolve ( #9213 )
...
* Remove unused NodeArgs
* Handle case where a node arg from an initializer from initializer_names_to_preserve
* Fix CI failure
* update test
* Fix outer scope node args failure
* Use NodeArg* as the key of the std::set instead of string
* Minor updates
2021-10-01 11:44:26 -07:00
Yulong Wang
8adb9ab85a
fix CodeQL warning for path-injection ( #9243 )
2021-10-01 11:32:00 -07:00
baijumeswani
45399d5ace
Remove TORCH_WARN to avoid torch string related operations that take up time ( #9238 )
2021-10-01 13:56:04 -04:00
Tang, Cheng
be4d887439
Fix ONNX exporter call with latest API for ORTrainer ( #9228 )
...
* update the exporter call with latest api in orttrainer
* use official export api instead of the private call
2021-10-01 13:49:55 -04:00
Yulong Wang
448325b254
[js/web] name ort web for consistency ( #9240 )
2021-09-30 22:53:26 -07:00
Tracy Sharpe
c23a216900
MLAS: fix AVXVNNI+Linux qgemv kernel ( #9234 )
2021-09-30 21:24:18 -07:00
Yulong Wang
e2d779246a
[wasm] remove deprecated prefix 'EXTRA_' in emcc flags ( #9211 )
2021-09-30 16:02:24 -07:00
Sheil Kumar
c6cb49c5a1
DirectML.dll load fails when executable path contains Non-English characters ( #9229 )
...
* enable unicode dml
* add wide string L prefix
* Add Fail Fast back
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2021-09-30 15:16:57 -07:00
Yulong Wang
634bb5ede0
fix CodeQL warning 'Remote property injection' ( #9224 )
2021-09-30 13:45:22 -07:00
Yulong Wang
8c57d51928
support WebAssembly SIMD for qgemm ( #9191 )
...
* support WebAssembly SIMD for qgemm
* remove '--experimental-wasm-bulk-memory' for test
2021-09-30 12:40:56 -07:00
G. Ramalingam
e79be39081
LayerNormGrad function body and LayerNorm inference/body fix ( #9160 )
...
* Add function body for LayerNormGrad
* Fix LayerNorm schema for multiple normalization dims
2021-09-30 12:03:08 -07:00
Changming Sun
e1b84eefcc
Revert "Revert "linux trt package pipeline ( #7537 )""
...
This reverts commit b606005858 .
2021-09-30 11:39:23 -07:00
Edward Chen
5326397a6a
[iOS] Facilitate usage of pods with custom builds ( #9216 )
...
Refactor iOS framework build/pod package creation into a separate script that can be used with custom builds.
Add documentation.
2021-09-30 08:44:00 -07:00
Thiago Crepaldi
ceb51dda4a
Support external torch cpp extensions on ORTModule ( #9223 )
2021-09-30 10:37:35 -04:00
RandySheriffH
ffca0b777b
Patching cuda profiler with enhancements ( #9214 )
2021-09-29 21:02:09 -07:00
Scott McKay
4a1b386f7c
#9182 removed the --is_store_build option but one place where that was used was missed. ( #9219 )
...
This should fix the relevant packaging pipelines.
2021-09-29 09:28:31 -07:00
satyajandhyala
278928a102
Added a test case for python gradient builder. ( #9207 )
...
* Register Cos operator gradient using ORTModule's register_gradient and compare gradient against PyTorch.
2021-09-29 09:24:12 -07:00
stevenlix
4f10024868
Fix shape inference issue in Gather op ( #9147 )
...
* add initializer checker for Gather with 1D input
* Check if indices value exists
* Update symbolic_shape_infer.py
* add unit test
* Update symbolic_shape_infer.py
* Update symbolic_shape_infer.py
2021-09-28 22:46:12 -07:00
Changming Sun
b606005858
Revert "linux trt package pipeline ( #7537 )"
...
This reverts commit faea7a222d .
2021-09-28 19:09:04 -07:00
RandySheriffH
058108bef9
Execution Provider Profiler ( #8406 )
...
* implement cuda provider
* define profiler common
* call start after register
* add memcpy event
* add cuda correlation
* format code
* add cupti to test path
* switch to CUpti_ActivityKernel3
* reset cupti path
* fix test case
* fix trt pipeline
* add namespace
* format code
* exclude training from testing
* remove mutex
2021-09-28 13:59:52 -07:00
Suffian Khan
6f580f07de
Switch AMD CI pipeline to use environment image from onnxruntimecibuildenvironment ( #9206 )
...
* shift docker image reference for amd ci pipeline
* fix service endpoint
* reduce perf tolerance
2021-09-28 13:06:16 -07:00
Changming Sun
1104e8d3e5
Linux Anybuild build pipeline ( #9091 )
2021-09-28 11:22:27 -07:00
ytaous
d3f859fe30
Dropout Vectorized Kernel ( #9157 )
...
* vectorized kernel
* fix build
* re-calibrate expected loss
* fix build
* re-calibrate convergence results
* more re-calibrate on loss
* divide kernels
* adress comments
* more calibration
* calibration
* per comments
* enable sync
Co-authored-by: Ethan Tao <ettao@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>
2021-09-27 17:19:12 -07:00
Wei-Sheng Chin
1b0816859f
Only wrap sub-modules which can be wrapped as ORTModule ( #9021 )
2021-09-27 17:18:22 -07:00
Yufeng Li
1a71687102
Add QDQ for output of node ( #9134 )
...
* Add QDQ for output of node
* keep output of removable activation
2021-09-27 16:55:23 -07:00
Sheil Kumar
f16bb37fae
make uwp store apps link to statically-linked crt desktop builds ( #9182 )
...
Co-authored-by: Sheil Kumar <sheilk@microsoft.com>
2021-09-27 15:37:15 -07:00
baijumeswani
c30cc9190a
Change the agent pool for orttraining-distributed pipeline ( #9179 )
2021-09-26 21:26:44 -07:00
Chi Lo
9fda95fec9
Change keepdims of ReduceMax/ReduceMin to always 1 when using quatization calibration MinMax approach ( #9167 )
...
* Change keepdims to always 1
* fix typo
* Refine code
2021-09-25 10:13:54 -07:00
baijumeswani
fd91bf91c9
Print full stacktrace exception when exporter fails ( #9169 )
2021-09-24 10:24:37 -04:00
Vincent Wang
39dc6ea8a3
Fix to_dlpack Failure on PyTorch-1.10 ( #9151 )
...
* workaround to_dlpack fail in new pt version
* add torch code link
2021-09-24 09:48:07 +08:00
George Wu
0888c6cc59
fix bug introduced by PR9130 ( #9166 )
2021-09-23 10:58:33 -07:00
Guoyu Wang
ddafe50199
Fix Android build break after Virtual Environment update to 20210919 ( #9163 )
2021-09-23 10:07:18 -07:00
Chi Lo
bde16eea68
Cleanup USE_TENSORRT macro ( #8593 )
...
* Remove USE_TENSORRT macro and disable TRT EP at runtime if not support
* Remove USE_TENSORRT macro and disable TRT EP at runtime if not support
* Remove USE_TENSORRT macro and disable TRT EP at runtime if not support
* handle unused parameters
* Remove USE_TENSORRT macro and disable TRT EP at runtime if not support
* Remove USE_TENSORRT macro and disable TRT EP at runtime if not support
* handle unused parameters
* Disable some testcases
* only include opset13 for testing and add a keyword filter set
* rename variable
* add back code which was accidentally commented on previous commit
* Adjust model test filter for opset14
2021-09-22 21:04:44 -07:00
ke1337
6e83392ff1
Bump up TVM version to avoid conflict with existing one ( #9159 )
...
* Bump up tvm version
* Bump up onnxruntime-tvm version
There are some c++17 related fixes in TVM
Co-authored-by: KeDengMS <kedeng@microsoft.com>
2021-09-22 17:39:19 -07:00
Ye Wang
4934455ab6
Bumping up to 1.10 ( #9006 )
...
* bump to 1.10
* Update Versioning.md
* Update README.rst
* Change opset version to 15
2021-09-22 16:34:28 -07:00
Jason
4e5bc8365b
Add Paddle2ONNX to Versioning.md ( #9067 )
...
* Add Paddle2ONNX to Versioning.md
2021-09-22 13:38:14 -07:00