onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-07-03 03:58:54 +00:00

Author	SHA1	Message	Date
Gary Miguel	e2b1852eec	Build: respect onnxruntime_PREFER_SYSTEM_LIB for more things (#9181 ) This is based on a patch applied locally by https://github.com/conda-forge/onnxruntime-feedstock. Having this in master seems useful.	2021-10-06 13:49:28 -07:00
Thiago Crepaldi	52d067402a	Fix all-or-nothing fallback for bad ORTModule init (#9277 ) * Fix all-or-nothing fallback for bad ORTModule init * Address comments	2021-10-06 15:12:27 -04:00
Suffian Khan	510b58c877	Increase AMD CI pipeline timeout to 120 min (#9280 ) * increase timeout * add timeout * add timeout * rename	2021-10-06 10:43:09 -07:00
Changming Sun	334980e016	Delete nocontribops pipelines	2021-10-06 10:30:32 -07:00
baijumeswani	bcdb411c8d	Implement FusedAdam for ORT adapted from DeepSpeed (#9266 )	2021-10-05 20:50:34 -07:00
Guoyu Wang	a4d53c4ab5	fix training distributed ci failure (#9273 )	2021-10-05 15:36:44 -07:00
ashbhandare	35c2102cfa	Fixes for GatherND, Multinomial (#9143 ) * register gathernd kernel, aten multinomial * fix CI, add test * review comments	2021-10-05 14:51:58 -07:00
G. Ramalingam	0b77c9ca7c	Cleanup function definitions of contrib ops (#9265 ) * Simplify function definitions * Simplify fast-gelu function definition * Simplify training function op body definitions Signed-off-by: Ganesan Ramalingam <grama@microsoft.com> * Eliminate redundant function Signed-off-by: Ganesan Ramalingam <grama@microsoft.com> * Formatting changes Signed-off-by: Ganesan Ramalingam <grama@microsoft.com> * Minor formatting changes Signed-off-by: Ganesan Ramalingam <grama@microsoft.com> * Add comment Signed-off-by: Ganesan Ramalingam <grama@microsoft.com> * Specify int64 type for constant 1 Signed-off-by: Ganesan Ramalingam <grama@microsoft.com>	2021-10-05 11:38:42 -07:00
Thiago Crepaldi	6e2f66ee9c	Allow custom exporter args + bug fix (#9242 )	2021-10-04 11:32:42 -04:00
Jingqiao Fu	67ff339df7	fixed a profiler.py bug (#9231 )	2021-10-03 20:28:20 -07:00
ashari4	113edbda64	Add bf16 specialization for IsDataType (#9254 ) * Add bf16 specialization * Fixed indent	2021-10-02 07:15:06 -07:00
Sheil Kumar	8f6fd014e4	Force Windows AI NuGet pipeline to use Windows SDK 19041 (#9255 ) * Force Windows AI Nuget pipeline to use 19041 Windows SDK as 22000 casues a downlevel regression by importing LoadLibraryW * move into quotes Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-10-01 21:46:14 -07:00
Faith Xu	9fe09cb72a	Update dockerfile readme (#9241 ) * Update dockerfiles page * Delete Dockerfile.server * Delete Dockerfile.training	2021-10-01 17:28:26 -07:00
Tiago Koji Castro Shibata	11a391a88f	Port ARM64x support (#9230 )	2021-10-01 13:06:43 -07:00
Guoyu Wang	60bbdf1403	Remove unused NodeArgs in Graph::Resolve (#9213 ) * Remove unused NodeArgs * Handle case where a node arg from an initializer from initializer_names_to_preserve * Fix CI failure * update test * Fix outer scope node args failure * Use NodeArg* as the key of the std::set instead of string * Minor updates	2021-10-01 11:44:26 -07:00
Yulong Wang	8adb9ab85a	fix CodeQL warning for path-injection (#9243 )	2021-10-01 11:32:00 -07:00
baijumeswani	45399d5ace	Remove TORCH_WARN to avoid torch string related operations that take up time (#9238 )	2021-10-01 13:56:04 -04:00
Tang, Cheng	be4d887439	Fix ONNX exporter call with latest API for ORTrainer (#9228 ) * update the exporter call with latest api in orttrainer * use official export api instead of the private call	2021-10-01 13:49:55 -04:00
Yulong Wang	448325b254	[js/web] name ort web for consistency (#9240 )	2021-09-30 22:53:26 -07:00
Tracy Sharpe	c23a216900	MLAS: fix AVXVNNI+Linux qgemv kernel (#9234 )	2021-09-30 21:24:18 -07:00
Yulong Wang	e2d779246a	[wasm] remove deprecated prefix 'EXTRA_' in emcc flags (#9211 )	2021-09-30 16:02:24 -07:00
Sheil Kumar	c6cb49c5a1	DirectML.dll load fails when executable path contains Non-English characters (#9229 ) * enable unicode dml * add wide string L prefix * Add Fail Fast back Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-09-30 15:16:57 -07:00
Yulong Wang	634bb5ede0	fix CodeQL warning 'Remote property injection' (#9224 )	2021-09-30 13:45:22 -07:00
Yulong Wang	8c57d51928	support WebAssembly SIMD for qgemm (#9191 ) * support WebAssembly SIMD for qgemm * remove '--experimental-wasm-bulk-memory' for test	2021-09-30 12:40:56 -07:00
G. Ramalingam	e79be39081	LayerNormGrad function body and LayerNorm inference/body fix (#9160 ) * Add function body for LayerNormGrad * Fix LayerNorm schema for multiple normalization dims	2021-09-30 12:03:08 -07:00
Changming Sun	e1b84eefcc	Revert "Revert "linux trt package pipeline (#7537 )"" This reverts commit `b606005858`.	2021-09-30 11:39:23 -07:00
Edward Chen	5326397a6a	[iOS] Facilitate usage of pods with custom builds (#9216 ) Refactor iOS framework build/pod package creation into a separate script that can be used with custom builds. Add documentation.	2021-09-30 08:44:00 -07:00
Thiago Crepaldi	ceb51dda4a	Support external torch cpp extensions on ORTModule (#9223 )	2021-09-30 10:37:35 -04:00
RandySheriffH	ffca0b777b	Patching cuda profiler with enhancements (#9214 )	2021-09-29 21:02:09 -07:00
Scott McKay	4a1b386f7c	#9182 removed the `--is_store_build` option but one place where that was used was missed. (#9219 ) This should fix the relevant packaging pipelines.	2021-09-29 09:28:31 -07:00
satyajandhyala	278928a102	Added a test case for python gradient builder. (#9207 ) * Register Cos operator gradient using ORTModule's register_gradient and compare gradient against PyTorch.	2021-09-29 09:24:12 -07:00
stevenlix	4f10024868	Fix shape inference issue in Gather op (#9147 ) * add initializer checker for Gather with 1D input * Check if indices value exists * Update symbolic_shape_infer.py * add unit test * Update symbolic_shape_infer.py * Update symbolic_shape_infer.py	2021-09-28 22:46:12 -07:00
Changming Sun	b606005858	Revert "linux trt package pipeline (#7537 )" This reverts commit `faea7a222d`.	2021-09-28 19:09:04 -07:00
RandySheriffH	058108bef9	Execution Provider Profiler (#8406 ) * implement cuda provider * define profiler common * call start after register * add memcpy event * add cuda correlation * format code * add cupti to test path * switch to CUpti_ActivityKernel3 * reset cupti path * fix test case * fix trt pipeline * add namespace * format code * exclude training from testing * remove mutex	2021-09-28 13:59:52 -07:00
Suffian Khan	6f580f07de	Switch AMD CI pipeline to use environment image from onnxruntimecibuildenvironment (#9206 ) * shift docker image reference for amd ci pipeline * fix service endpoint * reduce perf tolerance	2021-09-28 13:06:16 -07:00
Changming Sun	1104e8d3e5	Linux Anybuild build pipeline (#9091 )	2021-09-28 11:22:27 -07:00
ytaous	d3f859fe30	Dropout Vectorized Kernel (#9157 ) * vectorized kernel * fix build * re-calibrate expected loss * fix build * re-calibrate convergence results * more re-calibrate on loss * divide kernels * adress comments * more calibration * calibration * per comments * enable sync Co-authored-by: Ethan Tao <ettao@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-09-27 17:19:12 -07:00
Wei-Sheng Chin	1b0816859f	Only wrap sub-modules which can be wrapped as ORTModule (#9021 )	2021-09-27 17:18:22 -07:00
Yufeng Li	1a71687102	Add QDQ for output of node (#9134 ) * Add QDQ for output of node * keep output of removable activation	2021-09-27 16:55:23 -07:00
Sheil Kumar	f16bb37fae	make uwp store apps link to statically-linked crt desktop builds (#9182 ) Co-authored-by: Sheil Kumar <sheilk@microsoft.com>	2021-09-27 15:37:15 -07:00
baijumeswani	c30cc9190a	Change the agent pool for orttraining-distributed pipeline (#9179 )	2021-09-26 21:26:44 -07:00
Chi Lo	9fda95fec9	Change keepdims of ReduceMax/ReduceMin to always 1 when using quatization calibration MinMax approach (#9167 ) * Change keepdims to always 1 * fix typo * Refine code	2021-09-25 10:13:54 -07:00
baijumeswani	fd91bf91c9	Print full stacktrace exception when exporter fails (#9169 )	2021-09-24 10:24:37 -04:00
Vincent Wang	39dc6ea8a3	Fix to_dlpack Failure on PyTorch-1.10 (#9151 ) * workaround to_dlpack fail in new pt version * add torch code link	2021-09-24 09:48:07 +08:00
George Wu	0888c6cc59	fix bug introduced by PR9130 (#9166 )	2021-09-23 10:58:33 -07:00
Guoyu Wang	ddafe50199	Fix Android build break after Virtual Environment update to 20210919 (#9163 )	2021-09-23 10:07:18 -07:00
Chi Lo	bde16eea68	Cleanup USE_TENSORRT macro (#8593 ) * Remove USE_TENSORRT macro and disable TRT EP at runtime if not support * Remove USE_TENSORRT macro and disable TRT EP at runtime if not support * Remove USE_TENSORRT macro and disable TRT EP at runtime if not support * handle unused parameters * Remove USE_TENSORRT macro and disable TRT EP at runtime if not support * Remove USE_TENSORRT macro and disable TRT EP at runtime if not support * handle unused parameters * Disable some testcases * only include opset13 for testing and add a keyword filter set * rename variable * add back code which was accidentally commented on previous commit * Adjust model test filter for opset14	2021-09-22 21:04:44 -07:00
ke1337	6e83392ff1	Bump up TVM version to avoid conflict with existing one (#9159 ) * Bump up tvm version * Bump up onnxruntime-tvm version There are some c++17 related fixes in TVM Co-authored-by: KeDengMS <kedeng@microsoft.com>	2021-09-22 17:39:19 -07:00
Ye Wang	4934455ab6	Bumping up to 1.10 (#9006 ) * bump to 1.10 * Update Versioning.md * Update README.rst * Change opset version to 15	2021-09-22 16:34:28 -07:00
Jason	4e5bc8365b	Add Paddle2ONNX to Versioning.md (#9067 ) * Add Paddle2ONNX to Versioning.md	2021-09-22 13:38:14 -07:00

1 2 3 4 5 ...

5640 commits