onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-05-21 21:52:11 +00:00

Author	SHA1	Message	Date
Sheil Kumar	af0001cdfd	1.9.1 Cherry-Picks (#9239 ) * Add full iOS job in package pipeline (#9036) * Add full ios xcframework job * create zip file of the xcframework * Bump up TVM version to avoid conflict with existing one (#9159) * Bump up tvm version * Bump up onnxruntime-tvm version There are some c++17 related fixes in TVM Co-authored-by: KeDengMS <kedeng@microsoft.com> * fix bug introduced by PR9130 (#9166) * make uwp store apps link to statically-linked crt desktop builds (#9182) Co-authored-by: Sheil Kumar <sheilk@microsoft.com> * #9182 removed the `--is_store_build` option but one place where that was used was missed. (#9219) This should fix the relevant packaging pipelines. * DirectML.dll load fails when executable path contains Non-English characters (#9229) * enable unicode dml * add wide string L prefix * Add Fail Fast back Co-authored-by: Sheil Kumar <sheilk@microsoft.com> * Fix Android build break after Virtual Environment update to 20210919 (#9163) Co-authored-by: Guoyu Wang <62914304+gwang-msft@users.noreply.github.com> Co-authored-by: ke1337 <22626095+ke1337@users.noreply.github.com> Co-authored-by: KeDengMS <kedeng@microsoft.com> Co-authored-by: George Wu <jywu@microsoft.com> Co-authored-by: Sheil Kumar <sheilk@microsoft.com> Co-authored-by: Scott McKay <skottmckay@gmail.com>	2021-10-01 07:35:48 -07:00
Suffian Khan	4daa14bc74	Fixes to rel-1.9.0 to compile and pass for AMD ROCm (#9144 ) * Revert "Fix nightly CI pipeline to generate ROCm 4.2 wheels and add ROCm 4.3.1 wheels (#9101)" This reverts commit `47888392ab`. * Add BatchNorm kernel for ROCm (#9014) * Add BatchNorm kernel for ROCm, update BN test * correct epsilon_ setting; limit min epsilon * Upgrade ROCm CI pipeline for ROCm 4.3.1 and permit run inside container (#9070) * try to run inside 4.3.1 container * no \ in container run command * remove networking options * try with adding video render groups * add job to build docker image * try without 1st stage * change alpha, beta to float * try adding service connection * retain huggingface directory * static video and render gid * use runtime expression for variables * install torch-ort * pin sacrebleu==1.5.1 * update curves for rocm 4.3.1 * try again * disable determinism and only check tail of loss curve and with a much larger threshold of 0.05 * disable RoBERTa due to high run variablity on ROCm 4.3.1 * put reduction unit tests back in * Fix nightly CI pipeline to generate ROCm 4.2 wheels and add ROCm 4.3.1 wheels (#9101) * make work for both rocm 4.2 and rocm 4.3.1 * fix rocm 4.3.1 docker image reference * fix CUDA_VERSION to ROCM_VERSION * fix ReduceConsts conflict def * add ifdef to miopen_common.h as well * trailing ws Co-authored-by: wangye <wangye@microsoft.com> Co-authored-by: mindest <30493312+mindest@users.noreply.github.com>	2021-09-21 18:07:07 -07:00
Ye Wang	66b3c31f76	Final round cherry-picks to 1.9.0 (#9133 ) * Fixing MORE mlas unittest failures in POWER (#8673) * Ensure ms-experimental domain Audio Ops build in mac pipeline (#8857) * Globally enable ms-experimental ops * change meaning of ms_experimental to mean all ms_experimental ops. Some experimental ops will still be enabled globally without this flag like audio ops. * add cmath * add cmath to signal_defs.cc * move audio back into experimental, verify on mac * remove experimental from mac builds Co-authored-by: Sheil Kumar <sheilk@microsoft.com> * Remove cpuinfo from WCOS builds (#9076) * Fix a bug for Openvino Python binding (#9130) * Fix default initialization value in C API header (#9126) * fix default initialization value in C API header * Fix conflicts * Nits * Do not generate nuget symbol packages on Linux * fix name conflict in 1.9 for Fix default initialization value in C API header * Fix nightly CI pipeline to generate ROCm 4.2 wheels and add ROCm 4.3.1 wheels (#9101) * make work for both rocm 4.2 and rocm 4.3.1 * fix rocm 4.3.1 docker image reference * fix CUDA_VERSION to ROCM_VERSION * fix ReduceConsts conflict def * add ifdef to miopen_common.h as well * trailing ws * remove OrtCUDAProviderOptions() and simply set value * revert to use custom ctor and fix tests Co-authored-by: austinpagan <fossum@us.ibm.com> Co-authored-by: Sheil Kumar <smk2007@gmail.com> Co-authored-by: Sheil Kumar <sheilk@microsoft.com> Co-authored-by: Tiago Koji Castro Shibata <ticastro@microsoft.com> Co-authored-by: Changming Sun <chasun@microsoft.com> Co-authored-by: Hariharan Seshadri <shariharan91@gmail.com> Co-authored-by: Suffian Khan <sukha@microsoft.com>	2021-09-21 12:18:03 -07:00
Changming Sun	b73bc79ad1	Add a pipeline for audio ops (#9102 )	2021-09-20 18:55:13 -07:00
Ye Wang	83dc22585c	Second round cherry-pick to rel-1.9.0 (#9062 ) * Adding async fetching for webgl backend (#8951) * Adding async fetching for webgl backend * fix PR comments and CI failure. * fixing a bug * adding a flag * Enable linking in exception throwing support library when build onnxruntime wasm. (#8973) * Enable linking in exception throwing support library when build onnxruntime webassembly containing onnxruntime-extensions. * Add flag in build.py to enable linking exceptions throwing library. * Update onnxruntime-extensions document and bind custom_ops build flag with use_extensions. * Update doc. * Update cgmanifest.json. Co-authored-by: Zuwei Zhao <zuzhao@microsoft.com> * Remove document text from error message in a couple of ops (#9003) * do not add pkg wheel entry to the index html file if it already exists (#9004) * do not add pkg wheel entry to the index html file if it already exists * [js/web] fix ort web e2e test (#9025) * Fix cmake POWER10 detection Recent commit `60c98a8` changed variable mlas_common_srcs which affects POWER10 detection. * Fix Where op type reduction processing (#9033) * Update type reduction script to track Where Op's second input type. * Clean up op_kernel_type_control.h includes. * Use more maintainable include. * Fix ROCm wheels CI pipeline break by installing latest protobuf from source (#9047) * install protobuf from source * fix rm command in Dockerfile * fix options on rm command * fix cd into protobuf source directory * try again * remove strip step * debug list the files * ls on /usr * more debug * more debug * adjust LD_LIBRARY_PATH * try remove protobuf before ORT build * [js/web] a bugfix and add tests for wasm proxy worker (#9048) * [js/web] add tests for wasm proxy worker * fix script src override * Set onnxruntime_DISABLE_RTTI to default OFF (#9049) Co-authored-by: Du Li <duli1@microsoft.com> Co-authored-by: Zuwei Zhao <4123666+Zuwei-Zhao@users.noreply.github.com> Co-authored-by: Zuwei Zhao <zuzhao@microsoft.com> Co-authored-by: Hariharan Seshadri <shariharan91@gmail.com> Co-authored-by: liqun Fu <liqfu@microsoft.com> Co-authored-by: Yulong Wang <yulongw@microsoft.com> Co-authored-by: Rajalakshmi Srinivasaraghavan <rajis@linux.ibm.com> Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com> Co-authored-by: Suffian Khan <sukha@microsoft.com> Co-authored-by: Changming Sun <chasun@microsoft.com>	2021-09-15 18:02:07 -07:00
Ye Wang	f202cf3280	First round cherry-pick to rel-1.9.0 (#9019 ) * fast reduction for reducemean (#8976) * Adding preprocessor checks for torch version during torch cpp extensions compilation (#8989) * custom autograd func memory refinement (#8993) * Release torch tensor referenced by torch gradient graph (created in PythonOp) * Update orttraining/orttraining/python/training/ortmodule/torch_cpp_extensions/torch_interop_utils/torch_interop_utils.cc * refine with comments Co-authored-by: Wei-Sheng Chin <wschin@outlook.com> * Fix issues in TensorRT EP (#8996) * fix big engine load issue and add cuda_cpu_alloc * remove redundancy * fix minor issues * [js/web] fix karma launch with chrome headless (#8998) * Update Nuget Packge Pipline to CUDA11.4 and TensorRT8 on Windows (#9000) * Update to CUDA11.4 and TensorRT-8.0.3.4 * update trt pool, remove cudnn from setup_env_gpu.bat * revert pool * test gpu package pipeline on t4 * back out changes * back out changes Co-authored-by: George Wu <jywu@microsoft.com> * Fix fuzz testing build blocking release. (#9008) * add model local function support (#8540) * updates for picking pnnx commit * add tests filter to c# tests * plus test fixes * fix versioning for contrib ops * fix tests * test filter for optional ops * more versioning related updates * fix test * fix layernorm spec * more updates * update docs * add more test filters * more filters * update binary size threshold * update docs * draft - enable model local function * enable model local functions in ORT * update to latest rel onnx commit * plus tests * plus more updates * plus updates * test updates * Fix for nested functions + shape inference * plus bug fix and updates per review * plus fixes per review * plus test updates * plus updates per review * plus fixes * fix a test Co-authored-by: Vincent Wang <wangwchpku@outlook.com> Co-authored-by: baijumeswani <bmeswani@microsoft.com> Co-authored-by: pengwa <pengwa@microsoft.com> Co-authored-by: Wei-Sheng Chin <wschin@outlook.com> Co-authored-by: stevenlix <38092805+stevenlix@users.noreply.github.com> Co-authored-by: Yulong Wang <yulongw@microsoft.com> Co-authored-by: Chi Lo <54722500+chilo-ms@users.noreply.github.com> Co-authored-by: George Wu <jywu@microsoft.com> Co-authored-by: Pranav Sharma <prs@microsoft.com> Co-authored-by: Ashwini Khade <askhade@microsoft.com>	2021-09-09 15:05:38 -07:00
Chi Lo	5ae4c54ab8	Fix bug for validating GPU packages (#8997 )	2021-09-08 02:06:53 -07:00
George Wu	a30d9f5317	fix windows gpu pipelines that use cuda 10.2 (training, reduced_ops and 10.2 validation) (#8994 ) * build for arch 52 * arch 52 * gpu arch 52	2021-09-07 22:01:06 -07:00
Changming Sun	91c15843cd	Fix a directml python packaging error (#8981 )	2021-09-07 16:29:33 -07:00
Changming Sun	0bb56a18cf	Add TRT header file to ORT GPU nuget package (#8962 )	2021-09-07 09:50:09 -07:00
Scott McKay	eebcc20f10	Add netstandard2.0 framework to nuget managed package. (#8960 ) * Add netstandard2.0 to nuget managed package. Re-does PR that was backed out due to packaging pipeline changes. Allows deprecation of netstandard1.1 in the following release as netstandard2 is the preferred lowest level framework.	2021-09-04 08:01:46 +10:00
Olivia Jain	a0c9408f0d	Make TRT Version Configurable (#8864 ) * copy changes from trt_and_mem * second edits * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * change to cuda 11.4 * build with cuda 11.4 * Update Dockerfile.ubuntu_cuda11_1_tensorrt7_2 * add cmake extra defines * cmake architectures * fix cmake arch * Delete ubuntu-18.04.Dockerfile * Rename Dockerfile.ubuntu_cuda11_1_tensorrt7_2 to Dockerfile.ubuntu_cuda11_4_tensorrt7_2 * Update linux-gpu-tensorrt-ci-perf-pipeline.yml * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * removing previous ort args * rename to cuda 11.4 * remove cuda 10_2 * delete trt 7.1 * remove 7.1 * Passing in cuda architecture to reduce build time * always add submodule sync due to recursive cloning * fix run command * add and * take away unused arms and share python installation script * Update linux-gpu-tensorrt-ci-perf-pipeline.yml * Update Dockerfile.tensorrt * cleanup file * install python directly on dockerfile - move to scripts in future * Update Dockerfile.custom-trt-perf * adding cuda 11.1 for missing Libnvrtc.so.11.1 * Delete install_python.sh	2021-09-03 13:32:27 -07:00
Chi Lo	1f576e1766	Detect necessary files inside GPU packages (#8955 ) * Rename files * Update YAML files * Update validation script and YAML	2021-09-03 13:28:28 -07:00
liqun Fu	a7f5bd226b	retarget torch181 to torch182 (#8947 ) Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-09-03 09:44:42 -07:00
Gary Miguel	47435311f4	Include pytorch_export_contrib_ops in inference builds (#8878 ) * Include pytorch_export_contrib_ops in inference builds Rename / move it from tools/python/register_custom_ops_pytorch_exporter to onnxruntime/python/tools/pytorch_export_contrib_ops. Rationale for inclusion in inference builds: This code is potentially useful for anyone using ORT, not just training. Rationale for new name: "Contrib op" is the nomenclature used within ORT to refer to the set of ops that are not in the standard op set but are included by default with ORT. This is more specific than "custom op", which is what the PyTorch exporter uses to refer to any non-standard op. Step 1 of addressing #8818. After this is merged I will update the docs. * Enable test_pytorch_export_contrib_ops.py in CI Fixes AB#1342330	2021-09-02 14:26:58 -07:00
Changming Sun	1a34775fe9	Fix the benchmark code (#8926 )	2021-09-02 10:36:24 -07:00
Changming Sun	fbb6f0f599	Fix an error in Nuget pipeline caused by merge conflict	2021-09-02 09:26:25 -07:00
Sunghoon	332c2ba4f4	[js/web] Integrate ONNX Runtime Web CI with BrowserStack (#8859 ) * Integrate ONNX Runtime Web CI with BrowserStack * Rename a pipeline from browserstack to multi-platform	2021-09-01 17:25:57 -07:00
liqun Fu	f126a12699	decouple pytorch from onnxruntime training build (#8815 )	2021-09-01 16:31:53 -07:00
Scott McKay	858989293d	Reduce binary size of strided copy used by Concat (#8913 ) * Change the strided copy to switch on data size not data type. Move to header so we can reduce on the enabled types. Setup type reduction for Concat now that it's using this implementation.	2021-09-02 08:19:20 +10:00
Changming Sun	6299a60bf8	Nuget: splitting PDB files to a separated package (#8903 )	2021-09-01 09:07:24 -07:00
Suffian Khan	00b0a9c127	Add hugging-face models loss curve and performance guards to ROCm CI pipeline. (#8915 ) * test running hf bert-large * try again * try again * include other models * correct names * disable deberta-v2-xxlarge * avoid torch.distributed * add compare json loss and perf for bert-large to test * fix sed expression * remove pytest * add more models * move unit tests u * display samples/sec	2021-09-01 09:03:10 -07:00
Hariharan Seshadri	acd9db7fad	Fix location planning for initializers used only in nested subgraphs (#8642 )	2021-09-01 00:02:08 -07:00
Changming Sun	a9a0d3f6fa	Update min supported macOS version to 10.14	2021-08-31 16:09:48 -07:00
Changming Sun	129722db37	Add android binary size monitor back (#8904 )	2021-08-31 14:13:55 -07:00
Olivia Jain	33c0b3e94b	Perf test fixes (#8863 ) * fix anubis wheel upload and symbolic shape infer location * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * Update linux-gpu-tensorrt-ci-perf-pipeline.yml for Azure Pipelines * fix symbolic path * use master and call mem_test after build * Update linux-gpu-tensorrt-ci-perf-pipeline.yml * use installed symbolic shape infer TODO: check upon error * catch symbolic shape errors	2021-08-31 10:03:47 -07:00
Maajid khan	b7129305be	[OpenVINO-EP] UEP v3.1 Release with OpenVINO 2021.4 (#8892 ) * Add command to skip tests * Remove support for OV_2021.3_LTS and ov_2021.1 Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Removed request_id parameter from all references request_id parameter was being used with ov_2020.3 release. Starting from 2020.4 OV release, input_name paramater is being used instead to get the KernelContext_GetInput. Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Enabling CI Logs in the branch * CI Commits to enable logs * Enable CI Print * Added Imagescaler op to the supported op's list Fixes test_tiny_yolo_V2 opset 8 model to support fully on OV-EP. This model is the older variation of tiny_yolo_v2 model which has Imagescaler op. Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Added ops to fully support yolov3 model -Added changes to support yolov3 opset 10 model fully on CPU_FP32. -This also increases the operator coverage for GPU hardware. There by enabling yolov3 model on GPU with fewer subgraphs. Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Enabling tiny_yolov3 model fully on CPU ->Enabled tiny_yolov3 model fully on CPU. -> Also reduces the number of subgraphs to infer this model on GPU Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Adding GatherND op support for CPU and GPU ->This enables yolov3_pytorch model to work with fewer subgraphs on CPU and GPU Devices. Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Fixes Albert model for ISV customer ConvTranspose op was getting rejected due to a condition. Fixed it. Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Disabling this 4 cpp tests for openvino-ep These unit tests are failing with special conditions for conv_transpose op with output_shape attribute. so disabling them for now. Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Docker file changes for 2021.4-v3.1 * Remvoing duplicate code Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * ReduceMax No dimension supported * Fixes failing protobuf issue for docker Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Excluding openvinoep type for convtranpose test Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> * Disabled 2 Failing convtranspose tests with TensorRT EP Signed-off-by: MaajidKhan <n.maajidkhan@gmail.com> Co-authored-by: suryasidd <surya.siddharth.pemmaraju@intel.com> Co-authored-by: Aravind Gunda <aravindx.gunda@intel.com> Co-authored-by: sfatimar <sahar.fatima@intel/com>	2021-08-31 09:23:13 -07:00
Changming Sun	c6d9426ef2	Add binary size reporting back (#8883 )	2021-08-30 19:48:38 -07:00
Abhishek Jindal	868c8af9ac	Abjindal/eager mode pipeline (#8870 ) * Adding pipeline file for eager mode * adding the build eager mode flag * adding torch wheel files for installation * Changing pytorch version for change in wheel files * updating requirements file path * Removing Java and NodeJS from the build * removing import torch for testing build of eager mode * changing the build command * import torch * building eager mode separately * removing Java tests * python path issues * changing python path location * changing the build path file loc * installing torch before build * setting environment for building eager mode * Copying the build file and getting rid of flags * changing python path * adding missing packages * moving build eager mode code * changing python path to python3 * adding amd_hipify * adding logger file * install torch before build * change requirements file location * install torch before build eager * modifying eager mode build * modifying build location * adding new docker image * handling gradle move issue * Typo fix * changing deps file * adding java and nodejs * changing repo name for docker image * removing pybind * building only eager mode * changing the image name * removing install wheel package * build complete onnxruntime with eager mode * building wheel * enabling pybind * adding build eager mode flag in unit tests * removing build java nodejs * adding build command * removing java tests * moving Debug tests before Release * building Debug only case * changing debug test code * running the build eager mode with tests * adding build dir * adding build dir path * changing build dir path * changing build command for eager mode * building eager mode and running tests simultaneously * adding more flags to the pipeline * chaning flag * adding Debug and Release * changing torch to nightly build * changing torch version for nightly build * chaning torch version * move to Ubuntu image * adding pool * adding dockerfile for eager mode * adding python deps file for eager * modifying python deps file for eager * changing deps file * changing deps file statements * changing python path * REMOVING ECHO line * going to original docker file * changing docker file * changing to eager requirements file * changing python deps file * changing paths * changing cmake path * changing build script * changing python installation * running debug mode only * changing pipeline file * test name * test name * test name2 * changing requirements file * final flags for eager mode * previous pipeline * moving to ubuntu image and including some deps * adding cmake path * returning to manylinux image * removing unncecessary files for pipeline	2021-08-30 18:24:39 -07:00
Changming Sun	6df4e293ff	Remove unused code in tools/ci_build/github/azure-pipelines/nuget/templates/gpu.yml	2021-08-30 15:37:40 -07:00
Changming Sun	7cd46cb9c4	Fix a problem in Zip-Nuget-Java Packaging Pipeline	2021-08-30 14:51:36 -07:00
Edward Chen	b75c1081ca	[Objective-C] Enable static analysis, second try (#8875 ) The previous attempt to enable static analysis (#8842) didn't actually run the static analysis checks. - Run clang-tidy directly. - Address static analysis warnings.	2021-08-30 10:43:45 -07:00
satyajandhyala	84f9271a8d	Enable registering external custom op schemas on Linux (#8889 ) * Use manylinux instead of Ubuntu to run external custom ops build pipeline.	2021-08-30 10:13:47 -07:00
Changming Sun	03b680b940	Delete template.targets	2021-08-30 09:34:26 -07:00
Changming Sun	fa27c19342	Delete create_nuspect.py and template.nuspec	2021-08-30 09:34:26 -07:00
Changming Sun	1b5909dea8	Delete download_cmake.py (#8885 )	2021-08-30 09:34:08 -07:00
liqun Fu	c8dd0bf37e	to publish stable wheel to ort channel (#8873 ) Co-authored-by: liqun <liqun@OrtTrainingDev4.af05slrtruoetgaxwwjv5nsq5e.px.internal.cloudapp.net>	2021-08-30 09:33:01 -07:00
satyajandhyala	31926176ac	Support external custom operator schemas on Ubuntu (#8807 ) * Expose symbols in onnx and protobuf namespaces in python when building with --enable_external_custom_op_schemas * Add external onnx and protobuf files to wheel * Added an example to demonstrate external custom ops use-case * Added a Linux build pipeline to test external custom ops	2021-08-28 11:05:21 -07:00
Zuwei Zhao	89e8bff121	Enable selecting custom ops in onnxruntime-extensions. (#8826 ) * Enable selecting custom ops in onnxruntime-extensions. * Move cmake_helper.py. * Remove over-indented spaces. * Add doc. * Remove onnxruntime-extensions from git submodules, and user should pass path of onnxruntime-extensions for build. * Modify doc. * Remove argument --enable_onnxruntime_extensions and use --onnxruntime_extensions_path. * Fix build error. * Fix build error. * Use onnxruntime_extensions_path. * support both submodule and external source folders * refinement * Update cgmanifest.json * Support building onnxruntime-extensions from either git submodule or pre-pulled path. * Update doc. * more standard name * update docs * add the copyright header Co-authored-by: Zuwei Zhao <zuzhao@microsoft.com> Co-authored-by: Wenbing Li <wenbingl@outlook.com> Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>	2021-08-27 21:45:52 -07:00
Guoyu Wang	6a1939252f	Fix Android java API failure (#8865 ) * Fix Android Package break * Without java fix -- pipeline should fail * With java fix, should pass now * address CR comments	2021-08-27 15:58:56 -07:00
Scott McKay	0034ad72e6	Minimize changes to fix missing symbols used from C# (#8867 ) * Revert "Cleanup C# bindings to add EP (#8810)" This reverts commit `b21ea00020`. * Add back in a minimal set of changes. Provide stubs in for a limited set of things - things called from C# using a static lib of ORT built for mac/ios - things in OrtApis that are not included in the build by default - things in OrtApis that are excluded in a minimal build * Cleanup order or EPs in test * Fix unused function in ROCM build	2021-08-28 07:10:14 +10:00
Dmitri Smirnov	f3083f4bf3	Support of sparse initializers with smaller indices data type (#8834 ) Support of sparse initializers with smaller indices data type to save space. Make the script more efficient by selecting indices data type and checking resulting sparse bytes Exclude new code from SPARSE_TENSORS	2021-08-27 14:02:48 -07:00
Chi Lo	6a477acecf	Add tensorrt_provider_factory.h to artifact (#8869 )	2021-08-27 09:09:54 -07:00
Yulong Wang	e8564d6597	[js/web] update emsdk to v2.0.26 (#8653 ) * update emsdk to v2.0.26 * fix pooling build warning * fix build break * use pragma diagnostic semantic only when __GNUC__ is defined * fix build break * disable AttentionPastState_dynamic	2021-08-26 15:31:34 -07:00
Chi Lo	eb8f84e2a2	Fix issue of GPU tarball/zip/java package (#8850 ) * modify for test * modify for test * modify for test * modify for test * modify for test * modify for test * prepare for PR * Rename cuda directory to gpu directory in tarball * Fix gpu java package * fix bug * fix small bug	2021-08-26 10:16:16 -07:00
Edward Chen	0cfc4ec09d	[Objective-C] Enable static analysis (#8842 ) Add Objective-C API static analysis pipeline.	2021-08-26 09:13:52 -07:00
Changming Sun	ced2d8e597	Clean up TRT docker files (#8847 )	2021-08-25 22:26:31 -07:00
Changming Sun	9cd7d836f7	Delete Dockerfile.ubuntu_for_android (#8848 )	2021-08-25 22:25:14 -07:00
Scott McKay	b21ea00020	Cleanup C# bindings to add EP (#8810 ) Fix C# add EP bindings. Add stubs to ORT so that if EP is not included in the build we return a graceful error message. Move declaration of stubs into C API and out for EP so they're in one place and are easier to use (no extra header required in the C/C++ world and consistent with the CUDA EP setup). Fix inconsistency in ROCM EP. Cleanup a few other things.	2021-08-26 13:59:40 +10:00
Guoyu Wang	613a600471	relax android ci timeout to 180 minutes (#8844 )	2021-08-25 19:59:48 -07:00

1 2 3 4 5 ...

1198 commits