Mirror of https://github.com/saymrwulf/onnxruntime.git, synced 2026-05-14 20:48:00 +00:00
Implement CloudEP for hybrid inferencing. The PR introduces no new API; customers can configure session and run options to run inference against an Azure [Triton endpoint](https://learn.microsoft.com/en-us/azure/machine-learning/how-to-deploy-with-triton?tabs=azure-cli%2Cendpoint). A sample configuration in Python:

```python
sess_opt.add_session_config_entry('cloud.endpoint_type', 'triton')
sess_opt.add_session_config_entry('cloud.uri', 'https://cloud.com')
sess_opt.add_session_config_entry('cloud.model_name', 'detection2')
sess_opt.add_session_config_entry('cloud.model_version', '7')  # optional, default '1'
sess_opt.add_session_config_entry('cloud.verbose', '1')  # optional, default '0' (no verbose output)
...
run_opt.add_run_config_entry('use_cloud', '1')  # '0' for local inferencing, '1' for the cloud endpoint
run_opt.add_run_config_entry('cloud.auth_key', '...')
...
sess.run(None, {'input': input_}, run_opt)
```

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
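The `cloud.*` session-config entries above can be assembled in one place before being applied. The sketch below does this with a small helper; `make_cloud_config` and its defaults are hypothetical illustrations for this PR description, not part of the onnxruntime API, and the endpoint values are the placeholder ones from the sample above.

```python
# Hypothetical helper that collects the CloudEP session-config entries
# shown in the sample above. All values are strings, matching the
# signature of SessionOptions.add_session_config_entry(key, value).
def make_cloud_config(endpoint_type, uri, model_name,
                      model_version='1', verbose='0'):
    """Return the 'cloud.*' key/value pairs for the session options."""
    return {
        'cloud.endpoint_type': endpoint_type,
        'cloud.uri': uri,
        'cloud.model_name': model_name,
        'cloud.model_version': model_version,  # optional, default '1'
        'cloud.verbose': verbose,              # optional, default '0'
    }

if __name__ == '__main__':
    cfg = make_cloud_config('triton', 'https://cloud.com', 'detection2',
                            model_version='7')
    # With a real onnxruntime.SessionOptions object, one would apply these as:
    #   for key, value in cfg.items():
    #       sess_opt.add_session_config_entry(key, value)
    for key, value in cfg.items():
        print(f'{key}={value}')
```

Keeping the entries in a plain dict also makes it easy to validate or log the configuration before the session is created.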
Directory contents:

- eager
- post_to_dashboard
- bundle_dlls_gpu.bat
- bundle_nuget_with_native_headers.bat
- extract_nuget_files.ps1
- extract_nuget_files_gpu.ps1
- extract_zip_files_gpu.ps1
- helpers.ps1
- install_third_party_deps.ps1
- jar_gpu_packaging.ps1
- jar_packaging.ps1
- post_binary_sizes_to_dashboard.py
- post_code_coverage_to_dashboard.py
- setup_env.bat
- setup_env_cloud.bat
- setup_env_cuda_11.bat
- setup_env_gpu.bat
- setup_env_trt.bat
- setup_env_x86.bat