mirror of
https://github.com/saymrwulf/onnxruntime.git
synced 2026-05-16 21:00:14 +00:00
Implement CloudEP for hybrid inferencing. The PR introduces zero new APIs; customers can configure session and run options to do inferencing with an Azure [Triton endpoint](https://learn.microsoft.com/en-us/azure/machine-learning/how-to-deploy-with-triton?tabs=azure-cli%2Cendpoint). A sample configuration in Python looks like:

```python
sess_opt.add_session_config_entry('cloud.endpoint_type', 'triton')
sess_opt.add_session_config_entry('cloud.uri', 'https://cloud.com')
sess_opt.add_session_config_entry('cloud.model_name', 'detection2')
sess_opt.add_session_config_entry('cloud.model_version', '7')  # optional, default '1'
sess_opt.add_session_config_entry('cloud.verbose', '1')  # optional, default '0' (no verbose output)
...
run_opt.add_run_config_entry('use_cloud', '1')  # '0' for local inferencing, '1' for the cloud endpoint
run_opt.add_run_config_entry('cloud.auth_key', '...')
...
sess.run(None, {'input': input_}, run_opt)
```

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
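Since all CloudEP settings travel through string key/value config entries rather than a dedicated API, it can help to gather them in one place before applying them. The sketch below is a hypothetical helper (the function name and defaults are not part of ONNX Runtime); only the `cloud.*` keys and their default values come from the description above.

```python
def make_cloud_session_entries(uri, model_name, model_version='1',
                               verbose=False, endpoint_type='triton'):
    """Collect the CloudEP session-config entries as a plain dict.

    Hypothetical convenience helper; the keys and defaults mirror the
    sample configuration in the PR description, nothing more.
    """
    return {
        'cloud.endpoint_type': endpoint_type,
        'cloud.uri': uri,
        'cloud.model_name': model_name,
        'cloud.model_version': model_version,   # optional, default '1'
        'cloud.verbose': '1' if verbose else '0',  # optional, default '0'
    }

# Applying the entries to a SessionOptions object would then be a loop:
#   for key, value in make_cloud_session_entries(...).items():
#       sess_opt.add_session_config_entry(key, value)
entries = make_cloud_session_entries('https://cloud.com', 'detection2',
                                     model_version='7', verbose=True)
```

Keeping the entries in a dict also makes it easy to log or validate the configuration before the session is created.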