onnxruntime/tools
RandySheriffH 587e891cae
CloudEP (#13855)
Implement CloudEP for hybrid inferencing.
The PR introduces zero new API, customers could configure session and
run options to do inferencing with Azure [triton
endpoint.](https://learn.microsoft.com/en-us/azure/machine-learning/how-to-deploy-with-triton?tabs=azure-cli%2Cendpoint)
Sample configuration in python be like:

```
sess_opt.add_session_config_entry('cloud.endpoint_type', 'triton');
sess_opt.add_session_config_entry('cloud.uri', 'https://cloud.com');
sess_opt.add_session_config_entry('cloud.model_name', 'detection2');
sess_opt.add_session_config_entry('cloud.model_version', '7'); // optional, default 1
sess_opt.add_session_config_entry('cloud.verbose', '1'); // optional, default '0', meaning no verbose
...
run_opt.add_run_config_entry('use_cloud', '1') # 0 for local inferencing, 1 for cloud endpoint.
run_opt.add_run_config_entry('cloud.auth_key', '...')
...
sess.run(None, {'input':input_}, run_opt)
```

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2023-01-03 10:03:15 -08:00
..
android_custom_build Replace references to onnxruntime 'master' with 'main' in Dockerfiles. (#12550) 2022-08-16 14:13:05 -07:00
ci_build CloudEP (#13855) 2023-01-03 10:03:15 -08:00
doc Format all python files under onnxruntime with black and isort (#11324) 2022-04-26 09:35:16 -07:00
nuget Enable nuget packages for on device training (#13637) 2022-12-05 14:54:09 -08:00
perf_view fix json format (#11046) 2022-03-30 16:15:33 -07:00
python [React Native CI] Record more info to debug E2E test (#13329) 2022-10-18 17:21:28 -07:00