onnxruntime/tools/ci_build/github
RandySheriffH 587e891cae
CloudEP (#13855)
Implement CloudEP for hybrid inferencing.
The PR introduces zero new API, customers could configure session and
run options to do inferencing with Azure [triton
endpoint.](https://learn.microsoft.com/en-us/azure/machine-learning/how-to-deploy-with-triton?tabs=azure-cli%2Cendpoint)
Sample configuration in python be like:

```
sess_opt.add_session_config_entry('cloud.endpoint_type', 'triton');
sess_opt.add_session_config_entry('cloud.uri', 'https://cloud.com');
sess_opt.add_session_config_entry('cloud.model_name', 'detection2');
sess_opt.add_session_config_entry('cloud.model_version', '7'); // optional, default 1
sess_opt.add_session_config_entry('cloud.verbose', '1'); // optional, default '0', meaning no verbose
...
run_opt.add_run_config_entry('use_cloud', '1') # 0 for local inferencing, 1 for cloud endpoint.
run_opt.add_run_config_entry('cloud.auth_key', '...')
...
sess.run(None, {'input':input_}, run_opt)
```

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2023-01-03 10:03:15 -08:00
..
android Add onnxruntime_BUILD_UNIT_TESTS=OFF definition to iOS package build options. (#13238) 2022-10-10 18:00:17 -07:00
apple Remove SafeInt dependency from Objective-C API. (#13698) 2022-11-18 17:06:12 -08:00
azure-pipelines CloudEP (#13855) 2023-01-03 10:03:15 -08:00
js Use full ORT package for onnxruntime-react-native. (#13037) 2022-09-23 07:20:03 +10:00
linux CloudEP (#13855) 2023-01-03 10:03:15 -08:00
pai [ROCm] Update ROCm and MigraphX CI to ROCm5.4 (#14011) 2022-12-22 10:01:05 +08:00
python_checks Add Python checks pipeline (#7032) 2021-08-09 10:37:05 -07:00
windows CloudEP (#13855) 2023-01-03 10:03:15 -08:00
Doxyfile_csharp.cfg Add Xamarin support (#9436) 2021-10-27 20:07:07 +10:00