onnxruntime/tools/ci_build
RandySheriffH 587e891cae
CloudEP (#13855)
Implement CloudEP for hybrid inferencing.
The PR introduces no new API; customers configure session and run
options to run inference against an Azure [Triton
endpoint](https://learn.microsoft.com/en-us/azure/machine-learning/how-to-deploy-with-triton?tabs=azure-cli%2Cendpoint).
A sample configuration in Python looks like:

```
sess_opt.add_session_config_entry('cloud.endpoint_type', 'triton')
sess_opt.add_session_config_entry('cloud.uri', 'https://cloud.com')
sess_opt.add_session_config_entry('cloud.model_name', 'detection2')
sess_opt.add_session_config_entry('cloud.model_version', '7')  # optional, default 1
sess_opt.add_session_config_entry('cloud.verbose', '1')  # optional, default '0', meaning no verbose
...
run_opt.add_run_config_entry('use_cloud', '1') # 0 for local inferencing, 1 for cloud endpoint.
run_opt.add_run_config_entry('cloud.auth_key', '...')
...
sess.run(None, {'input':input_}, run_opt)
```
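
The configuration above is just a set of string key/value pairs split between session options and run options. As a minimal sketch, the pairs could be collected in plain dictionaries and applied in a loop. The helper names `cloud_session_entries`, `cloud_run_entries`, and `apply_entries` are hypothetical and not part of onnxruntime; only the config keys and their example values come from the commit message, and which keys the build accepts depends on CloudEP being enabled.

```python
from typing import Callable, Dict, Optional


def cloud_session_entries(uri: str, model_name: str,
                          model_version: Optional[str] = None,
                          verbose: bool = False,
                          endpoint_type: str = "triton") -> Dict[str, str]:
    """Collect the session-level 'cloud.*' keys (hypothetical helper)."""
    entries = {
        "cloud.endpoint_type": endpoint_type,
        "cloud.uri": uri,
        "cloud.model_name": model_name,
        "cloud.verbose": "1" if verbose else "0",
    }
    # 'cloud.model_version' is optional; the commit message says it defaults to 1.
    if model_version is not None:
        entries["cloud.model_version"] = model_version
    return entries


def cloud_run_entries(use_cloud: bool,
                      auth_key: Optional[str] = None) -> Dict[str, str]:
    """Collect the per-run keys: '0' = local inferencing, '1' = cloud endpoint."""
    entries = {"use_cloud": "1" if use_cloud else "0"}
    # Assumption: the auth key is only passed along when a cloud run is requested.
    if use_cloud and auth_key is not None:
        entries["cloud.auth_key"] = auth_key
    return entries


def apply_entries(add_entry: Callable[[str, str], None],
                  entries: Dict[str, str]) -> None:
    """Feed each key/value pair to an 'add_*_config_entry'-style callable."""
    for key, value in entries.items():
        add_entry(key, value)
```

With a real session this would be used as `apply_entries(sess_opt.add_session_config_entry, cloud_session_entries('https://cloud.com', 'detection2', model_version='7'))` and `apply_entries(run_opt.add_run_config_entry, cloud_run_entries(True, auth_key))`. Keeping the per-run switch separate makes it easy to send one call to the endpoint and the next to the local model with the same session.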

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2023-01-03 10:03:15 -08:00
github CloudEP (#13855) 2023-01-03 10:03:15 -08:00
__init__.py
amd_hipify.py Multi-stream execution support (#13495) 2022-12-15 07:39:29 -08:00
build.py CloudEP (#13855) 2023-01-03 10:03:15 -08:00
clean_docker_image_cache.py
coverage.py
gen_def.py Add CANN EP (#12416) 2022-09-22 14:53:40 -07:00
get_docker_image.py Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
logger.py
op_registration_utils.py Update kernel matching logic: decouple from op schemas and remove kernel def hashes (#12791) 2022-09-20 14:24:59 -07:00
op_registration_validator.py Update CUDA ArgMin/ArgMax op kernels to have end version 11 since opset 12+ is not supported yet. (#13983) 2022-12-21 19:01:00 -05:00
policheck_exclusions.xml Exculde hipify option from policheck (#13431) 2022-10-25 16:35:16 +08:00
reduce_op_kernels.py Update kernel matching logic: decouple from op schemas and remove kernel def hashes (#12791) 2022-09-20 14:24:59 -07:00
replace_urls_in_deps.py Move C/C++ deps' URLs to deps.txt (#13769) 2022-11-29 18:06:35 -08:00
requirements.txt Deprecate one step beam search (#14046) 2022-12-22 23:14:31 -08:00
update_tsaoptions.py Add license header to some files. (#13074) 2022-09-23 18:46:02 -07:00
upload_python_package_to_azure_storage.py