onnxruntime/cmake
Abhishek Udupa 83c59d2594
Session-aware and thread-safe CUDA profiler (#13706)
### Description
The existing CUDA profiler is neither session-aware nor thread-safe.
This PR makes it both.

### Motivation and Context
[PR 13549](https://github.com/microsoft/onnxruntime/pull/13549) brought
thread-safety and session-awareness to the ROCm profiler. This PR brings
the same goodness to the CUDA profiler as well.

Sample outputs of a profiling run of the StableDiffusion model on both
the CUDA and ROCm EPs are attached, along with a script that checks that
the trace files generated by the profiler are well-formed. This model was
chosen because it requires orchestrating multiple sessions, so it
verifies that the profilers are now session-aware.

Update 11/29: Updated the profile outputs. In the older outputs, some
timestamps were wildly out of range, which caused problems when
visualizing the traces. The bug has been fixed, and the check script now
also verifies that timestamps are monotonically increasing.
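
For readers without the attachment, the kind of check described above might look roughly like the following. This is a minimal sketch and not the attached script: it assumes each profile is a JSON array of Chrome-trace-style event objects with a numeric `ts` field, and it interprets "monotonically increasing" as non-decreasing in file order. The function names `check_trace` and `check_trace_text` are hypothetical.

```python
import json


def check_trace(events):
    """Return a list of problems found in a list of trace events.

    Assumption: each event is a dict with a numeric, non-negative "ts"
    field, and timestamps must not decrease in file order.
    """
    problems = []
    last_ts = None
    for i, ev in enumerate(events):
        if not isinstance(ev, dict):
            problems.append(f"event {i}: not a JSON object")
            continue
        ts = ev.get("ts")
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(ts, bool) or not isinstance(ts, (int, float)) or ts < 0:
            problems.append(f"event {i}: missing or invalid 'ts': {ts!r}")
            continue
        if last_ts is not None and ts < last_ts:
            problems.append(f"event {i}: ts {ts} is earlier than previous ts {last_ts}")
        last_ts = ts
    return problems


def check_trace_text(text):
    """Parse a trace file's contents and check it; an empty list means no problems."""
    try:
        events = json.loads(text)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc}"]
    if not isinstance(events, list):
        return ["top-level value is not a list of events"]
    return check_trace(events)
```

To check a file, pass its contents to `check_trace_text` and treat an empty result as a pass. A stricter variant could track the last timestamp per process and thread rather than globally; which of these the attached script does is not stated here.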


[sd_profile_outputs_cuda.tar.gz](https://github.com/microsoft/onnxruntime/files/10118088/sd_profile_outputs_cuda.tar.gz)

[sd_profile_outputs_rocm.tar.gz](https://github.com/microsoft/onnxruntime/files/10118089/sd_profile_outputs_rocm.tar.gz)

[check_profile_output_well_formedness.zip](https://github.com/microsoft/onnxruntime/files/10118090/check_profile_output_well_formedness.zip)

Co-authored-by: Abhishek Udupa <abhishek.udupa@microsoft.com>
2022-12-09 13:22:12 -08:00
external Add protobuf version constraint (#13870) 2022-12-08 16:14:16 -08:00
patches Patch Protobuf and ONNX's cmake files and enforce BinSkim check (#13694) 2022-11-18 10:09:47 -08:00
tensorboard Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
adjust_global_compile_flags.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
CMakeLists.txt Improve the handling of /external:I (#13904) 2022-12-09 11:44:32 -08:00
CMakeSettings.json
codeconv.runsettings
deps.txt Remove unused git submodules (#13830) 2022-12-07 21:59:16 -08:00
EnableVisualStudioCodeAnalysis.props Fix SDL warnings in CPU EP (#9975) 2021-12-19 20:54:29 -08:00
gdk_toolchain.cmake Enable building with a GDK (#11126) 2022-04-07 15:06:31 -07:00
Info.plist.in
libonnxruntime.pc.cmake.in
nuget_helpers.cmake
onnxruntime.cmake Remove miscellaneous nuphar configs (#13070) 2022-09-26 13:41:28 -07:00
onnxruntime_codegen_tvm.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnxruntime_common.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnxruntime_config.h.in [wasm] update emscripten v2.0.34 (#10391) 2022-01-26 14:46:02 -08:00
onnxruntime_csharp.cmake Enable nuget packages for on device training (#13637) 2022-12-05 14:54:09 -08:00
onnxruntime_eager.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnxruntime_flatbuffers.cmake Switch GSL to MS GSL 4.0.0 (#13416) 2022-10-29 04:15:20 -07:00
onnxruntime_framework.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnxruntime_fuzz_test.cmake
onnxruntime_graph.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnxruntime_ios.toolchain.cmake
onnxruntime_java.cmake Add linux and macos arm64 java artifacts (#10981) 2022-03-25 16:23:17 -07:00
onnxruntime_java_unittests.cmake
onnxruntime_kernel_explorer.cmake Share TunableOp between CUDA and ROCM EP (#13560) 2022-11-11 13:56:44 +08:00
onnxruntime_language_interop_ops.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnxruntime_mlas.cmake Switch GSL to MS GSL 4.0.0 (#13416) 2022-10-29 04:15:20 -07:00
onnxruntime_nodejs.cmake
onnxruntime_objectivec.cmake Remove SafeInt dependency from Objective-C API. (#13698) 2022-11-18 17:06:12 -08:00
onnxruntime_opschema_lib.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnxruntime_optimizer.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnxruntime_providers.cmake Remove unused git submodules (#13830) 2022-12-07 21:59:16 -08:00
onnxruntime_pyop.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnxruntime_python.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnxruntime_rocm_hipify.cmake Session-aware and thread-safe CUDA profiler (#13706) 2022-12-09 13:22:12 -08:00
onnxruntime_session.cmake Fix hipify to avoid nccl_service.h: No such file or directory (#13852) 2022-12-07 09:10:37 +08:00
onnxruntime_snpe_provider.cmake cmake changes for SNPE EP (#11821) 2022-06-13 08:15:37 -07:00
onnxruntime_training.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnxruntime_unittests.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnxruntime_util.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
onnxruntime_webassembly.cmake [wasm] disable closure compiler in debug build (#13865) 2022-12-08 13:18:19 -08:00
precompiled_header.cmake
Sdl.ruleset Update Sdl.ruleset to remove C26812 from the rules (#12695) 2022-09-01 20:05:20 -07:00
set_winapi_family_desktop.h
target_delayload.cmake Remove Windows Store specific code 2022-03-17 23:38:14 -07:00
uwp_stubs.h
wcos_rules_override.cmake
winml.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00
winml_cppwinrt.cmake
winml_sdk_helpers.cmake
winml_unittests.cmake Improve dependency management (#13523) 2022-12-01 09:51:59 -08:00