onnxruntime/onnxruntime
Tianlei Wu 77b45c6503
Add Stable Diffusion Benchmark on A100-PCIE-80GB (#16702)
0(1) Fix a bug in https://github.com/microsoft/onnxruntime/pull/16560
that UNet shall be set fp16 flag.
(2) Remove wget in requirements since it is no longer needed.
(3) Add benchmark numbers in A100-PCIE-80GB. Note that CUDA EP have
issue to run in batch size 4 so the number is not added.
2023-07-14 10:37:00 -07:00
..
contrib_ops [ROCm] TunableOp: add hipBLASLt tuning logic (#16338) 2023-07-14 08:20:58 +08:00
core [ROCm] TunableOp: add hipBLASLt tuning logic (#16338) 2023-07-14 08:20:58 +08:00
python Add Stable Diffusion Benchmark on A100-PCIE-80GB (#16702) 2023-07-14 10:37:00 -07:00
test Triton Codegen for ORTModule (#15831) 2023-07-13 18:17:58 +08:00
tool/etw Run clang-format in CI (#15524) 2023-04-18 09:26:58 -07:00
wasm [js/web] enable ONNX Runtime Web error messages in JS (#16335) 2023-06-15 09:45:41 -07:00
__init__.py ExecutionProvider API refactor - move allocator from EP level to SessionState level and indexed by OrtDevice (#15833) 2023-06-19 17:44:45 -07:00
ReformatSource.ps1 Run clang-format in CI (#15524) 2023-04-18 09:26:58 -07:00
ReformatSourcePython.bat
VSCodeCoverage.runsettings