onnxruntime

mirror of https://github.com/saymrwulf/onnxruntime.git synced 2026-06-09 00:30:53 +00:00

History

Tianlei Wu 77b45c6503 Add Stable Diffusion Benchmark on A100-PCIE-80GB (#16702 ) 0(1) Fix a bug in https://github.com/microsoft/onnxruntime/pull/16560 that UNet shall be set fp16 flag. (2) Remove wget in requirements since it is no longer needed. (3) Add benchmark numbers in A100-PCIE-80GB. Note that CUDA EP have issue to run in batch size 4 so the number is not added.		2023-07-14 10:37:00 -07:00
..
contrib_ops	[ROCm] TunableOp: add hipBLASLt tuning logic (#16338 )	2023-07-14 08:20:58 +08:00
core	[ROCm] TunableOp: add hipBLASLt tuning logic (#16338 )	2023-07-14 08:20:58 +08:00
python	Add Stable Diffusion Benchmark on A100-PCIE-80GB (#16702 )	2023-07-14 10:37:00 -07:00
test	Triton Codegen for ORTModule (#15831 )	2023-07-13 18:17:58 +08:00
tool/etw	Run clang-format in CI (#15524 )	2023-04-18 09:26:58 -07:00
wasm	[js/web] enable ONNX Runtime Web error messages in JS (#16335 )	2023-06-15 09:45:41 -07:00
__init__.py	ExecutionProvider API refactor - move allocator from EP level to SessionState level and indexed by OrtDevice (#15833 )	2023-06-19 17:44:45 -07:00
ReformatSource.ps1	Run clang-format in CI (#15524 )	2023-04-18 09:26:58 -07:00
ReformatSourcePython.bat
VSCodeCoverage.runsettings