PyTorch Benchmarks

This folder contains scripts that produce reproducible timings of various PyTorch features.

It also provides mechanisms to compare PyTorch with other frameworks.

Setup environment

Make sure you're on a machine with CUDA, torchvision, and PyTorch installed. Install them in the following order:

# Install torchvision together with the PyTorch stable release binary
conda install pytorch torchvision -c pytorch

# Install the latest PyTorch master from source.
# It should supersede the installation from the release binary.
cd $PYTORCH_HOME
python setup.py build develop

# Check the pytorch installation version
python -c "import torch; print(torch.__version__)"
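
The one-line version check above can be extended into a slightly fuller sanity check before running any benchmark. The following is a minimal sketch, not part of this repository: the helper name `check_environment` and the shape of its report are assumptions made for illustration. It only relies on `importlib` from the standard library and on `torch.cuda.is_available()`, and it reports a missing package as `None` instead of raising.

```python
import importlib
import importlib.util


def check_environment(packages=("torch", "torchvision")):
    """Hypothetical helper: map each package to its version, or None if unusable."""
    report = {}
    for name in packages:
        # find_spec returns None when the top-level package is not installed.
        if importlib.util.find_spec(name) is None:
            report[name] = None
            continue
        try:
            module = importlib.import_module(name)
        except Exception:
            # Installed but broken (for example a mismatched binary build).
            report[name] = None
            continue
        report[name] = getattr(module, "__version__", "unknown")

    # CUDA availability can only be queried when torch itself imports.
    if report.get("torch") is not None:
        import torch

        report["cuda_available"] = bool(torch.cuda.is_available())
    else:
        report["cuda_available"] = False
    return report


if __name__ == "__main__":
    for key, value in check_environment().items():
        print(f"{key}: {value}")
```

If the source build succeeded, the `torch` entry should show the version of the source build rather than the release binary, which is the same thing the one-line check confirms.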

Benchmark List

Refer to each subfolder for details on its benchmark suite. Links are provided where descriptions exist: