pytorch/benchmarks
2024-08-21 23:10:12 +00:00
..
distributed
dynamo Increase max total number of dynamo partitions to 15 (#134153) 2024-08-21 23:10:12 +00:00
fastrnns
framework_overhead_benchmark
functional_autograd_benchmark [BE][Easy] enable ruff rule PIE790: unnecessary pass statement (#133200) 2024-08-15 15:50:19 +00:00
fuser
gpt_fast [BC breaking] move benchmarking + prefer inductor path (#132827) 2024-08-08 00:47:45 +00:00
inference
instruction_counts Add instruction count benchmark to run on pull requests (#131475) 2024-08-12 05:20:26 +00:00
nested
operator_benchmark Fix out_tensor device in diag_test.py (#134020) 2024-08-21 20:43:39 +00:00
overrides_benchmark
profiler_benchmark
record_function_benchmark
serialization
sparse [BC breaking] move benchmarking + prefer inductor path (#132827) 2024-08-08 00:47:45 +00:00
static_runtime
tensorexpr
transformer Add explicit GQA support. (#131559) 2024-08-09 21:25:35 +00:00
compare-fastrnn-results.py
compare.sh
README.md
upload_scribe.py

PyTorch Benchmarks

This folder contains scripts that produce reproducible timings of various PyTorch features.

It also provides mechanisms to compare PyTorch with other frameworks.

Setup environment

Make sure you're on a machine with CUDA, torchvision, and pytorch installed. Install in the following order:

# Install torchvision. It comes with the pytorch stable release binary
conda install pytorch torchvision -c pytorch

# Install the latest pytorch master from source.
# It should supersede the installation from the release binary.
cd $PYTORCH_HOME
python setup.py build develop

# Check the pytorch installation version
python -c "import torch; print(torch.__version__)"

Benchmark List

Please refer to each subfolder to discover each benchmark suite. Links are provided where descriptions exist: