pytorch/benchmarks
Taylor Robie 87f7fdfd5c Allow instruction counting to use shared memory as a staging ground. (And a couple other tweaks.) (#56711)
Summary:
This is actually something I discovered a while ago with the wall of serotonin. It was really easy for large scale runs to get bottlenecked on disk access. I have a hack in the working files of that machine to use `/dev/shm`, but I figured I should formalize and actually make a respectable utility.

I also added a param to tweak the run cadence and print when a CorePool is created; these are just to make the CI logs a bit nicer. (A printout each second on a 40 minute CI job is a bit much...)

Pull Request resolved: https://github.com/pytorch/pytorch/pull/56711

Reviewed By: agolynski

Differential Revision: D28392248

Pulled By: robieta

fbshipit-source-id: b6aa7445c488d8e4ab9d4b31ab18df4e12783d8f
2021-05-12 20:37:41 -07:00
..
cpp [clang-tidy] Exclude cppcoreguidelines-avoid-magic-numbers (#57841) 2021-05-07 20:02:33 -07:00
distributed test experiment script (#57925) 2021-05-12 10:22:47 -07:00
fastrnns Add lint for unqualified noqa (#56272) 2021-04-19 13:16:18 -07:00
framework_overhead_benchmark
functional_autograd_benchmark Add lint for unqualified type: ignore (#56290) 2021-04-21 08:07:23 -07:00
instruction_counts Allow instruction counting to use shared memory as a staging ground. (And a couple other tweaks.) (#56711) 2021-05-12 20:37:41 -07:00
operator_benchmark Improve BatchNorm1d performance (CUDA) (#57786) 2021-05-08 19:09:29 -07:00
overrides_benchmark Remove legacy constructor calls from pytorch codebase. (#54142) 2021-04-11 15:45:17 -07:00
profiler_benchmark
record_function_benchmark
serialization
sparse Add CSR (compressed sparse row) layout for sparse tensors (#50937) 2021-04-12 10:09:12 -07:00
static_runtime [Static Runtime] Fix bug in aten::clone (#58100) 2021-05-11 22:47:25 -07:00
tensorexpr [NNC] Implementation for aten::cat without conditionals. (#53128) 2021-03-07 22:57:02 -08:00
compare-fastrnn-results.py
compare.sh
README.md Add CSR (compressed sparse row) layout for sparse tensors (#50937) 2021-04-12 10:09:12 -07:00
upload_scribe.py

PyTorch Benchmarks

This folder contains scripts that produce reproducible timings of various PyTorch features.

It also provides mechanisms to compare PyTorch with other frameworks.

Setup environment

Make sure you're on a machine with CUDA, torchvision, and pytorch installed. Install in the following order:

# Install torchvision. It comes with the pytorch stable release binary
conda install pytorch torchvision -c pytorch

# Install the latest pytorch master from source.
# It should supersede the installation from the release binary.
cd $PYTORCH_HOME
python setup.py build develop

# Check the pytorch installation version
python -c "import torch; print(torch.__version__)"

Benchmark List

Please refer to each subfolder to discover each benchmark suite