pytorch/caffe2
Summer Deng c10c73f047 Int8 FC performance debugging (#17700)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17700

Add performance debugging utilities in DNNLOWP FC operator and the python script

Reviewed By: amylittleyang

Differential Revision: D14321299

fbshipit-source-id: 50dbd7b352a1da5d2ecb659d8003e71e70750063
2019-03-08 19:03:54 -08:00
..
contrib Replace tensor.type().scalarType() calls with tensor.scalar_type() 2019-03-08 14:08:18 -08:00
core Open registration for c10 thread pool (#17788) 2019-03-08 15:38:41 -08:00
cuda_rtc Change ConvPoolOp<Context>::SetOutputSize to ConvPoolOp<Context>::GetOutputSize (#17764) 2019-03-07 18:38:53 -08:00
db
distributed
experiments
ideep Support failback for more operators in ideep (#17747) 2019-03-08 05:48:22 -08:00
image Open registration for c10 thread pool (#17788) 2019-03-08 15:38:41 -08:00
mobile Change ConvPoolOp<Context>::SetOutputSize to ConvPoolOp<Context>::GetOutputSize (#17764) 2019-03-07 18:38:53 -08:00
mpi
observers Remove GPU dependency from ProfileObserver (#17592) 2019-03-04 10:00:46 -08:00
onnx Do not rename net boundary inputs/outputs during ssaRewrite. (#17545) 2019-03-06 14:26:58 -08:00
operators Int8 FC performance debugging (#17700) 2019-03-08 19:03:54 -08:00
opt Catch exceptions in bound_shape_inference (#17775) 2019-03-08 13:18:28 -08:00
perfkernels use fp16<->fp32 intrinsic (#17496) 2019-03-07 02:23:07 -08:00
predictor add command line option to use hive filler; add README (#17619) 2019-03-01 13:56:15 -08:00
proto
python move half<->float conversions to oss operators (#17548) 2019-03-07 13:00:13 -08:00
quantization Int8 FC performance debugging (#17700) 2019-03-08 19:03:54 -08:00
queue
serialize
sgd
share Change ConvPoolOp<Context>::SetOutputSize to ConvPoolOp<Context>::GetOutputSize (#17764) 2019-03-07 18:38:53 -08:00
test
transforms
utils Optimize LayerNormOp (#17604) 2019-03-08 17:38:14 -08:00
video Open registration for c10 thread pool (#17788) 2019-03-08 15:38:41 -08:00
.clang-format
__init__.py
CMakeLists.txt Remove nomscheduler (#17693) 2019-03-06 10:48:13 -08:00
README.md
release-notes.md
requirements.txt
VERSION_NUMBER

Caffe2

Jenkins Build Status

Caffe2 is a lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind.

Questions and Feedback

Please use Github issues (https://github.com/pytorch/pytorch/issues) to ask questions, report bugs, and request new features.

Further Resources on Caffe2.ai