pytorch/caffe2
Ilia Cherniavskii f5c95d5cf1 Source code level attribution in profiler (#43898)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43898

Adds a with_source parameter that enables tracking of the source code
location (filename and line) in the profiler for eager, TorchScript, and
autograd modes.

Test Plan:
python test/test_profiler.py
```
Name                                 Self CPU total %  Self CPU total   CPU total %      CPU total        CPU time avg     Number of Calls  Source Location
-----------------------------------  ---------------  ---------------  ---------------  ---------------  ---------------  ---------------  --------------------------------------------
ts_method_1                          10.43%           235.364us        36.46%           822.920us        822.920us        1                test/test_profiler.py(70): test_source
aten::add                            7.52%            169.833us        8.88%            200.439us        200.439us        1                test/test_profiler.py(69): test_source
aten::normal_                        6.26%            141.380us        6.26%            141.380us        141.380us        1                test/test_profiler.py(67): test_source
aten::add                            5.80%            130.830us        8.41%            189.800us        63.267us         3                test/test_profiler.py(72): test_source
aten::sum                            5.02%            113.340us        8.39%            189.475us        189.475us        1                test/test_profiler.py(64): ts_method_1
aten::add                            4.58%            103.346us        6.33%            142.847us        142.847us        1                test/test_profiler.py(62): ts_method_1
aten::mul                            4.05%            91.498us         9.62%            217.113us        217.113us        1                test/test_profiler.py(71): test_source
aten::add                            4.03%            90.880us         5.60%            126.405us        126.405us        1                test/test_profiler.py(58): ts_method_2
aten::empty                          3.49%            78.735us         3.49%            78.735us         19.684us         4                test/test_profiler.py(72): test_source
```

Reviewed By: ngimel

Differential Revision: D23432664

Pulled By: ilia-cher

fbshipit-source-id: 83ad7ebe0c2502494d3b48c4e687802db9c77615
2020-09-30 00:57:35 -07:00
contrib Remove __future__ imports for legacy Python2 supports (#45033) 2020-09-23 17:57:02 -07:00
core Remove hacky_wrapper from BackendSelect kernels (#44062) 2020-09-25 09:04:03 -07:00
cuda_rtc
db Formatting cmake (to lowercase without space for if/elseif/else/endif) (#35521) 2020-03-27 14:25:17 -07:00
distributed [Distributed] DeleteKey API for c10d TCP Store (#45401) 2020-09-28 15:30:39 -07:00
experiments Remove __future__ imports for legacy Python2 supports (#45033) 2020-09-23 17:57:02 -07:00
ideep Ensure IDEEP transpose operator works correctly 2020-08-11 12:58:31 -07:00
image
mobile Make msg() and msg_with_backtrace() private (#37094) 2020-05-04 11:54:34 -07:00
mpi [codemod][caffe2] Run clang-format - 5/7 2020-06-30 15:45:11 -07:00
observers [codemod][caffe2] Run clang-format - 5/7 2020-06-30 15:45:11 -07:00
onnx Add python inferface to create OfflineTensor (#42516) 2020-08-04 01:31:34 -07:00
operators Adding Type Double to Caffe2 Mean Op 2020-09-28 13:35:29 -07:00
opt Add operator to compute the equalization scale (#45096) 2020-09-24 15:19:49 -07:00
perfkernels Remove __future__ imports for legacy Python2 supports (#45033) 2020-09-23 17:57:02 -07:00
predictor [caffe2] Special handling of If/AsyncIf op in RemoveOpsByType (#42286) 2020-07-30 02:02:20 -07:00
proto Annotate torch.utils.(tensorboard/show_pickle/hypify) (#44216) 2020-09-29 18:14:26 -07:00
python [c2] Fix int types for learning rate 2020-09-26 10:59:29 -07:00
quantization Adding Histogram Binning Calibration to DSNN and Adding Type Double to Caffe2 ParallelSumOp/SumReluOp 2020-09-28 15:21:31 -07:00
queue [caffe2] adds Cancel to SafeDequeueBlobsOp and SafeEnqueueBlobsOp (#45177) 2020-09-24 14:22:46 -07:00
serialize Enable TF32 support for cuDNN (#40737) 2020-09-01 15:34:24 -07:00
sgd [c2] Fix int types for learning rate 2020-09-26 10:59:29 -07:00
share Update pthreadpool to pthreadpool:029c88620802e1361ccf41d1970bd5b07fd6b7bb. (#40524) 2020-07-09 10:00:36 -07:00
test
transforms Replaced whitelisted with allowed (#41867) 2020-07-23 16:53:51 -07:00
utils CUDA RTX30 series support (#45489) 2020-09-29 18:19:23 -07:00
video [ROCm][Caffe2] Enable MIOpen 3D Pooling (#38260) 2020-07-08 17:42:55 -07:00
.clang-format
__init__.py remediation of S205607 2020-07-17 17:19:47 -07:00
c2_aten_srcs.bzl
CMakeLists.txt Source code level attribution in profiler (#43898) 2020-09-30 00:57:35 -07:00
README.md Grammar patch 1 (.md) (#41599) 2020-07-20 10:19:08 -07:00
release-notes.md
requirements.txt
VERSION_NUMBER

Caffe2

Caffe2 is a lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind.

Questions and Feedback

Please use GitHub issues (https://github.com/pytorch/pytorch/issues) to ask questions, report bugs, and request new features.

Further Resources on Caffe2.ai