pytorch/caffe2/core
Scott Wolchok c47464ed95 [PyTorch] Further reduce cost of TypeMeta::_typeMetaData (by 10x!) (#98105)
Currently we should be paying a small cost for the
thread-safe initialization of `index`. Now we should eliminate that
cost. (10x figure in the title comes from internal benchmark that just
calls `TypeMeta::Match<caffe2::Tensor>()` in a loop).

Differential Revision: [D44597852](https://our.internmc.facebook.com/intern/diff/D44597852/)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/98105
Approved by: https://github.com/ezyang
2023-04-12 17:44:48 +00:00
..
hip Replace all CHECK_ and DCHECK_ with TORCH_* macros (#82032) 2022-07-26 01:20:44 +00:00
nomnigraph Update CMakeLists.txt (#87030) 2022-10-28 04:56:40 +00:00
__init__.py
allocator.cc
allocator.h
blob.h [caffe2] Micro-optimizations in BlobGetMutableTensor (#98103) 2023-04-10 19:43:02 +00:00
blob_gpu_test.cc
blob_serialization.cc [caffe2] Don't copy Tensor dims during deserialization (#79471) 2022-07-12 21:36:26 +00:00
blob_serialization.h [caffe2] Don't copy Tensor dims during deserialization (#79471) 2022-07-12 21:36:26 +00:00
blob_serialization_gpu.cc
blob_serializer_base.h
blob_stats.cc
blob_stats.h
blob_test.cc Fix sign-compare in caffe2 cpp tests 2022-04-05 00:08:05 +00:00
CMakeLists.txt Remove caffe2 mobile (#84338) 2022-09-08 01:49:55 +00:00
common.cc
common.h
common_cudnn.cc Replace all CHECK_ and DCHECK_ with TORCH_* macros (#82032) 2022-07-26 01:20:44 +00:00
common_cudnn.h
common_gpu.cc
common_gpu.h [CUDA] Drop CUDA 10 support (#89582) 2023-01-05 05:11:53 +00:00
common_omp.h
common_test.cc
context.cc
context.h
context_base.cc
context_base.h
context_gpu.cu Fix C2 Ambiguous namespace (#89534) 2022-12-07 23:36:41 +00:00
context_gpu.h [caffe2] dont call cudnnDestroy on thread exit (crashes on windows with cuda 11/12) (#95382) 2023-03-10 06:42:51 +00:00
context_gpu_test.cc Replace all CHECK_ and DCHECK_ with TORCH_* macros (#82032) 2022-07-26 01:20:44 +00:00
context_test.cc cleanup unused include (#93359) 2023-02-04 02:15:50 +00:00
cudnn_wrappers.h Replace all CHECK_ and DCHECK_ with TORCH_* macros (#82032) 2022-07-26 01:20:44 +00:00
db.cc
db.h
distributions_stubs.h
event.cc
event.h
event_cpu.h
event_gpu.cc
event_gpu_test.cc
event_test.cc
export_c10_op_to_caffe2.cc
export_c10_op_to_caffe2.h
export_caffe2_op_to_c10.h [caffe2] Remove OperatorBase::newstyle_outputs_ (#67093) 2023-01-23 22:41:59 +00:00
flags.h
graph.cc
graph.h
graph_test.cc
init.cc
init.h
init_denormals.cc
init_intrinsics_check.cc
init_omp.cc
init_test.cc
int8_serialization.cc
logging.h
macros.h
macros.h.in remove unused CAFFE2_VERSION macros (#97337) 2023-03-24 16:02:35 +00:00
memonger.cc
memonger.h
module.cc
module.h
module_test.cc
net.cc
net.h
net_async_base.cc
net_async_base.h
net_async_scheduling.cc
net_async_scheduling.h
net_async_task.cc
net_async_task.h
net_async_task_future.cc
net_async_task_future.h
net_async_task_graph.cc
net_async_task_graph.h
net_async_tracing.cc
net_async_tracing.h
net_async_tracing_test.cc
net_dag_utils.cc
net_dag_utils.h
net_dag_utils_test.cc
net_gpu_test.cc Replace all CHECK_ and DCHECK_ with TORCH_* macros (#82032) 2022-07-26 01:20:44 +00:00
net_parallel.cc
net_parallel.h
net_simple.cc
net_simple.h
net_simple_refcount.cc
net_simple_refcount.h
net_simple_refcount_test.cc
net_test.cc Replace all CHECK_ and DCHECK_ with TORCH_* macros (#82032) 2022-07-26 01:20:44 +00:00
numa.cc
numa.h
observer.h
observer_test.cc
operator.cc [caffe2] Remove OperatorBase::newstyle_outputs_ (#67093) 2023-01-23 22:41:59 +00:00
operator.h [caffe2] Remove OperatorBase::newstyle_outputs_ (#67093) 2023-01-23 22:41:59 +00:00
operator_gpu_test.cc
operator_gradient.h
operator_schema.cc
operator_schema.h Revert "Fix crash on unload torch cpu dll (#67632)" 2022-08-02 00:56:18 +00:00
operator_schema_test.cc
operator_test.cc
parallel_net_test.cc Replace all CHECK_ and DCHECK_ with TORCH_* macros (#82032) 2022-07-26 01:20:44 +00:00
plan_executor.cc some reference and move fixes (#95942) 2023-03-10 03:44:09 +00:00
plan_executor.h
plan_executor_test.cc
prof_dag_counters.cc turn on -Werror=type-limits in our Bazel CPU build 2022-06-10 10:04:08 +00:00
prof_dag_counters.h
qtensor.cc
qtensor.h Replace all CHECK_ and DCHECK_ with TORCH_* macros (#82032) 2022-07-26 01:20:44 +00:00
qtensor_serialization.cc
qtensor_serialization.h
scope_guard.h
serialization_test.cc Fix sign-compare in caffe2 cpp tests 2022-04-05 00:08:05 +00:00
static_tracepoint.h
static_tracepoint_elfx86.h
stats.cc
stats.h Remove -Wno-unused-variable from utils.cmake (take 2) (#75538) 2022-04-20 17:41:59 +00:00
stats_test.cc
storage.h
tensor.cc [PyTorch] Further reduce cost of TypeMeta::_typeMetaData (by 10x!) (#98105) 2023-04-12 17:44:48 +00:00
tensor.h [PyTorch] Further reduce cost of TypeMeta::_typeMetaData (by 10x!) (#98105) 2023-04-12 17:44:48 +00:00
tensor_impl.h
tensor_int8.cc
tensor_int8.h
test_utils.cc
test_utils.h
timer.h
timer_test.cc
transform.cc
transform.h
transform_test.cc Fix sign-compare in caffe2 cpp tests 2022-04-05 00:08:05 +00:00
types.cc
types.h
workspace.cc
workspace.h
workspace_test.cc