pytorch

mirror of https://github.com/saymrwulf/pytorch.git synced 2026-05-15 21:00:47 +00:00

Edward Z. Yang a6630bcf87 Profile guided optimization for automatic_dynamic (#139001)

Previously: https://github.com/pytorch/pytorch/pull/138052 but the implementation is done from scratch, so I open a new PR.

This implements the ability to save and load profiles of automatic dynamic decisions, so on subsequent runs we can directly make something automatically dynamic. Unlike the previous implementation, this cache is never enabled by default; instead, you have to specify a "job id" that says it's OK to share results. We will be able to automatically populate this id for internal MAST jobs but for generic OSS users you will have to explicitly opt into it.

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

Differential Revision: [D65065497](https://our.internmc.facebook.com/intern/diff/D65065497)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/139001
Approved by: https://github.com/oulgen
2024-11-01 21:43:25 +00:00
_static	Add a temporary Survey about the search (#139096)	2024-10-28 23:43:25 +00:00
_templates	Add a temporary Survey about the search (#139096)	2024-10-28 23:43:25 +00:00
community	Update maintainers for inductor and x86 CPU (#136839)	2024-10-11 07:24:07 +00:00
elastic
notes	Add utility to get all unsafe globals in checkpoint (no pickletools dependency) (#139221)	2024-11-01 19:31:39 +00:00
rpc	[Doc] fix some typos (found by codespell and typos) (#132544)	2024-08-05 17:21:56 +00:00
scripts	[Doc] fix some typos (found by codespell and typos) (#132544)	2024-08-05 17:21:56 +00:00
accelerator.rst	Introduce a device-agnostic runtime API design (#132204)	2024-10-27 10:37:09 +00:00
amp.rst	Update document for autocast on CPU (#135299)	2024-09-13 09:11:47 +00:00
autograd.rst
backends.rst	Clarify opt-einsum usage, fix #127109 (#137596)	2024-10-09 20:31:24 +00:00
benchmark_utils.rst
bottleneck.rst
checkpoint.rst
complex_numbers.rst
cond.rst	[Doc] fix some typos (found by codespell and typos) (#132544)	2024-08-05 17:21:56 +00:00
conf.py	Update copyrights to 2024 (#138638)	2024-10-22 21:00:58 +00:00
config_mod.rst
cpp_extension.rst
cpp_index.rst
cpu.rst
cuda._sanitizer.rst
cuda.rst	raw_alloc ignores PYTORCH_NO_CUDA_MEMORY_CACHING (#131114)	2024-10-04 15:36:29 +00:00
cuda.tunable.rst	[ROCm] Tunableop record untuned (#128813)	2024-10-09 21:59:03 +00:00
cuda_environment_variables.rst
cudnn_persistent_rnn.rst
cudnn_rnn_determinism.rst
data.rst
ddp_comm_hooks.rst
debugging_environment_variables.rst
deploy.rst
deterministic.rst
distributed.algorithms.join.rst
distributed.checkpoint.rst	[Doc] fix some typos (found by codespell and typos) (#132544)	2024-08-05 17:21:56 +00:00
distributed.elastic.rst
distributed.optim.rst
distributed.pipelining.rst	[Pipelining] Refactor Interleaved1F1B and ZeroBubble (#137783)	2024-10-16 03:05:14 +00:00
distributed.rst	[reland][dtensor] move DTensor to public namespace (#134203)	2024-09-08 17:08:40 +00:00
distributed.tensor.parallel.rst	Update link in distributed.tensor.parallel.rst (#136103)	2024-09-15 19:36:29 +00:00
distributed.tensor.rst	[dtensor][experimental] expose DTensor Context Parallel API (#137038)	2024-10-02 18:00:23 +00:00
distributions.rst
dlpack.rst
docutils.conf
export.ir_spec.rst
export.rst	Replace torch.export default decomp table to be lazily populated (#137650)	2024-10-18 19:28:52 +00:00
fft.rst
fsdp.rst
func.api.rst
func.batch_norm.rst
func.migrating.rst
func.rst
func.ux_limitations.rst
func.whirlwind_tour.rst
future_mod.rst
futures.rst
fx.experimental.rst	Remove parallel_and and parallel_or (#138135)	2024-10-23 00:22:22 +00:00
fx.rst	Consolidate SymDispatchMode into ProxyTensorMode (#132674)	2024-08-08 12:02:54 +00:00
hub.rst
index.rst	Introduce a device-agnostic runtime API design (#132204)	2024-10-27 10:37:09 +00:00
jit.rst
jit_builtin_functions.rst
jit_language_reference.rst	[Doc] fix some typos (found by codespell and typos) (#132544)	2024-08-05 17:21:56 +00:00
jit_language_reference_v2.rst	[Doc] fix some typos (found by codespell and typos) (#132544)	2024-08-05 17:21:56 +00:00
jit_python_reference.rst
jit_unsupported.rst
jit_utils.rst
library.rst	Link directly to new Custom Ops Landing Page (#137933)	2024-10-15 21:18:21 +00:00
linalg.rst
logging.rst
masked.rst	Add MaskedTensor passthrough: unfold, F.Unfold, F.Fold, stack (#125262)	2024-09-06 19:06:23 +00:00
math-quantizer-equation.png
meta.rst
miscellaneous_environment_variables.rst	Add environment variable to force no weights_only load (#138225)	2024-10-21 23:26:15 +00:00
mobile_optimizer.rst	Add ExecuTorch warning to mobile_optimizer (#134697)	2024-09-04 17:47:14 +00:00
model_zoo.rst
module_tracker.rst
monitor.rst
mps.rst
mps_environment_variables.rst
mtia.rst	[MTIA] Support torch.cuda.get_device_capability equivalent API on MTIA (#135889)	2024-09-17 17:42:56 +00:00
multiprocessing.rst
name_inference.rst
named_tensor.rst
nested.rst
nn.attention.bias.rst
nn.attention.experimental.rst	[Flex Attention] Paged Attention (#137164)	2024-10-29 17:05:22 +00:00
nn.attention.flex_attention.rst	FlexAttention support for NJT (#136792)	2024-10-28 20:01:27 +00:00
nn.attention.rst	[Flex Attention] Paged Attention (#137164)	2024-10-29 17:05:22 +00:00
nn.functional.rst
nn.init.rst
nn.rst	Make adding Buffers more like adding Parameters (#125971)	2024-07-31 10:32:40 +00:00
onnx.rst	[ONNX] Improves documentation of ONNX exporter (#135372)	2024-09-09 15:09:01 +00:00
onnx_dynamo.rst	[ONNX] Improves documentation of ONNX exporter (#135372)	2024-09-09 15:09:01 +00:00
onnx_dynamo_onnxruntime_backend.rst
onnx_torchscript.rst	[ONNX] Remove deprecated export_to_pretty_string (#137790)	2024-10-21 18:17:48 +00:00
onnx_torchscript_supported_aten_ops.rst
optim.rst	Ensure SWA boundary conditions w.r.t. definition (#133773)	2024-10-31 18:24:08 +00:00
package.rst
profiler.rst
quantization-accuracy-debugging.rst
quantization-backend-configuration.rst
quantization-support.rst	Update pt2e numeric debugger to use node.meta["custom"] field (#134040)	2024-08-27 19:51:03 +00:00
quantization.rst
random.rst
rpc.rst
signal.rst
size.rst
sparse.rst	SparseCsrCUDA: cuDSS backend for linalg.solve (#129856)	2024-08-22 07:57:30 +00:00
special.rst
storage.rst
tensor_attributes.rst	Refine the logic of device construction when only device index is given (#129119)	2024-07-15 14:34:29 +00:00
tensor_view.rst
tensorboard.rst
tensors.rst
testing.rst
threading_environment_variables.rst
torch.ao.ns._numeric_suite.rst
torch.ao.ns._numeric_suite_fx.rst
torch.compiler.config.rst	Profile guided optimization for automatic_dynamic (#139001)	2024-11-01 21:43:25 +00:00
torch.compiler.rst	Profile guided optimization for automatic_dynamic (#139001)	2024-11-01 21:43:25 +00:00
torch.compiler_aot_inductor.rst
torch.compiler_api.rst	[dynamo] add torch.compiler.set_stance (#137504)	2024-10-16 16:18:25 +00:00
torch.compiler_best_practices_for_backends.rst
torch.compiler_cudagraph_trees.rst
torch.compiler_custom_backends.rst
torch.compiler_dynamic_shapes.rst
torch.compiler_dynamo_deepdive.rst
torch.compiler_dynamo_overview.rst
torch.compiler_fake_tensor.rst	[BE] Reroute all uses of proxy_tensor.maybe_disable_fake_tensor_mode to fake_tensor.unset_fake_temporarily (#132770)	2024-08-08 23:07:23 +00:00
torch.compiler_faq.rst	[dynamo] Retire CompileProfiler (#135133)	2024-09-05 01:08:40 +00:00
torch.compiler_fine_grain_apis.rst	[Doc] fix some typos (found by codespell and typos) (#132544)	2024-08-05 17:21:56 +00:00
torch.compiler_get_started.rst	[Inductor] Update AttrsDescriptor instantiation for Triton changes (#137458)	2024-10-14 20:20:29 +00:00
torch.compiler_inductor_profiling.rst
torch.compiler_ir.rst
torch.compiler_nn_module.rst
torch.compiler_performance_dashboard.rst
torch.compiler_profiling_torch_compile.rst	[EZ] Fix spelling typo (#136157)	2024-09-16 19:30:30 +00:00
torch.compiler_transformations.rst
torch.compiler_troubleshooting.rst	Add link to torch.compile the missing manual in troubleshooting (#137301)	2024-10-04 18:19:30 +00:00
torch.overrides.rst
torch.rst	Add docs page for `torch.inf` and `torch.nan` (#138430)	2024-10-31 05:46:46 +00:00
torch_cuda_memory.rst
torch_environment_variables.rst
torch_nccl_environment_variables.rst	[c10d][doc] Add docs for ENV variables TORCH_NCCL_ASYNC_ERROR_HANDLING TORCH_NCCL_TRACE_CPP_STACK and TORCH_NCCL_COORD_CHECK_MILSEC (#132920)	2024-08-09 21:08:20 +00:00
type_info.rst
utils.rst
xpu.rst	Add torch.xpu.get_arch_list and torch.xpu.get_gencode_flags for XPU (#137773)	2024-10-18 02:28:08 +00:00