pytorch/torch/_higher_order_ops
Sherlock Huang cf2de4e230 Introduce aoti_call_delegate HOP (#145630)
Summary:
Previously, the AOTI compiled node was represented as a kernel-less custom op in the exported program. The node was not eager-runnable, so it could not be executed for numerical validation during lowering, which is a common practice.

I introduce a new HOP to address this.

The schema is as follows:
```
aoti_call_delegate(lowered_module: AOTInductorEPModule, original_gm: fx.GraphModule, weights: List[Tensor], inputs: List[Tensor])
```
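To picture the numerical-validation use case the schema enables, here is a torch-free sketch (all names below are hypothetical stand-ins, not the real HOP machinery): because the HOP carries `original_gm` alongside the compiled artifact, the lowering pipeline can run both and compare outputs.

```python
# Torch-free sketch of the validation workflow; `compiled_delegate`,
# `original_graph`, and `validate` are hypothetical stand-ins.

def compiled_delegate(x):
    # stands in for the AOTInductor-compiled module
    return x * 2.0 + 1.0

def original_graph(x):
    # stands in for original_gm, which stays eager-runnable
    return x * 2.0 + 1.0

def validate(inputs):
    # With both callables available on the node, lowering can check the
    # compiled output against the eager reference output.
    for x in inputs:
        assert abs(compiled_delegate(x) - original_graph(x)) < 1e-6

validate([0.0, 1.5, -3.0])
```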

There are a few problems exposed by this HOP:
- AOTI expects an FX graph with weights as getattr nodes, i.e. a stateful graph, while HOPs expect graph-module arguments to be stateless. The export serializer also expects a stateless graph. Currently, to satisfy AOTI, I am keeping `original_gm` stateful and bypassing serialization for `original_gm`.
- As a result, the HOP is not re-traceable, since functionalization fails on a stateful graph-module argument.
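The stateful-vs-stateless tension above can be illustrated with a small torch-free sketch (`StatefulGraph` and `stateless_graph` are hypothetical stand-ins, not torch APIs): a stateful graph reads weights through attribute access, analogous to getattr nodes, while a stateless graph receives them as explicit inputs, which is the form HOPs and the serializer expect.

```python
# Hypothetical stand-ins for the two graph conventions; not torch APIs.

class StatefulGraph:
    """Weights live on the module itself (like getattr nodes in FX)."""
    def __init__(self, weight):
        self.weight = weight  # baked-in state, the form AOTI expects

    def __call__(self, x):
        return x * self.weight  # reads state via attribute access


def stateless_graph(weight, x):
    """Weights are explicit inputs, the form HOPs/serialization expect."""
    return x * weight


stateful = StatefulGraph(weight=3)
assert stateful(2) == 6            # state captured inside the module
assert stateless_graph(3, 2) == 6  # same computation, state passed in
```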

Test Plan: buck2 test 'fbcode//mode/opt' fbcode//deeplearning/aot_inductor/cpu/test:cpu_lowering_utils_test

Reviewed By: zhxchen17

Differential Revision: D68359391

Pull Request resolved: https://github.com/pytorch/pytorch/pull/145630
Approved by: https://github.com/zou3519
2025-01-31 04:57:36 +00:00
..
__init__.py Introduce aoti_call_delegate HOP (#145630) 2025-01-31 04:57:36 +00:00
aoti_call_delegate.py Introduce aoti_call_delegate HOP (#145630) 2025-01-31 04:57:36 +00:00
associative_scan.py Require that all HOPs be imported at import torch time (#145939) 2025-01-29 22:27:52 +00:00
auto_functionalize.py PEP585 update - torch/_higher_order_ops torch/_subclasses torch/backends torch/compiler torch/cuda torch/masked torch/mtia torch/nested (#145202) 2025-01-20 22:37:26 +00:00
cond.py [cond] remove warning for unsupported tuple returns (#145766) 2025-01-28 03:13:36 +00:00
effects.py PEP585 update - torch/_higher_order_ops torch/_subclasses torch/backends torch/compiler torch/cuda torch/masked torch/mtia torch/nested (#145202) 2025-01-20 22:37:26 +00:00
executorch_call_delegate.py
flex_attention.py [hop][be] add utils for more comprehensive input alias and mutation (#145298) 2025-01-23 18:12:28 +00:00
foreach_map.py PEP585 update - torch/_higher_order_ops torch/_subclasses torch/backends torch/compiler torch/cuda torch/masked torch/mtia torch/nested (#145202) 2025-01-20 22:37:26 +00:00
hints_wrap.py [hop][be] add utils for more comprehensive input alias and mutation (#145298) 2025-01-23 18:12:28 +00:00
invoke_subgraph.py PEP585 update - torch/_higher_order_ops torch/_subclasses torch/backends torch/compiler torch/cuda torch/masked torch/mtia torch/nested (#145202) 2025-01-20 22:37:26 +00:00
map.py
out_dtype.py [BE] typing for decorators - library (#138969) 2025-01-15 17:08:55 +00:00
prim_hop_base.py [BE] typing for decorators - library (#138969) 2025-01-15 17:08:55 +00:00
run_const_graph.py [export] Unify single and multiple return for hops (#143227) 2025-01-13 03:31:14 +00:00
scan.py [scan] scan dim handling in user-facing scan() (#145179) 2025-01-30 21:09:07 +00:00
strict_mode.py
torchbind.py Remove unused Python variables in torch/[_-a]* (#133492) 2024-12-12 17:39:14 +00:00
triton_kernel_wrap.py [inductor] Make triton kernel autotune config defaults backward-compatible (#145494) 2025-01-29 00:31:39 +00:00
utils.py [while_loop] specialize when cond_fn return constants (#144515) 2025-01-30 19:02:34 +00:00
while_loop.py [hop] fix unbacked_bindings meta for while_loop (#143559) 2025-01-30 21:33:09 +00:00
wrap.py Require that all HOPs be imported at import torch time (#145939) 2025-01-29 22:27:52 +00:00