pytorch/torch/_inductor
2024-10-16 13:54:21 +00:00
..
autoheuristic
codegen Remove an unused variable in _inductor/codegen/simd.py (#138000) 2024-10-16 13:54:21 +00:00
compile_worker
fx_passes Port Inductor dataclasses to be kw_only (#137768) 2024-10-14 10:33:43 +00:00
kernel Dont decompose aten.baddmm in inductor (#137904) 2024-10-15 14:54:56 +00:00
package [aoti] Accept constant inputs (#137197) 2024-10-03 20:59:33 +00:00
runtime [Inductor]: Support updated Triton AttrsDescriptor (#137757) 2024-10-15 19:34:59 +00:00
__init__.py [inductor] Enable coordinate descent tuning with max-autotune (#136867) 2024-10-10 07:29:52 +00:00
aoti_eager.py
async_compile.py [ROCm][AOTI] add CK backend (#135641) 2024-10-07 23:53:58 +00:00
autotune_process.py Port Inductor dataclasses to be kw_only (#137768) 2024-10-14 10:33:43 +00:00
bisect_helper.py Search through config changes in compiler bisector (#137346) 2024-10-11 20:24:54 +00:00
bounds.py
codecache.py BundledAutotuneCache (take 2) (#137902) 2024-10-15 18:39:47 +00:00
comm_analysis.py
comms.py
compile_fx.py Set Cuda context in inductor and dont initialize wrong cuda device in fake_tensor (#137603) 2024-10-10 18:25:22 +00:00
config.py enable auto functionalize v2 by default (#136685) 2024-10-15 19:04:42 +00:00
constant_folding.py
cpp_builder.py [AOTI] Add standalone version of TORCH_CHECK (#136873) 2024-10-08 15:30:01 +00:00
cpu_vec_isa.py Extend vectorization with SVE(ARM) with Torch Compile (Inductor) (#134672) 2024-10-10 13:20:40 +00:00
cudagraph_trees.py Invalidate StorageImpl instances when tensor is overwritten with cudagraphs (#125264) 2024-10-09 00:05:52 +00:00
cudagraph_utils.py
custom_graph_pass.py [inductor] Add API to make post_grad_custom passes cache-able (#137298) 2024-10-07 21:11:54 +00:00
debug.py
decomposition.py Dont decompose aten.baddmm in inductor (#137904) 2024-10-15 14:54:56 +00:00
dependencies.py [inductor] Fix reduction_hint sum to single element (#137754) 2024-10-13 01:08:23 +00:00
exc.py Type _inductor/exc.py (#137595) 2024-10-09 23:15:06 +00:00
extern_node_serializer.py
freezing.py
fx_utils.py
graph.py BundledAutotuneCache (take 2) (#137902) 2024-10-15 18:39:47 +00:00
hooks.py
index_propagation.py
inductor_prims.py
ir.py Refactored debug_extra to be on ChoiceCaller (and called description) (#137857) 2024-10-15 00:48:14 +00:00
jagged_lowerings.py Add lowering for aten.searchsorted (#135701) 2024-10-04 19:26:05 +00:00
loop_body.py [inductor] Add LoopBody.op_counts (#137945) 2024-10-16 06:35:10 +00:00
lowering.py Add host-side Triton TMA support to Dynamo (#137677) 2024-10-16 02:18:48 +00:00
memory.py
metrics.py
mkldnn_ir.py Port Inductor dataclasses to be kw_only (#137768) 2024-10-14 10:33:43 +00:00
mkldnn_lowerings.py
ops_handler.py Add lowering for aten.searchsorted (#135701) 2024-10-04 19:26:05 +00:00
optimize_indexing.py type _inductor/optimize_indexing.py (#137599) 2024-10-09 23:29:47 +00:00
pattern_matcher.py Update mutation checking in pattern matcher (#137448) 2024-10-08 16:56:40 +00:00
quantized_lowerings.py type _inductor/quantized_lowerings.py (#137598) 2024-10-09 23:29:26 +00:00
remote_cache.py BundledAutotuneCache (take 2) (#137902) 2024-10-15 18:39:47 +00:00
scheduler.py Add hooks to Scheduler nodes for generating device-specific debug strings (#135015) 2024-10-11 20:30:49 +00:00
script.ld
select_algorithm.py Refactored debug_extra to be on ChoiceCaller (and called description) (#137857) 2024-10-15 00:48:14 +00:00
sizevars.py [aotinductor] Add helper fn to atomically apply size_hint to an expr w/ unbacked symints (#137537) 2024-10-10 17:11:24 +00:00
subgraph_lowering.py
test_case.py
test_operators.py
utils.py [Inductor UT] Generalize newly introduced inductor UTs for intel GPU (Part 3) (#136947) 2024-10-12 13:21:20 +00:00
virtualized.py
wrapper_benchmark.py