mirror of
https://github.com/saymrwulf/pytorch.git
synced 2026-05-14 20:57:59 +00:00
Adds a `invoke_quant` higher order operator as proposed [here](https://docs.google.com/document/d/1s2PfJlq6Q1F8l11CkTIC69BW1rEnGEgs6YmBC7hu8rA/edit?tab=t.0).
The primary motivations are
- Unifying scattered reasoning for quant operators throughout the code base
- Easy of pattern matching - see this very large pattern match expression [here](
|
||
|---|---|---|
| .. | ||
| autoheuristic | ||
| codegen | ||
| compile_worker | ||
| fx_passes | ||
| kernel | ||
| package | ||
| runtime | ||
| __init__.py | ||
| analyze_preserves_zero_mask.py | ||
| aoti_eager.py | ||
| async_compile.py | ||
| autotune_process.py | ||
| bounds.py | ||
| choices.py | ||
| codecache.py | ||
| comm_analysis.py | ||
| comm_lowering.py | ||
| comms.py | ||
| compile_fx.py | ||
| compiler_bisector.py | ||
| config.py | ||
| constant_folding.py | ||
| cpp_builder.py | ||
| cpu_vec_isa.py | ||
| cudagraph_trees.py | ||
| cudagraph_utils.py | ||
| custom_graph_pass.py | ||
| debug.py | ||
| decomposition.py | ||
| dependencies.py | ||
| dtype_propagation.py | ||
| exc.py | ||
| extern_node_serializer.py | ||
| freezing.py | ||
| freezing_utils.py | ||
| fuzzer.py | ||
| fx_utils.py | ||
| graph.py | ||
| hooks.py | ||
| index_propagation.py | ||
| inductor_prims.py | ||
| ir.py | ||
| jagged_lowerings.py | ||
| loop_body.py | ||
| lowering.py | ||
| memory.py | ||
| metrics.py | ||
| mkldnn_ir.py | ||
| mkldnn_lowerings.py | ||
| mock_cache.py | ||
| ops_handler.py | ||
| optimize_indexing.py | ||
| output_code.py | ||
| pattern_matcher.py | ||
| quantized_lowerings.py | ||
| remote_cache.py | ||
| scheduler.py | ||
| script.ld | ||
| select_algorithm.py | ||
| sizevars.py | ||
| subgraph_lowering.py | ||
| test_case.py | ||
| test_operators.py | ||
| triton_bundler.py | ||
| utils.py | ||
| virtualized.py | ||
| wrapper_benchmark.py | ||