mirror of
https://github.com/saymrwulf/onnxruntime.git
synced 2026-07-01 03:45:06 +00:00
Quantize LSTM: 1. dynamically quantizes MatMul inside the LSTM. It doesn't quantize activation function. 2. support per-channel on the input weight and recurrent weight. |
||
|---|---|---|
| .. | ||
| featurizer_ops | ||
| quantization | ||
| tensorrt/perf | ||
| transformers | ||
| __init__.py | ||
| onnxruntime_test.py | ||
| symbolic_shape_infer.py | ||