mirror of
https://github.com/saymrwulf/onnxruntime.git
synced 2026-05-29 23:06:41 +00:00
### Description Implement softcap for gqa. ### Motivation and Context Fixes certain models like Gemma-2 which need softcap to work so they don't output nan's. |
||
|---|---|---|
| .. | ||
| abseil | ||
| composable_kernel | ||
| coremltools | ||
| cpuinfo | ||
| cutlass | ||
| eigen | ||
| flatbuffers | ||
| gsl | ||
| neural_speed | ||
| onnx | ||
| protobuf | ||
| xnnpack | ||
| .gitattributes | ||