onnxruntime/onnxruntime
Yufeng Li d6280e26bd
check rotary_embedding with seq length (#20547)
### Description
<!-- Describe your changes. -->
with past/present shared same buffer, the present seq length is
different with total sequence length. The size of cos/sin cache should
be checked with sequence length.


### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->
2024-05-03 09:43:53 -07:00
..
contrib_ops check rotary_embedding with seq length (#20547) 2024-05-03 09:43:53 -07:00
core [QNN EP] Conv ConvTranspose 3D support (#20507) 2024-05-03 08:55:31 -07:00
python [TensorRT EP] support TensorRT 10-GA (#20506) 2024-05-01 11:10:53 -07:00
test [QNN EP] Conv ConvTranspose 3D support (#20507) 2024-05-03 08:55:31 -07:00
tool/etw
wasm [js/web] rewrite backend resolve to allow multiple EPs (#19735) 2024-03-15 11:47:45 -07:00
__init__.py Bump up version in main from 1.18.0 to 1.19.0 (#20489) 2024-04-29 20:21:41 -07:00
ReformatSource.ps1
ReformatSourcePython.bat
VSCodeCoverage.runsettings