onnxruntime/js/web/lib/wasm/jsep/webgpu/ops
Satya Kumar Jandhyala ae78cdb5d7
[JS/WebGPU] MultiheadAttention bugfix (#20447)
### Description
Fixed the condition for concatenating pastkey with key and pastvalue with
value, and fixed an index error. Added new test cases.



2024-04-24 08:43:14 -07:00
3rd-party [js/webgpu] perform uniform consistency check (#20019) 2024-03-26 17:14:43 -07:00
argminmax.ts [JS/Web] Added uniforms to Reduce, Resize and Split Ops. (#18727) 2023-12-12 11:12:23 -08:00
attention.ts [JS/WebGPU] MultiheadAttention bugfix (#20447) 2024-04-24 08:43:14 -07:00
batch-norm.ts [js/webgpu] Use DataType as uniform cpu type (#19281) 2024-01-30 21:05:08 -08:00
bias-add.ts
bias-split-gelu.ts [js/webgpu] support customop FastGelu (#19392) 2024-02-06 09:07:31 -08:00
binary-op.ts [js/webgpu] Refactor createTensorShapeVariables (#18883) 2024-02-01 17:59:00 -08:00
common.ts [js/webgpu] fixes for fp16 attention (#20440) 2024-04-24 08:01:28 -07:00
concat.ts [JS/WebGPU] Multihead attention improvements (#20286) 2024-04-23 12:39:49 -07:00
conv-grouped.ts [js/webgpu] perform uniform consistency check (#20019) 2024-03-26 17:14:43 -07:00
conv-transpose.ts fix ConvTranspose 1D (#20194) 2024-04-05 10:05:32 -07:00
conv.ts [js/webgpu] Enable GroupedConvVectorize path (#19791) 2024-03-12 22:25:07 -07:00
cumsum.ts fix csum and enable ut (#20355) 2024-04-17 15:01:06 -07:00
depth-to-space.ts [js/webgpu] implement DepthToSpace operator in webgpu (#19948) 2024-04-10 12:13:46 -07:00
einsum.ts [js/webgpu] Use DataType as uniform cpu type (#19281) 2024-01-30 21:05:08 -08:00
expand.ts [js/webgpu] Refactor createTensorShapeVariables (#18883) 2024-02-01 17:59:00 -08:00
fast-gelu.ts [js/webgpu] support customop FastGelu (#19392) 2024-02-06 09:07:31 -08:00
fuse-utils.ts [js/webgpu] Add LeakyRelu activation for fusedConv (#19369) 2024-02-02 09:06:38 -08:00
gather-elements.ts [js/webgpu] Refactor createTensorShapeVariables (#18883) 2024-02-01 17:59:00 -08:00
gather.ts [js/webgpu] minor fixes to make tinyllama work (#19564) 2024-02-23 15:45:30 -08:00
gemm.ts [js/webgpu] Use DataType as uniform cpu type (#19281) 2024-01-30 21:05:08 -08:00
instance-norm.ts [js/webgpu] Use global id in attention and instance-norm (#20008) 2024-04-02 01:42:39 -07:00
layer-norm.ts [js/web] support SimplifiedLayerNorm and SkipSimplifiedLayerNorm (#20277) 2024-04-11 14:08:50 -07:00
matmul.ts [js/webgpu] Add LeakyRelu activation for fusedConv (#19369) 2024-02-02 09:06:38 -08:00
matmulnbits.ts [JS/WebGPU] Improve MatMulNBits perf (#19974) 2024-04-12 11:03:05 -07:00
multihead-attentiion.ts [JS/WebGPU] Multihead attention improvements (#20286) 2024-04-23 12:39:49 -07:00
pad.ts [js/webgpu] perform uniform consistency check (#20019) 2024-03-26 17:14:43 -07:00
pool.ts [js/webgpu] fix maxpool / fp16 (#19981) 2024-03-19 16:15:49 -07:00
range.ts [js/webgpu] Use DataType as uniform cpu type (#19281) 2024-01-30 21:05:08 -08:00
reduce-shared.ts accumulate in fp32 for Reduce* (#19868) 2024-03-18 08:28:43 -07:00
reduce.ts [js/webgpu] Refactor createTensorShapeVariables (#18883) 2024-02-01 17:59:00 -08:00
resize.ts [js/webgpu] Refactor createTensorShapeVariables (#18883) 2024-02-01 17:59:00 -08:00
rotary-embedding.ts [js/webgpu] Implement com.microsoft.RotaryEmbedding (#20209) 2024-04-08 09:11:26 -07:00
skip-layer-norm.ts fix fp16 for skiplayernorm (#20381) 2024-04-19 12:12:02 -07:00
slice.ts [js/webgpu] Refactor createTensorShapeVariables (#18883) 2024-02-01 17:59:00 -08:00
softmax.ts [js/webgpu] perform uniform consistency check (#20019) 2024-03-26 17:14:43 -07:00
split.ts [js/webgpu] Create Split indices helpers by rank, not by shape (#19554) 2024-02-20 09:24:34 -08:00
tile.ts [js/webgpu] Refactor createTensorShapeVariables (#18883) 2024-02-01 17:59:00 -08:00
transpose.ts [js/webgpu] Refactor createTensorShapeVariables (#18883) 2024-02-01 17:59:00 -08:00
unary-op.ts [js/webgpu] support customop FastGelu (#19392) 2024-02-06 09:07:31 -08:00
where.ts [JS/WebGPU] Fix Split and Where to handle corner cases. (#19613) 2024-02-23 00:21:15 -08:00