onnxruntime/js/web/lib/wasm/jsep/webgpu/ops
Satya Kumar Jandhyala b33216be4c
[JS/WebGPU] Improve MatMulNBits perf (#19974)
### Description
Improve MatMulNBits performance by using shared memory.


### Motivation and Context
2024-04-12 11:03:05 -07:00
..
3rd-party
argminmax.ts
attention.ts
batch-norm.ts
bias-add.ts
bias-split-gelu.ts
binary-op.ts
common.ts
concat.ts
conv-grouped.ts
conv-transpose.ts
conv.ts
cumsum.ts
depth-to-space.ts
einsum.ts
expand.ts
fast-gelu.ts
fuse-utils.ts
gather-elements.ts
gather.ts
gemm.ts
instance-norm.ts
layer-norm.ts
matmul.ts
matmulnbits.ts
multi-head-attentiion.ts
pad.ts
pool.ts
range.ts
reduce-shared.ts
reduce.ts
resize.ts
rotary-embedding.ts
skip-layer-norm.ts
slice.ts
softmax.ts
split.ts
tile.ts
transpose.ts
unary-op.ts
where.ts