onnxruntime/js/web/lib/wasm/jsep/webgpu/ops
Arthur Islamov d0519a7603
[js/web] BiasSplitGelu and BiasAdd kernels (#17161)
### Description
Two contrib kernels that supposed to speed-up StableDiffusion according
to this doc
https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/python/tools/transformers/models/stable_diffusion/README.md

However, there is no noticable effect in speed or memory consumption. So
i guess the only way to make it faster is to implement
MultiHeadAttention but i'm not capable of doing that right now. So i'll
focus on existing PRs and finding the JSEP kernel that produces
incorrect results. It should be one of the old ones (i suspect Conv or
ConvTranspose), as SD was not generating images correctly on webgpu
since i started working on it. I hoped someone else would fix that by
the time i finish with kernels/optimizations 😅

---------

Co-authored-by: Guenther Schmuelling <guschmue@microsoft.com>
Co-authored-by: Yulong Wang <7679871+fs-eire@users.noreply.github.com>
2023-10-03 12:20:20 -07:00
..
3rd-party [js/web] FP16 Conv, ConvTranspose and MatMul (#17514) 2023-09-30 00:00:23 -07:00
argminmax.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
bias-add.ts [js/web] BiasSplitGelu and BiasAdd kernels (#17161) 2023-10-03 12:20:20 -07:00
bias-split-gelu.ts [js/web] BiasSplitGelu and BiasAdd kernels (#17161) 2023-10-03 12:20:20 -07:00
binary-op.ts [js/webgpu] Allow binary ops with scalar to use the vectorize path (#17589) 2023-09-21 20:55:08 -07:00
common.ts [js/webgpu] Optimize Gather op (#17625) 2023-09-21 21:00:36 -07:00
concat.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
conv-grouped.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
conv-transpose.ts [js/web] FP16 Conv, ConvTranspose and MatMul (#17514) 2023-09-30 00:00:23 -07:00
conv.ts [js/web] FP16 Conv, ConvTranspose and MatMul (#17514) 2023-09-30 00:00:23 -07:00
conv2d-mm.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
conv2dtranspose-mm.ts [JS/Web] Add ConvTranspose implementation using MatMul (#17573) 2023-09-29 11:00:44 -07:00
einsum.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
expand.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
fuse-utils.ts
gather-elements.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
gather.ts [js/webgpu] Optimize Gather op (#17625) 2023-09-21 21:00:36 -07:00
gemm.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
instance-norm.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
layer-norm.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
matmul.ts [js/web] FP16 Conv, ConvTranspose and MatMul (#17514) 2023-09-30 00:00:23 -07:00
pad.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
pool.ts [js/web] fp16 Pool & Reduce (#17512) 2023-09-21 14:52:13 -07:00
range.ts [JS/WebGPU] support Range operator (#17233) 2023-09-30 02:05:32 -07:00
reduce.ts [js/web] fp16 Pool & Reduce (#17512) 2023-09-21 14:52:13 -07:00
resize.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
skip-layer-norm.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
slice.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
softmax.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
split.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
tile.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
transpose.ts [js/web] revise TensorView (#17473) 2023-09-14 21:14:44 -07:00
unary-op.ts [js/web] FP16 binary and unary ops (#17515) 2023-09-18 15:43:32 -07:00