WebAssembly / Web runtime (both for wasm-simd and WebGPU) #8216
Replies: 9 comments 1 reply
-
cc: @mcr229 or @digantdesai regarding running XNNPACK via wasm
-
Also cc: @mergennachin
-
I've talked with @digantdesai about this before. I think for XNNPACK he mentioned it should just be plug and play. I've been wanting to try out wasm for some time now, I just haven't had the bandwidth.
-
I also wonder about the fusion capabilities of ExecuTorch :) Does it allow Inductor-codegen'd fused kernels (e.g. quant/dequant fused directly into the flash attention kernel, with the positional embedding computation also fused into that kernel)? Another interesting backend is webgpu/wgpu: https://github.com/huggingface/ratchet . Even raw wgpu/wgsl shaders could in theory be a compilation target for fused kernels.
But even if ExecuTorch does not support wild codegen/fusions, it would still be good to have it as a baseline, with comparisons against ort-web, tflite+tfjs, tvm-wasm, and ggml compiled to wasm. This should show roughly where all these frameworks stand (especially if compiling is relatively doable).
-
And given that PyTorch currently does not have its own inference wasm/WebGPU story, having ExecuTorch compiled to wasm-simd might be a nice baseline to have (especially if it's minimalistic and relatively simple to compile).
-
I suspect much of the core should be compilable with the emscripten C++ compiler. Probably not the optimized operators, though, and I'm not too sure about backends/xnnpack.
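For reference, an emscripten cross-compile of a CMake project usually just wraps the configure step. A minimal sketch (the `EXECUTORCH_BUILD_XNNPACK` option name and output directory here are assumptions, not a documented ExecuTorch wasm workflow; only the `emcmake` wrapper is standard Emscripten):

```shell
# Hypothetical sketch: cross-compile a CMake tree to wasm with Emscripten.
# Requires the emsdk toolchain to be installed and activated first.
git clone https://github.com/pytorch/executorch.git
cd executorch
emcmake cmake -B cmake-out-wasm \
  -DCMAKE_BUILD_TYPE=Release \
  -DEXECUTORCH_BUILD_XNNPACK=ON   # assumed option name, unverified for wasm
cmake --build cmake-out-wasm -j
```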
-
Maybe the best first step would be adding some sort of GitHub Actions CI job that compiles it with emscripten... (even if no tests using it exist so far).
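A compile-only CI job of that sort could be quite small. A hedged sketch (the workflow name, build flags, and the third-party `setup-emsdk` action are assumptions, not an existing ExecuTorch workflow):

```yaml
# Hypothetical compile-only CI sketch; names and flags are assumptions.
name: emscripten-build
on: [push, pull_request]
jobs:
  build-wasm:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: mymindstorm/setup-emsdk@v14   # assumed third-party emsdk action
      - run: emcmake cmake -B cmake-out-wasm -DCMAKE_BUILD_TYPE=Release
      - run: cmake --build cmake-out-wasm -j
```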
-
It should be, given a bunch of WASM[SIMD] kernels. I haven't tried it myself, though. IIRC there isn't any CI for that on github/xnnpack either.
-
XNNPACK is also known to compile (and maybe even be tested) for wasm/simd, so somehow this should be achievable... I don't know if any compact backend library/project exists for WebGPU kernels.
-
I'm wondering if ExecuTorch can be compiled for a WebAssembly target? As far as I understand, XNNPACK exists for wasm-simd, so at least for CPU this should theoretically be doable? (e.g. to be compared with tflite+tfjs, ort-web, and tvm-wasm, at least for some popular models like MobileNets)
(This is especially interesting if strong fusion/codegen can be done to produce fused wasm-simd code / fused WebGPU programs, although maybe this is an ask for Inductor.)
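Whatever runtime ends up targeting the browser, the host-side entry point is the standard WebAssembly JS API. A minimal, ExecuTorch-agnostic sanity check that an engine can at least validate a wasm module (only the standard API is used here, nothing framework-specific):

```javascript
// Smallest possible wasm module: the 8-byte header alone
// (magic "\0asm" followed by version 1) is a valid empty module.
const emptyModule = new Uint8Array([
  0x00, 0x61, 0x73, 0x6d, // magic: "\0asm"
  0x01, 0x00, 0x00, 0x00, // version: 1
]);

// True in any JS engine with WebAssembly support (browsers, Node.js).
const hasWasm =
  typeof WebAssembly === "object" && WebAssembly.validate(emptyModule);

console.log(hasWasm); // true
```

Detecting wasm-simd support specifically would additionally require validating a module that uses a SIMD instruction (as libraries like wasm-feature-detect do), since `WebAssembly.validate` only reports what the engine accepts.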