transformers.js

Gemma3n browser support

Open bil-ash opened this issue 5 months ago • 3 comments

Feature request

As mentioned in https://github.com/huggingface/transformers.js/pull/1348, gemma3n is currently not supported in web browsers, so please add web browser support.

Motivation

It is important because gemma3n is a very good model, both in text-only and in multimodal use, and it would be nice to be able to run it client-side.

Your contribution

None. I tried converting gemma3n-2b to ONNX using the script in this repo, thinking that I might be able to contribute something, but the conversion failed.

bil-ash avatar Jul 07 '25 01:07 bil-ash

Working on it! :)

xenova avatar Jul 11 '25 03:07 xenova

@xenova onnxruntime is adding 2-bit quantization support for avx2, avx512 and neon. Also, recent emscripten versions support avx2 intrinsics. If these two are combined to build onnxruntime with 2-bit quantized model inference support for the wasm runtime, maybe gemma3n can be run in the browser.

bil-ash avatar Jul 26 '25 14:07 bil-ash

> Working on it! :)

Hi @xenova, is there any update on WebGPU support for Gemma 3n?

xnohat avatar Sep 09 '25 10:09 xnohat