huggingface.js icon indicating copy to clipboard operation
huggingface.js copied to clipboard

feat(local-apps): Add Nexa SDK integration

Open mengshengwu opened this issue 2 months ago β€’ 1 comments

Introduction

Hello! Friends from Huggingface πŸ‘‹

Thank you for maintaining this amazing library and ecosystem.
Following a recent discussion with Hugging Face (see here), your CEO expressed interest in exploring collaboration to bring on-device model inference to the community. This PR proposes adding Nexa SDK to the local apps section as a new runtime integration option.

About Nexa SDK

Nexa SDK is an on-device inference framework that runs any model on any device, across any backend. It runs on CPUs, GPUs, NPUs with backend support for CUDA, Metal, Vulkan, and Qualcomm / Intel / AMD NPU. It handles multiple input modalities including text πŸ“, image πŸ–ΌοΈ, and audio 🎧. The SDK includes an OpenAI-compatible API server with support for JSON schema-based function calling and streaming. It supports model formats such as GGUF, MLX, Nexa AI's own .nexa format, enabling efficient quantized inference across diverse platforms.


Example Demos

Multi-Image Reasoning Demo

πŸ–ΌοΈ Multi-Image Reasoning
Spot the difference across two images in multi-round dialogue.

Image + Audio Function Call Demo

🎀 Image + Text β†’ Function Call
Snap a poster, add a voice note, and the AI agent creates a calendar event locally.

Multi-Audio Comparison Demo

🎢 Multi-Audio Comparison
Identify differences between two music clips β€” fully offline.


Related Links


Logo File

Nexa-Logo-Black


Thank you for your time reviewing this PR!
We’re excited to explore how Nexa SDK can extend Hugging Face models to mobile and edge platforms. πŸ™

mengshengwu avatar Oct 14 '25 04:10 mengshengwu

@SBrandeis @Wauplin @julien-c @pcuenca look forward to your review feedbacks!

zhiyuan8 avatar Nov 01 '25 20:11 zhiyuan8