Latest SOTA models supported on Qualcomm NPU.
AI & ML interests
On Device AI Deployment and Research
Recent Activity
Latest SOTA models supported on Intel NPU
Language Models that takes vision input and/or audio input, hand picked by Nexa Team.
Language Models that takes vision input and/or audio input, hand picked by Nexa Team.
NexaQuant compresses models with 100% accuracy recovery.
Latest SOTA models supported on Qualcomm NPU.
Nexa AI infra to support Qwen3VL running on GPU/NPU/CPU
Latest SOTA models supported on Intel NPU
Latest SOTA models supported on Apple Neural Engine
Language Models that takes vision input and/or audio input, hand picked by Nexa Team.
Text Generations Models in MLX format, hand picked by Nexa Team.
Language Models that takes vision input and/or audio input, hand picked by Nexa Team.
Text Generations Models in GGUF format, hand picked by Nexa Team.
NexaQuant compresses models with 100% accuracy recovery.
Tiny, multimodal on-device models developed by Nexa AI.