Instructions to use Outlier-Ai/Outlier-Vision-35B-A3B-MLX-4bit with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use Outlier-Ai/Outlier-Vision-35B-A3B-MLX-4bit with MLX:
# Make sure mlx-vlm is installed # pip install --upgrade mlx-vlm from mlx_vlm import load, generate from mlx_vlm.prompt_utils import apply_chat_template from mlx_vlm.utils import load_config # Load the model model, processor = load("Outlier-Ai/Outlier-Vision-35B-A3B-MLX-4bit") config = load_config("Outlier-Ai/Outlier-Vision-35B-A3B-MLX-4bit") # Prepare input image = ["http://images.cocodataset.org/val2017/000000039769.jpg"] prompt = "Describe this image." # Apply chat template formatted_prompt = apply_chat_template( processor, config, prompt, num_images=1 ) # Generate output output = generate(model, processor, formatted_prompt, image) print(output) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
- Pi new
How to use Outlier-Ai/Outlier-Vision-35B-A3B-MLX-4bit with Pi:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "Outlier-Ai/Outlier-Vision-35B-A3B-MLX-4bit"
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "mlx-lm": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "Outlier-Ai/Outlier-Vision-35B-A3B-MLX-4bit" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use Outlier-Ai/Outlier-Vision-35B-A3B-MLX-4bit with Hermes Agent:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "Outlier-Ai/Outlier-Vision-35B-A3B-MLX-4bit"
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default Outlier-Ai/Outlier-Vision-35B-A3B-MLX-4bit
Run Hermes
hermes
Part of the Outlier shipping lineup. Outlier is a free macOS app that runs this model locally, with one click. Apple Silicon only.
Outlier Vision 35B-A3B (MLX 4-bit)
Multimodal MoE tier with image+text input (35B params, ~3.6B active per token). Optimized for image+text analysis, not code generation — use Core or Code for coding workflows.
Try it in Outlier
The simplest way to use this model is through the Outlier app — open the tier picker, select Outlier Vision, click download, and chat. No setup, no Python, no MLX install, no token quotas.
➡ Download Outlier — outlier.host
A screenshot of the tier picker is at outlier.host/screenshots/tier-picker.png.
Load this directly (power users)
If you want the raw MLX-4bit weights without the app:
pip install mlx-lm
python -m mlx_lm.generate \
--model Outlier-Ai/Outlier-Vision-35B-A3B-MLX-4bit \
--prompt "Write a quicksort in Python." \
--max-tokens 512
from mlx_lm import load, generate
model, tokenizer = load("Outlier-Ai/Outlier-Vision-35B-A3B-MLX-4bit")
print(generate(model, tokenizer, prompt="Hello", max_tokens=256))
Verified benchmarks
For σ-qualified MMLU, HumanEval, and Mac inference-speed numbers — with full provenance (source file, command, n, stderr, date) — see outlier.host/benchmarks.
Other Outlier shipping tiers
- Outlier Nano 4B (entry tier, ~3 GB)
- Outlier Lite 9B (balanced, ~6 GB)
- Outlier Quick 26B-A4B MoE (~16 GB)
- Outlier Core 27B (default, ~16 GB)
- Outlier Code 27B (code-tuned, ~16 GB)
License
Apache 2.0 (inherits from upstream base model). Conversion artifact only — the underlying weights are governed by the base model's license.
- Downloads last month
- 190
4-bit
Model tree for Outlier-Ai/Outlier-Vision-35B-A3B-MLX-4bit
Base model
Qwen/Qwen3.6-35B-A3BEvaluation results
- accuracy on MMLU (5-shot, n=14042)test set self-reported0.835
- pass@1 on HumanEvaltest set self-reported0.610