Sparse Mixture-of-Experts (MoE) models compressed using the REAP (Router-weighted Expert Activation Pruning) method
- cerebras/Qwen3-Coder-REAP-363B-A35B-FP8
  Text Generation • 363B • Updated • 50 • 14
- cerebras/Qwen3-Coder-REAP-246B-A35B-FP8
  Text Generation • 246B • Updated • 125 • 19
- cerebras/Qwen3-Coder-REAP-363B-A35B
  Text Generation • 363B • Updated • 40 • 3
- cerebras/Qwen3-Coder-REAP-246B-A35B
  Text Generation • 246B • Updated • 34 • 6