4.5 non air locally exposed to Claude Code
#24 opened 29 days ago
by
giulianocarioca
rope_scaling for GLM-4.5
1
#23 opened about 2 months ago
by
Zenonnn
Anticipated Availability of GPTQModel Format Models (W4A16/W8A16)
#22 opened 2 months ago
by
X-SZM
Unable download the generated code
3
#21 opened 2 months ago
by
ssfarzad
Fixed π¨ GGUF Tool calling β MCP working β
1
#19 opened 3 months ago
by
xbruce22
Upload Marginal adapttaion.pdf
#17 opened 3 months ago
by
thenunabdo
Please create 8-bit MLX - No-one has it anywhere...
#16 opened 3 months ago
by
Darkslayerofdark
Questions on FP8 inference, parallel requests, and context length with 4x H200s
2
#15 opened 3 months ago
by
sultan93
Does its api support formot?
#14 opened 3 months ago
by
Connde
Impressive Broad Knowledge
π
π
5
8
#12 opened 3 months ago
by
phil111
Thinking tokens issue
π
2
11
#9 opened 3 months ago
by
iyanello
Benchmarks for non-thinking mode
π
4
2
#8 opened 3 months ago
by
PSM24
Thankyou GLM Team for the wonderful MOE Model
π₯
6
1
#7 opened 3 months ago
by
Narutoouz
AWQ 4Bit / GPTQ with full precision gates and head? Please
8
#4 opened 3 months ago
by
chriswritescode
We Have Gemini At Home
4
#1 opened 3 months ago
by
MarinaraSpaghetti