FP8 Quant Please
#11 opened 4 months ago
by
rjmehta
Best model for SD?
1
#10 opened 4 months ago
by
darkstar3537
AWQ Quantization plz.
👍
1
#9 opened 4 months ago
by
hyunw55
can use vllm deploy this model?
2
#8 opened 4 months ago
by
tianyer
How do I separate the reasoning from the reply?
1
#7 opened 4 months ago
by
Lockout
TTFT deteriorates rapidly after Concurrency reaches 72.
1
#5 opened 5 months ago
by
theGreatGuy
update metadata
#3 opened 5 months ago
by
nickname100231
Can we expect you to open source older versions of Kimi that you developed in-house?
#2 opened 5 months ago
by
win10
gguf model?
👍
2
3
#1 opened 5 months ago
by
segmond