FP8 Quant Please
#11 opened 11 days ago
by
rjmehta
Best model for SD?
#10 opened 16 days ago
by
darkstar3537
AWQ Quantization plz.
👍 1
#9 opened about 1 month ago
by
hyunw55
Can this model be deployed with vLLM?
2
#8 opened about 1 month ago
by
tianyer
How do I separate the reasoning from the reply?
1
#7 opened about 1 month ago
by
Lockout

TTFT deteriorates rapidly after concurrency reaches 72.
1
#5 opened about 1 month ago
by
theGreatGuy

update metadata
#3 opened about 1 month ago
by
nickname100231
Can we expect you to open source older versions of Kimi that you developed in-house?
#2 opened about 1 month ago
by
win10

GGUF model?
👍 2
3
#1 opened about 1 month ago
by
segmond