InferenceService name updated
#8 opened 14 days ago by ckavili

change-name
#7 opened 18 days ago by robertgshaw
Overview states 109b, should be 17b
#6 opened 20 days ago by jcordes

Failing to quantize using your method
#4 opened 3 months ago by redd2dead

vLLM launch parameters
#3 opened 3 months ago by Clutchkin
Why not FP8 with static and per-tensor quantization?
#2 opened 4 months ago by wanzhenchn
Thank you for uploading this.
#1 opened 4 months ago by chriswritescode
