RuntimeError: Cannot load `awq` weight, make sure the model is already quantized
1
#17 opened 6 months ago
by
markba
RuntimeError: probability tensor contains either inf, nan or element < 0
#16 opened 7 months ago
by
mstachow
Update README.md
#15 opened 8 months ago
by
megladagon
sglang的chat-template什么时候能设置v2.5-vl
#14 opened 9 months ago
by
kevinBusinessGenrator
修改视觉模型的tokens_per_second没有生效
#13 opened 9 months ago
by
kevinBusinessGenrator
ImportError: cannot import name 'shard_checkpoint' from 'transformers.modeling_utils
➕
1
3
#12 opened 9 months ago
by
rj979797
Change the image_processor_type
#11 opened 9 months ago
by
SorenDreano
如何提高推理速率
#10 opened 9 months ago
by
kevinBusinessGenrator
我们现在使用的图片token数量远没有达到32768个,如何能够降低这个数量
1
#9 opened 9 months ago
by
kevinBusinessGenrator
VLLM部署报错
3
#8 opened 9 months ago
by
classdemo
intermediate_size改动
#6 opened 9 months ago
by
iMountTai
is this bug? "image_processor_type": "Qwen2_5_VLImageProcessor",
5
#5 opened 9 months ago
by
artheru
How to speed up inference?
1
#4 opened 9 months ago
by
vegasscientific
Add link to paper in model card
#3 opened 9 months ago
by
nielsr
AWQ量化版损失有多大?可以再进行SFT微调吗?
👍
2
#1 opened 9 months ago
by
HediZhao