add AIBOM
#11 opened 5 months ago
by
RiccardoDav
The method get_max_length of 'DynamicCache' is deprecated and has been removed in transformer 4.49
#10 opened 8 months ago
by
login256
Fix for missing blank space at the end of chat template.
#9 opened 9 months ago
by
ShaneTian
OOM with int4 quant
#8 opened 10 months ago
by
chungimungi
I know this is insane but is it possible?
#7 opened 12 months ago
by
Assbang
MMLU benchmark performance on math domain
#6 opened about 1 year ago
by
Fighoture
Use try-except for flash_attn
#5 opened about 1 year ago
by
LiangliangMa
deepseek-v2-lite模型怎么微调?
1
#2 opened over 1 year ago
by
guowl