slower than qwen 2.5 on a100 40gb
#10 opened about 2 months ago
by
ambivalent02
The `seen_tokens` attribute is deprecated and will be removed in v4.41. Use the `cache_position` model input instead.
1
#9 opened 5 months ago
by
ctranslate2-4you
Add link to paper
#8 opened 5 months ago
by
nielsr