Fix typo
Browse files
README.md
CHANGED
@@ -41,7 +41,7 @@ The model is a decoder-only transformer using GQA and NoPE (with 3:1 ratio), it
|
|
41 |
### Key features
|
42 |
- Instruct model optimized for **hybrid reasoning**
|
43 |
- **Fully open model**: open weights + full training details including public data mixture and training configs
|
44 |
-
- **Long context:** Trained on 64k context and
|
45 |
- **Multilingual**: 6 natively supported (English, French, Spanish, German, Italian, and Portuguese)
|
46 |
|
47 |
For more details refer to our blog post: https://hf.co/blog/smollm3
|
|
|
41 |
### Key features
|
42 |
- Instruct model optimized for **hybrid reasoning**
|
43 |
- **Fully open model**: open weights + full training details including public data mixture and training configs
|
44 |
+
- **Long context:** Trained on 64k context and supports up to **128k tokens** using YARN extrapolation
|
45 |
- **Multilingual**: 6 natively supported (English, French, Spanish, German, Italian, and Portuguese)
|
46 |
|
47 |
For more details refer to our blog post: https://hf.co/blog/smollm3
|