Update README.md
README.md CHANGED
@@ -67,6 +67,21 @@ print(tokenizer.decode(outputs[0]))

For local inference, you can use `llama.cpp`, `ONNX`, `MLX`, and `MLC`. You can find quantized checkpoints in [this collection](https://huggingface.co/collections/HuggingFaceTB/smollm3-686d33c1fdffe8e635317e23).
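
One way to run the quantized GGUF checkpoints locally is through the `llama-cpp-python` bindings for `llama.cpp`. The sketch below assumes you have already downloaded a quantized file from the collection above; the filename is a placeholder.

```python
# Minimal local-inference sketch using the llama-cpp-python bindings for llama.cpp.
# The GGUF path is a placeholder; point model_path at a quantized file downloaded
# from the SmolLM3 collection linked above.
from llama_cpp import Llama

llm = Llama(model_path="./SmolLM3-3B-Q4_K_M.gguf", n_ctx=8192)

out = llm("Give me a brief explanation of gravity.", max_tokens=128)
print(out["choices"][0]["text"])
```
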
### Long context processing

The current `config.json` is set for a context length of up to 65,536 tokens. To handle longer inputs (128k or 256k), we utilize YaRN; you can change `max_position_embeddings` and `rope_scaling` to:
```
{
  ...,
  "rope_scaling": {
    "factor": 2.0,  # 2 * 65536 = 131072
    "original_max_position_embeddings": 65536,
    "type": "yarn"
  }
}
```
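
If you prefer to apply the same override programmatically rather than editing `config.json`, a minimal `transformers` sketch is shown below; the checkpoint id is an assumption, so swap in the one you are loading.

```python
# Sketch: extend the usable context to ~131k tokens by overriding the RoPE scaling
# on the loaded config before instantiating the model.
from transformers import AutoConfig, AutoModelForCausalLM, AutoTokenizer

model_id = "HuggingFaceTB/SmolLM3-3B"  # assumed checkpoint id, adjust as needed

config = AutoConfig.from_pretrained(model_id)
config.max_position_embeddings = 131072  # 2 x 65536
config.rope_scaling = {
    "factor": 2.0,
    "original_max_position_embeddings": 65536,
    "type": "yarn",
}

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, config=config)
```
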
## Evaluation

In this section, we report the evaluation results of the SmolLM3 model. All evaluations are zero-shot unless stated otherwise, and we use [lighteval](https://github.com/huggingface/lighteval) to run them.