Update README.md to include ExecuTorch
#30, opened by SocialLocalMobile
README.md CHANGED
@@ -198,7 +198,7 @@ text = tokenizer.apply_chat_template(
 )
 ```
 
-For local inference, you can use `llama.cpp`, `ONNX`, `MLX` and `
+For local inference, you can use `llama.cpp`, `ONNX`, `MLX`, `MLC` and `ExecuTorch`. You can find quantized checkpoints in this collection (https://huggingface.co/collections/HuggingFaceTB/smollm3-686d33c1fdffe8e635317e23)
 
 ### vLLM and SGLang
 