Update README.md
#11
by
yuchenxie
- opened
README.md
CHANGED
|
@@ -290,7 +290,7 @@ YaRN is currently supported by several inference frameworks, e.g., `transformers
|
|
| 290 |
|
| 291 |
For `vllm`, you can use
|
| 292 |
```shell
|
| 293 |
-
vllm serve ... --rope-scaling '{"
|
| 294 |
```
|
| 295 |
|
| 296 |
For `sglang`, you can use
|
|
|
|
| 290 |
|
| 291 |
For `vllm`, you can use
|
| 292 |
```shell
|
| 293 |
+
vllm serve ... --rope-scaling '{"rope_type":"yarn","factor":4.0,"original_max_position_embeddings":32768}' --max-model-len 131072
|
| 294 |
```
|
| 295 |
|
| 296 |
For `sglang`, you can use
|