Translation
Safetensors
mistral
YuLu0713 commited on
Commit
e145f3c
·
verified ·
1 Parent(s): 7897d20

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +81 -5
README.md CHANGED
@@ -1,5 +1,81 @@
1
- ---
2
- license: other
3
- license_name: openmdw
4
- license_link: LICENSE
5
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ license_name: openmdw
4
+ license_link: LICENSE
5
+ ---
6
+ # Seed-X-PPO-7B
7
+ <a href="https://github.com/ByteDance-Seed/Seed-X-7B/blob/main/Technical_Report.pdf">
8
+ <img src="https://img.shields.io/badge/Seed--X-Report-blue"></a>
9
+ <a href="XXXX">
10
+ <img src="https://img.shields.io/badge/Seed--X-Hugging Face-brightgreen"></a>
11
+ <a href="https://github.com/ByteDance-Seed/Seed-X-7B/blob/main/LICENSE.openmdw">
12
+ <img src="https://img.shields.io/badge/License-OpenMDW-yellow"></a>
13
+
14
+ ## Introduction
15
+ We are excited to introduce **Seed-X**, a powerful open-source multilingual translation language model series, including instruction and reasoning models, with 7B parameters pushing the boundaries of translation capabilities.
16
+ We develop Seed-X as an accessible, off-the-shelf tool to support the community in advancing translation research and applications:
17
+ * **Exceptional translation capabilities**: Seed-X exhibits state-of-the-art translation capabilities, on par with or outperforming ultra-large models like Gemini-2.5, Claude-3.5, and GPT-4, as validated by human evaluations and automatic metrics.
18
+ * **Deployment and inference-friendly**: With a compact 7B parameter count and mistral architecture, Seed-X offers outstanding translation performance in a lightweight and efficient package, ideal for deployment and inference.
19
+ * **Broad domain coverage**: Seed-X excels on a highly challenging translation test set spanning diverse domains, including the internet, science and technology, office dialogues, e-commerce, biomedicine, finance, law, literature, and entertainment.
20
+ ![performance](/imgs/model_comparsion.png)
21
+
22
+ This repo contains the Seed-X-PPO model, with the following features:
23
+ * Type: Causal language models
24
+ * Training Stage: Pretraining & Post-training
25
+ * Support: Multilingual translation among 28 languages
26
+
27
+ ## Model Downloads
28
+ | Model Name | Description | Download |
29
+ | ----------- | ----------- |-----------
30
+ | Seed-X-Instruct | Instruction-tuned for alignment with user intent. |🤗 [Model]()|
31
+ | 👉 **Seed-X-PPO** | RL trained to boost translation capabilities. | 🤗 [Model]()|
32
+ |Seed-X-RM | Reward model to evaluate the quality of translation.| 🤗 [Model]()|
33
+
34
+ ## Quickstart
35
+ Here is a simple example demonstrating how to load the model and perform translation using ```vllm```
36
+ ```python
37
+ from vllm import LLM, SamplingParams
38
+
39
+ model_path = "./ByteDance-Seed/Seed-X-PPO-7B"
40
+
41
+ model = LLM(model=model_path,
42
+ max_num_seqs=512,
43
+ tensor_parallel_size=8,
44
+ enable_prefix_caching=True,
45
+ gpu_memory_utilization=0.95)
46
+
47
+ messages = [
48
+ "Translate the following English sentence into Chinese:\nMay the force be with you <zh>", # without CoT
49
+ "Translate the following English sentence into Chinese and explain it in detail:\nMay the force be with you <zh>" # with CoT
50
+ ]
51
+
52
+ sampling_params = SamplingParams(temperature=0,
53
+ max_tokens=4096,
54
+ skip_special_tokens=True)
55
+
56
+ results = model.generate(messages, sampling_params)
57
+ responses = [res.outputs[0].text.strip() for res in results]
58
+
59
+ print(responses)
60
+ ```
61
+ ## Evaluation
62
+ We evaluated Seed-X on a diverse set of translation benchmarks, including FLORES-200, WMT-25, and a publicly released [challenge set](https://github.com/ByteDance-Seed/Seed-X-7B/tree/main/challenge_set) accompanied by human evaluations.
63
+ ![humen_eval](/imgs/humen_eval.png)
64
+ For detailed benchmark results and analysis, please refer to our [Technical Report](https://github.com/ByteDance-Seed/Seed-X-7B/blob/main/Technical_Report.pdf).
65
+
66
+ ## License
67
+ This project is licensed under OpenMDW. See the [LICENSE](https://github.com/ByteDance-Seed/Seed-X-7B/blob/main/LICENSE.openmdw) flie for details.
68
+
69
+ ## Citation
70
+ If you find Seed-X useful for your research and applications, feel free to give us a star ⭐ or cite us using:
71
+ ```bibtex
72
+ @Article{XXX,
73
+ title={XXXXXXXXXXX},
74
+ author={XXX,XXX,XXX,XXX},
75
+ year={2025},
76
+ eprint={XXXX.XXXXX},
77
+ archivePrefix={arXiv},
78
+ primaryClass={cs.XX}
79
+ }
80
+ ```
81
+ We will soon publish our technical report on Arxiv.