Synthetic data derived from finepdfs
MultiSynt
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
MultiSynt is a collaborative initiative between OpenEuroLLM and EuroLLM focused on developing high-quality multilingual synthetic datasets for language model pretraining. By combining expertise from both organizations, MultiSynt aims to advance the creation of multilingual synthetic training data that supports diverse European languages to enable more inclusive AI development across languages.
models
23
MultiSynt/nemotron-cc-icelandic-tower9b
Updated
•
18
MultiSynt/nemotron-cc-icelandic-opus
Updated
•
39
MultiSynt/nemotron-cc-danish-tower9b
Updated
•
74
MultiSynt/nemotron-cc-danish-opus
Updated
•
158
MultiSynt/nemotron-cc-portuguese-opus
Updated
•
90
MultiSynt/nemotron-cc-dutch-opus
Updated
•
92
MultiSynt/nemotron-cc-basque-opus
Updated
•
440
MultiSynt/nemotron-cc-dutch-tower9b
Updated
•
1.07k
MultiSynt/nemotron-cc-spanish-tower9b
Updated
•
26
MultiSynt/nemotron-cc-italian-opus
Updated
•
38
datasets
24
MultiSynt/nemotron-cc-portuguese-tower9b
Viewer
•
Updated
•
136M
•
37
MultiSynt/nemotron-cc-italian-tower9b
Viewer
•
Updated
•
136M
•
166
MultiSynt/nemotron-cc-polish-tower9b
Viewer
•
Updated
•
136M
•
103
MultiSynt/nemotron-cc-french-tower9b
Viewer
•
Updated
•
135M
•
90
MultiSynt/nemotron-cc-spanish-opus-qe
Viewer
•
Updated
•
3.29B
•
83
MultiSynt/nemotron-cc-dutch-tower9b
Viewer
•
Updated
•
135M
•
204
MultiSynt/nemotron-cc-icelandic-tower9b
Viewer
•
Updated
•
136M
•
90
MultiSynt/finepdfs-summaries
Viewer
•
Updated
•
1.57B
•
1.83k
•
1
MultiSynt/nemotron-cc-danish-tower9b
Viewer
•
Updated
•
138M
•
1.44k
MultiSynt/nemotron-cc-finnish-opus-qe
Viewer
•
Updated
•
3.29B
•
423