finepdfs-synthetic Synthetic data derived from finepdfs MultiSynt/finepdfs-summaries Viewer • Updated Oct 2 • 1.57B • 1.79k • 1
Nemotron-cc Translations A ~100B token sample from nemotron-cc translated into various languages. We use three different translation models: OPUS-MT, Tower+ 9B and Tower+ 72B. MultiSynt/nemotron-cc-german-tower72b Viewer • Updated Sep 2 • 133M • 1.04k • 1 MultiSynt/nemotron-cc-finnish-tower72b Viewer • Updated Sep 2 • 140M • 14 MultiSynt/nemotron-cc-swedish-tower72b Viewer • Updated Sep 18 • 137M • 21 MultiSynt/nemotron-cc-spanish-tower72b Viewer • Updated Sep 12 • 138M • 431
finepdfs-synthetic Synthetic data derived from finepdfs MultiSynt/finepdfs-summaries Viewer • Updated Oct 2 • 1.57B • 1.79k • 1
Nemotron-cc Translations A ~100B token sample from nemotron-cc translated into various languages. We use three different translation models: OPUS-MT, Tower+ 9B and Tower+ 72B. MultiSynt/nemotron-cc-german-tower72b Viewer • Updated Sep 2 • 133M • 1.04k • 1 MultiSynt/nemotron-cc-finnish-tower72b Viewer • Updated Sep 2 • 140M • 14 MultiSynt/nemotron-cc-swedish-tower72b Viewer • Updated Sep 18 • 137M • 21 MultiSynt/nemotron-cc-spanish-tower72b Viewer • Updated Sep 12 • 138M • 431