BigScience Data

non-profit
https://bigscience.huggingface.co

Recent Activity

mariagrandury  authored a paper 27 days ago
BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data
mariagrandury  authored a paper 27 days ago
Measuring what Matters: Construct Validity in Large Language Model Benchmarks
pjox  authored a paper about 2 months ago
SciLaD: A Large-Scale, Transparent, Reproducible Dataset for Natural Scientific Language Processing

Team members

Albert Villanova del Moral, Leandro von Werra, Mario Šaško, Jörg Frohberg, Quentin Lhoest, Christopher Akiki, Violette, Iz Beltagy, Yacine Jernite, Hugo Laurençon, Manuel Romero, Lucile Saulnier, Thomas Wang, Teven Le Scao, Sasha Luccioni, Huu Nguyen, Kyle Lo, Roman Castagné, Stella Biderman, Sourab Mangrulkar, Loubna Ben Allal, Francesco De Toni, gerard dupont, Angie McMillan-Major, Masoud, Hendrik Strobelt, Margaret Mitchell, Younes B, David McClure, Yozh, Niklas Muennighoff, Aleksandra Piktus, Sheng Shen, Ben Schmidt, Ron Au, Marielle Lange, Anna Rogers, Colin Raffel, Tristan Thrush, Carlos Muñoz Ferrandis, Nazneen Rajani, Britney Muller, helen, Mostofa Patwary, Pedro Ortiz Suarez, Douwe Kiela, María Grandury, Nikhil Kandpal, Xinyu ZHANG, Odunayo Ogundepo, Jimmy Lin, Pete, Unso Eun Seo Jo, Chris Emezue, Zaid Alyafeai, Andrea Soria, Paulo Villegas, Manan Dey, M Saiful Bari, Thomas Wolf, Jonathan Li, Changran Hu, Thakker, Terra Blevins, Murray Kang, Na, Zeerak, Richard Diehl Martinez, Pierre-Carl Langlais, Demetris, Guilherme Penedo

bigscience-data's models (8)

bigscience-data/sgpt-bloom-1b7-nli

Sentence Similarity • 2B • Updated Jan 27, 2025 • 46 • 11

bigscience-data/tokenizer_alpha_NFKC_250k

Updated Feb 17, 2022

bigscience-data/tokenizer_equal_NFKC_250k

Updated Feb 16, 2022

bigscience-data/tokenizer_alpha_nfkc_24M

Updated Feb 16, 2022

bigscience-data/tokenizer_equal_nfkc_24M

Updated Feb 15, 2022

bigscience-data/tokenizer_equal_weight_NFKC_v1

Updated Feb 14, 2022

bigscience-data/tokenizer_alpha_weight_NFKC

Updated Feb 14, 2022

bigscience-data/tokenizer_v0

Updated Feb 8, 2022