Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Thomas Wang's picture
267 5

Thomas Wang

TimeRobber
pololiv's profile picture corneille97's profile picture NoahDigitech's profile picture
·
  • thomasw21

AI & ML interests

Large Language Models, Efficient NLP, NeRF

Organizations

BigScience Workshop's profile picture HF Internships's profile picture BigScience Catalogue Data's profile picture BigScience Data's profile picture BigScience Catalogue Data Dev's profile picture Team 7's profile picture BigCode's profile picture ShapeNet's profile picture

authored a paper over 1 year ago

OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

Paper • 2306.16527 • Published Jun 21, 2023 • 46
authored 2 papers almost 2 years ago

Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 53

FinGPT: Large Generative Models for a Small Language

Paper • 2311.05640 • Published Nov 3, 2023 • 31
authored 6 papers over 2 years ago

What Language Model to Train if You Have One Million GPU Hours?

Paper • 2210.15424 • Published Oct 27, 2022 • 2

StarCoder: may the source be with you!

Paper • 2305.06161 • Published May 9, 2023 • 31

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Paper • 2303.03915 • Published Mar 7, 2023 • 7

Crosslingual Generalization through Multitask Finetuning

Paper • 2211.01786 • Published Nov 3, 2022 • 2

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 34

Multitask Prompted Training Enables Zero-Shot Task Generalization

Paper • 2110.08207 • Published Oct 15, 2021 • 2
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs