Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
EssentialAI
's Collections
Essential-Web v1.0
Rethinking Reflection in Pre-Training
Essential-Web v1.0
updated
Jun 18
Upvote
9
Essential-Web v1.0: 24T tokens of organized web data
Paper
•
2506.14111
•
Published
Jun 17
•
46
EssentialAI/essential-web-v1.0
Preview
•
Updated
Oct 2
•
50.5k
•
205
EssentialAI/eai-distill-0.5b
0.6B
•
Updated
Jun 18
•
882
•
23
EssentialAI/eai-taxonomy-math-w-fm
Viewer
•
Updated
Jun 22
•
21.6M
•
989
•
5
EssentialAI/eai-taxonomy-code-w-dclm
Viewer
•
Updated
Jun 22
•
274M
•
2.37k
•
8
EssentialAI/eai-taxonomy-code-w-dclm-100b-sample
Viewer
•
Updated
Jun 22
•
46.2M
•
750
•
2
EssentialAI/eai-taxonomy-med-w-dclm
Viewer
•
Updated
Jun 22
•
81.2M
•
544
•
8
EssentialAI/eai-taxonomy-med-w-dclm-100b-sample
Viewer
•
Updated
Jun 22
•
36.6M
•
211
•
2
EssentialAI/eai-taxonomy-stem-w-dclm
Preview
•
Updated
Jun 22
•
513
•
5
EssentialAI/eai-taxonomy-stem-w-dclm-100b-sample
Viewer
•
Updated
Jun 22
•
35.5M
•
429
•
4
Upvote
9
+5
Share collection
View history
Collection guide
Browse collections