Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
EssentialAI 's Collections
Essential-Web v1.0
Rethinking Reflection in Pre-Training

Essential-Web v1.0

updated Jun 18
Upvote
9

  • Essential-Web v1.0: 24T tokens of organized web data

    Paper • 2506.14111 • Published Jun 17 • 46

  • EssentialAI/essential-web-v1.0

    Preview • Updated Oct 2 • 50.5k • 205

  • EssentialAI/eai-distill-0.5b

    0.6B • Updated Jun 18 • 882 • 23

  • EssentialAI/eai-taxonomy-math-w-fm

    Viewer • Updated Jun 22 • 21.6M • 989 • 5

  • EssentialAI/eai-taxonomy-code-w-dclm

    Viewer • Updated Jun 22 • 274M • 2.37k • 8

  • EssentialAI/eai-taxonomy-code-w-dclm-100b-sample

    Viewer • Updated Jun 22 • 46.2M • 750 • 2

  • EssentialAI/eai-taxonomy-med-w-dclm

    Viewer • Updated Jun 22 • 81.2M • 544 • 8

  • EssentialAI/eai-taxonomy-med-w-dclm-100b-sample

    Viewer • Updated Jun 22 • 36.6M • 211 • 2

  • EssentialAI/eai-taxonomy-stem-w-dclm

    Preview • Updated Jun 22 • 513 • 5

  • EssentialAI/eai-taxonomy-stem-w-dclm-100b-sample

    Viewer • Updated Jun 22 • 35.5M • 429 • 4
Upvote
9
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs