Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
appvoid 's Collections
cool spaces
main releases
cool datasets

cool datasets

updated 6 days ago

some interesting datasets to use for language modeling

Upvote
-

  • appvoid/raw-corpus

    Viewer • Updated Feb 23 • 1.6M • 19

  • pszemraj/simple_wikipedia

    Viewer • Updated Sep 9, 2023 • 238k • 676 • 7

  • common-pile/youtube

    Viewer • Updated Jun 6 • 1.13M • 167 • 10

  • srinivasbilla/self-instruct-base

    Viewer • Updated Jan 24, 2023 • 82.6k • 51 • 5

  • agentlans/high-quality-english-sentences

    Viewer • Updated Oct 1, 2024 • 1.71M • 1.62k • 21

  • agentlans/note-taking-v2

    Viewer • Updated Sep 22 • 17.6k • 38

  • PleIAs/SYNTH

    Viewer • Updated 10 days ago • 68M • 40.7k • 158
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs