PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper • 2511.09057 • Published Nov 12 • 76
TxT360: Trillion Extracted Text • Explore and analyze the TxT360 dataset for LLM pre-training • 131
Essential-Web v1.0: 24T tokens of organized web data Paper • 2506.14111 • Published Jun 17 • 46
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective Paper • 2506.14965 • Published Jun 17 • 49
Asymmetry in Low-Rank Adapters of Foundation Models Paper • 2402.16842 • Published Feb 26, 2024 • 2
tinyBenchmarks: evaluating LLMs with fewer examples Paper • 2402.14992 • Published Feb 22, 2024 • 17
Large Language Model Routing with Benchmark Datasets Paper • 2309.15789 • Published Sep 27, 2023 • 1