NeuralOS: Towards Simulating Operating Systems via Neural Generative Models Paper • 2507.08800 • Published 17 days ago • 71
From Chat Logs to Collective Insights: Aggregative Question Answering Paper • 2505.23765 • Published May 29 • 5
Learn to Reason Efficiently with Adaptive Length-based Reward Shaping Paper • 2505.15612 • Published May 21 • 34
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published Jan 30 • 20
Magpie Reasoning Datasets Collection • Reasoning datasets built by Magpie and its friends! • 8 items • Updated Jan 27 • 10
MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures Paper • 2410.13754 • Published Oct 17, 2024 • 76
WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild Paper • 2409.03753 • Published Sep 5, 2024 • 19
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Paper • 2406.08464 • Published Jun 12, 2024 • 70
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step Paper • 2405.14838 • Published May 23, 2024 • 2
Tree Prompting: Efficient Task Adaptation without Fine-Tuning Paper • 2310.14034 • Published Oct 21, 2023 • 3