Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning Paper • 2505.20161 • Published May 26 • 1
Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact Verifiers Paper • 2506.13342 • Published Jun 16
The Surprising Effectiveness of Membership Inference with Simple N-Gram Coverage Paper • 2508.09603 • Published Aug 13 • 2
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model Paper • 2508.14444 • Published Aug 20 • 36
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge Paper • 2510.18941 • Published 13 days ago • 7