InfoSynth: Information-Guided Benchmark Synthesis for LLMs Paper • 2601.00575 • Published 6 days ago • 1
MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning Paper • 2512.16909 • Published 20 days ago • 1
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection Paper • 2503.12271 • Published Mar 15, 2025 • 9
EmbedLLM: Learning Compact Representations of Large Language Models Paper • 2410.02223 • Published Oct 3, 2024 • 3
PokerBench: Training Large Language Models to become Professional Poker Players Paper • 2501.08328 • Published Jan 14, 2025 • 19
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows Paper • 2412.01169 • Published Dec 2, 2024 • 13
On Representation Complexity of Model-based and Model-free Reinforcement Learning Paper • 2310.01706 • Published Oct 3, 2023
Introducing v0.5 of the AI Safety Benchmark from MLCommons Paper • 2404.12241 • Published Apr 18, 2024 • 13
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models Paper • 2401.13974 • Published Jan 25, 2024 • 14
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules Paper • 2310.08992 • Published Oct 13, 2023 • 12
Lost in the Middle: How Language Models Use Long Contexts Paper • 2307.03172 • Published Jul 6, 2023 • 43