Momentum Decoding: Open-ended Text Generation As Graph Exploration Paper • 2212.02175 • Published Dec 5, 2022
Multi-task Learning for Low-resource Second Language Acquisition Modeling Paper • 1908.09283 • Published Aug 25, 2019
Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation Benchmark Paper • 2411.15488 • Published Nov 23, 2024
Multi-modal Retrieval Augmented Multi-modal Generation: Datasets, Evaluation Metrics and Strong Baselines Paper • 2411.16365 • Published Nov 25, 2024 • 1
Training Language Models to Critique With Multi-agent Feedback Paper • 2410.15287 • Published Oct 20, 2024
HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application Paper • 2510.19631 • Published 5 days ago • 26
DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking Paper • 2510.20168 • Published 5 days ago • 25
MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization Paper • 2503.12689 • Published Mar 16 • 5
On the Transformations across Reward Model, Parameter Update, and In-Context Prompt Paper • 2406.16377 • Published Jun 24, 2024 • 13
On the Transformations across Reward Model, Parameter Update, and In-Context Prompt Paper • 2406.16377 • Published Jun 24, 2024 • 13
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation Paper • 2406.09961 • Published Jun 14, 2024 • 55
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published Apr 29, 2024 • 71
TextBind: Multi-turn Interleaved Multimodal Instruction-following Paper • 2309.08637 • Published Sep 14, 2023 • 8
TextBind: Multi-turn Interleaved Multimodal Instruction-following Paper • 2309.08637 • Published Sep 14, 2023 • 8