GRAPE: Generalizing Robot Policy via Preference Alignment Paper • 2411.19309 • Published Nov 28, 2024
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models Paper • 2410.10139 • Published Oct 14, 2024
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases Paper • 2407.12784 • Published Jul 17, 2024
Safe Reinforcement Learning via Hierarchical Adaptive Chance-Constraint Safeguards Paper • 2310.03379 • Published Oct 5, 2023
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? Paper • 2407.04842 • Published Jul 5, 2024
RankCLIP: Ranking-Consistent Language-Image Pretraining Paper • 2404.09387 • Published Apr 15, 2024
HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding Paper • 2403.00425 • Published Mar 1, 2024
AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition Paper • 2402.11452 • Published Feb 18, 2024