Submitted by taesiri 65 OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling · 19 authors 114 1
Submitted by xhyandwyy 30 UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning · 11 authors 5.64k 1
Submitted by taesiri 21 InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts · 12 authors 111 1
Submitted by taesiri 8 LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence · 7 authors 1
Submitted by ottogin 6 Locality in Image Diffusion Models Emerges from Data Statistics · 4 authors 11 1
Submitted by kaiyangzhou 5 Measuring Epistemic Humility in Multimodal Large Language Models · 4 authors 6 2
Submitted by gauravfs-14 3 CognitiveSky: Scalable Sentiment and Narrative Analysis for Decentralized Social Media · 3 authors 3 2
Submitted by Macro 2 Look Again, Think Slowly: Enhancing Visual Reflection in Vision-Language Models · 6 authors 1
Submitted by UVSKKR 1 EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI · 1 authors 1
Submitted by ylu610 1 Learning to Optimize Multi-Objective Alignment Through Dynamic Reward Weighting · 9 authors 0 1
Submitted by taesiri 1 PersonaX: Multimodal Datasets with LLM-Inferred Behavior Traits · 10 authors 1
Submitted by yixuantt - GAPrune: Gradient-Alignment Pruning for Domain-Aware Embeddings · 2 authors 1 1