new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Nov 14

Submitted by

RazinAleks

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

·
3 authors

4

Submitted by

jiannanx

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

·
34 authors

Submitted by

scofield7419

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist

UniVA-Agent

Submitted by

ytz20

Black-Box On-Policy Distillation of Large Language Models

MicrosoftResearch

Microsoft Research

Submitted by

oguzer

Hail to the Thief: Exploring Attacks and Defenses in Decentralised GRPO

Gensyn

Submitted by

AdinaY

Depth Anything 3: Recovering the Visual Space from Any Views

ByteDance-Seed

Submitted by

ekmeyerson

Solving a Million-Step LLM Task with Zero Errors

CognizantAI

Submitted by

suayptalha

Superpositional Gradient Descent: Harnessing Quantum Principles for Model Training

·
3 authors

Submitted by

zjy2001

AlphaResearch: Accelerating New Algorithm Discovery with Language Models

·
6 authors

Submitted by

Sreyan88

Music Flamingo: Scaling Music Understanding in Audio Language Models

nvidia

2

Submitted by

taesiri

Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following

·
25 authors

Submitted by

taesiri

ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents

ScaleAI

Submitted by

taesiri

Benchmarking Diversity in Image Generation via Attribute-Conditional Human Evaluation

google

Submitted by

xrli-U

MuSc-V2: Zero-Shot Multimodal Industrial Anomaly Classification and Segmentation with Mutual Scoring of Unlabeled Samples

Huster

Huazhong University of Science and Technology

Submitted by

taesiri

AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models

·
6 authors

Submitted by

taesiri

SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control

·
6 authors

Submitted by

danielhzlin

MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique

HKBU-NLP

Submitted by

rochanaro

CC30k: A Citation Contexts Dataset for Reproducibility-Oriented Sentiment Analysis

·
3 authors