junyuan's picture

5 26 7

junyuan

Carkham

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

updated a Space about 2 months ago

opendatalab/TRivia-3B

liked a Space about 2 months ago

opendatalab/TRivia-3B

View all activity

Organizations

None yet

upvoted a paper 5 days ago

Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

Paper • 2601.19325 • Published 7 days ago • 76

upvoted a paper 2 months ago

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition

Paper • 2512.01248 • Published Dec 1, 2025 • 11

upvoted 6 papers 4 months ago

AI for Service: Proactive Assistance with AI Glasses

Paper • 2510.14359 • Published Oct 16, 2025 • 75

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1, 2025 • 120

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published Oct 1, 2025 • 59

Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition

Paper • 2510.01068 • Published Oct 1, 2025 • 20

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1, 2025 • 40

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 143

upvoted a paper 7 months ago

The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs

Paper • 2507.11097 • Published Jul 15, 2025 • 64

upvoted a paper 8 months ago

Discrete Diffusion in Large Language and Multimodal Models: A Survey

Paper • 2506.13759 • Published Jun 16, 2025 • 43

upvoted a collection 8 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.63k

upvoted a paper 8 months ago

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Paper • 2505.19147 • Published May 25, 2025 • 144

upvoted a paper 10 months ago

What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

Paper • 2503.24235 • Published Mar 31, 2025 • 54

upvoted a collection 10 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated Dec 31, 2025 • 556

upvoted 2 papers 11 months ago

LEGION: Learning to Ground and Explain for Synthetic Image Detection

Paper • 2503.15264 • Published Mar 19, 2025 • 21

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published Mar 10, 2025 • 61

upvoted a paper 12 months ago

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published Jan 23, 2025 • 43

upvoted a paper about 1 year ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 132

upvoted 2 collections about 1 year ago

InternVL2.5-MPO

Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated Sep 28, 2025 • 24

LLMs

468 items • Updated about 24 hours ago • 43