Junyeong Song

junyeong-nero

https://junyeong-nero.github.io/portfolio/

AI & ML interests

Synthetic Data / OCR / Image-Generation

Recent Activity

liked a model 4 days ago

baidu/Qianfan-OCR

Organizations

None yet

upvoted a paper 1 day ago

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

Paper • 2603.24533 • Published 3 days ago • 38

upvoted a paper 4 days ago

Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs

Paper • 2603.16932 • Published 14 days ago • 84

upvoted a paper 12 days ago

Can Vision-Language Models Solve the Shell Game?

Paper • 2603.08436 • Published 19 days ago • 39

upvoted a paper 18 days ago

Lost in Stories: Consistency Bugs in Long Story Generation by LLMs

Paper • 2603.05890 • Published 23 days ago • 91

upvoted a paper 24 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published 25 days ago • 100

upvoted a paper 26 days ago

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published about 1 month ago • 151

upvoted an article 30 days ago

Mixture of Experts (MoEs) in Transformers

Article • about 1 month ago • 148

upvoted 2 papers about 1 month ago

VLM^2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues

Paper • 2502.12084 • Published Feb 17, 2025 • 35

On Data Engineering for Scaling LLM Terminal Capabilities

Paper • 2602.21193 • Published Feb 24 • 101

upvoted a collection about 1 month ago

Qwen3.5

Collection • 21 items • Updated 19 days ago • 1.33k

upvoted 2 papers about 1 month ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220

DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers

Paper • 2602.16968 • Published Feb 19 • 12

upvoted a paper about 2 months ago

Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss

Paper • 2401.02677 • Published Jan 5, 2024 • 25

upvoted an article 3 months ago

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

Article • Jan 5

upvoted a paper 3 months ago

K-EXAONE Technical Report

Paper • 2601.01739 • Published Jan 5 • 93

upvoted a collection 3 months ago

Tiny-A2D

Collection • Small diffusion language models adapted from AR models • 4 items • Updated Dec 6, 2025 • 18

upvoted a paper 3 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1, 2025 • 79

upvoted a collection 3 months ago

Kanana-2

Collection • Open Source Kanana-2 • 29 items • Updated 26 days ago • 38

upvoted 2 papers 5 months ago

V-Thinker: Interactive Thinking with Images

Paper • 2511.04460 • Published Nov 6, 2025 • 98

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 242
