Pengxiang Li

pengxiang

·

pixeli99

AI & ML interests

Video generation, Image editing, AD

Recent Activity

liked a dataset 17 days ago

hotpotqa/hotpot_qa

liked a dataset 18 days ago

YiyangAiLab/MIRA

upvoted a paper 11 days ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published 13 days ago • 100

upvoted 4 papers about 1 month ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20 • 66

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21 • 35

InfiMed-ORBIT: Aligning LLMs on Open-Ended Complex Tasks via Rubric-Based Incremental Training

Paper • 2510.15859 • Published Oct 17 • 10

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13 • 162

upvoted 4 papers about 2 months ago

Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling

Paper • 2510.01329 • Published Oct 1 • 5

CoDA: Coding LM via Diffusion Adaptation

Paper • 2510.03270 • Published Sep 27 • 42

Thinking Augmented Pre-training

Paper • 2509.20186 • Published Sep 24 • 23

Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving

Paper • 2509.20109 • Published Sep 24 • 3

upvoted 2 papers 2 months ago

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9 • 99

Test-Time Scaling with Reflective Generative Model

Paper • 2507.01951 • Published Jul 2 • 106

upvoted 7 papers 3 months ago

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Paper • 2509.07969 • Published Sep 9 • 59

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2 • 123

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Paper • 2508.20072 • Published Aug 27 • 31

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 192

Diffusion Language Models Know the Answer Before Decoding

Paper • 2508.19982 • Published Aug 27 • 23

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14 • 142

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published Aug 12 • 31

upvoted a collection 3 months ago

Dream-Coder 7B

https://hkunlp.github.io/blog/2025/dream-coder • 2 items • Updated Jul 15 • 6

upvoted a paper 4 months ago

InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization

Paper • 2508.05731 • Published Aug 7 • 25