Walter Hugo Lopez Pinaya

Warvito

AI & ML interests

None yet

Recent Activity

updated a collection about 11 hours ago

updated a collection 2 days ago

updated a collection 2 days ago

upvoted 3 papers 2 days ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 220

Video Analysis and Generation via a Semantic Progress Function

Paper • 2604.22554 • Published 7 days ago • 59

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Paper • 2604.24763 • Published 4 days ago • 62

upvoted a paper 3 days ago

Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation

Paper • 2604.19141 • Published 10 days ago • 1

upvoted a paper 8 days ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published 9 days ago • 237

upvoted 2 papers 11 days ago

DiffEM: Learning from Corrupted Data with Diffusion Models via Expectation Maximization

Paper • 2510.12691 • Published Dec 20, 2025 • 1

OmniWeaving: Towards Unified Video Generation with Free-form Composition and Reasoning

Paper • 2603.24458 • Published Mar 25 • 9

upvoted 2 papers 12 days ago

LPM 1.0: Video-based Character Performance Model

Paper • 2604.07823 • Published 22 days ago • 77

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Paper • 2604.02029 • Published 29 days ago • 146

upvoted 2 papers 29 days ago

Towards a Medical AI Scientist

Paper • 2603.28589 • Published Mar 30 • 89

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 145

upvoted 6 papers about 1 month ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published Mar 3 • 103

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 371

Mixture-of-Depths Attention

Paper • 2603.15619 • Published Mar 16 • 80

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published Mar 6 • 119

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 204

upvoted 3 papers about 2 months ago

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published Mar 10 • 48

Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published Mar 4 • 186

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5, 2025 • 82