Tong Zhu

Spico

https://Spico197.github.io

AI & ML interests

Information Extraction, Mixture-of-Experts, LLM

Recent Activity

new activity 24 days ago

jinaai/jina-code-embeddings-0.5b: About the synthetic data

upvoted a paper 19 days ago

Spotlight on Token Perception for Multimodal Reinforcement Learning

Paper • 2510.09285 • Published 23 days ago • 35

upvoted a paper 24 days ago

Native Hybrid Attention for Efficient Sequence Modeling

Paper • 2510.07019 • Published 25 days ago • 16

upvoted a collection about 1 month ago

DeepSeek-V3.2

Collection

2 items • Updated Sep 29 • 441

upvoted a paper about 1 month ago

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation

Paper • 2509.14760 • Published Sep 18 • 52

upvoted a paper about 2 months ago

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

Paper • 2509.13312 • Published Sep 16 • 105

upvoted a paper 2 months ago

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

Paper • 2508.09834 • Published Aug 13 • 53

upvoted a paper 3 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14 • 142

upvoted 5 papers 5 months ago

SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks

Paper • 2506.10954 • Published Jun 12 • 52

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Paper • 2506.04207 • Published Jun 4 • 48

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Paper • 2505.19641 • Published May 26 • 67

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 130

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published May 20 • 62

upvoted a paper 6 months ago

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Paper • 2505.08617 • Published May 13 • 41

upvoted a paper 7 months ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published Mar 27 • 42

upvoted 2 papers 8 months ago

Iterative Value Function Optimization for Guided Decoding

Paper • 2503.02368 • Published Mar 4 • 15

MoM: Linear Sequence Modeling with Mixture-of-Memories

Paper • 2502.13685 • Published Feb 19 • 36

upvoted an article 9 months ago

Finally, a Replacement for BERT: Introducing ModernBERT

Article • Dec 19, 2024 • 706

upvoted 3 papers 9 months ago

LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid

Paper • 2502.07563 • Published Feb 11 • 24

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published Feb 6 • 24

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published Jan 22 • 61
