4 17 5

Yichao Fu PRO

Viol2000

AI & ML interests

None yet

Recent Activity

updated a dataset about 1 month ago

multi-token/Trace-Oct-29

published a dataset about 1 month ago

multi-token/Trace-Oct-29

updated a dataset about 1 month ago

multi-token/trace-Oct-28

View all activity

Organizations

upvoted a paper about 2 months ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published Oct 20 • 121

upvoted a paper 3 months ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 116

upvoted 3 papers 4 months ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 129

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 88

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 180

upvoted 2 papers 6 months ago

Scaling Speculative Decoding with Lookahead Reasoning

Paper • 2506.19830 • Published Jun 24 • 12

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 187

upvoted a paper 7 months ago

Faster Video Diffusion with Trainable Sparse Attention

Paper • 2505.13389 • Published May 19 • 37

upvoted 2 papers 8 months ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15 • 63

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16 • 75

upvoted a collection 9 months ago

Qwen2.5-1M

Collection

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Jul 21 • 125

upvoted a paper 10 months ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6 • 51

upvoted a collection 11 months ago

Skywork-o1-Open

Collection

Skywork o1 open model collections • 3 items • Updated Jun 12 • 22

upvoted a paper 11 months ago

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 37

upvoted a paper 12 months ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 52

upvoted a paper over 1 year ago

Efficient LLM Scheduling by Learning to Rank

Paper • 2408.15792 • Published Aug 28, 2024 • 20

upvoted a collection over 1 year ago

Transformers compatible Mamba

Collection

This release includes the `mamba` repositories compatible with the `transformers` library • 5 items • Updated Mar 6, 2024 • 39

Yichao Fu PRO

AI & ML interests

Recent Activity

Organizations

Viol2000's activity