Stepsize Anything: A Unified Learning Rate Schedule for Budgeted-Iteration Training • Paper 2505.24452 • Published May 30
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization • Paper 2503.04598 • Published Mar 6
Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models • Paper 2502.15499 • Published Feb 21