LoongTrain: Efficient Training of Long-Sequence LLMs with Head-Context Parallelism Paper • 2406.18485 • Published Jun 26, 2024 • 2
Expert-as-a-Service: Towards Efficient, Scalable, and Robust Large-scale MoE Serving Paper • 2509.17863 • Published Sep 22, 2025
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping Paper • 2510.18927 • Published Oct 21, 2025 • 82
ReSpec: Towards Optimizing Speculative Decoding in Reinforcement Learning Systems Paper • 2510.26475 • Published Oct 2025
InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding Paper • 2401.09149 • Published Jan 17, 2024 • 1
AMSP: Super-Scaling LLM Training via Advanced Model States Partitioning Paper • 2311.00257 • Published Nov 1, 2023 • 10