ConPET: Continual Parameter-Efficient Tuning for Large Language Models Paper • 2309.14763 • Published Sep 26, 2023 • 1
ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs Paper • 2402.03804 • Published Feb 6, 2024 • 4
ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models Paper • 2402.13516 • Published Feb 21, 2024 • 1
BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity Paper • 2507.08771 • Published Jul 2025 • 9
The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs Paper • 2504.17768 • Published Apr 24, 2025 • 14