Xuyang Shen's picture

Xuyang Shen

Ryan1122

·

XuyangSHEN

AI & ML interests

AIGC

Recent Activity

liked a model about 2 months ago

bosonai/higgs-audio-v2-generation-3B-base

liked a model 3 months ago

jinaai/jina-embeddings-v4

liked a Space 3 months ago

MiniMaxAI/MiniMax-M1

View all activity

Organizations

authored 5 papers 4 months ago

Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention

Paper • 2405.17381 • Published May 27, 2024

Audio-Visual Segmentation with Semantics

Paper • 2301.13190 • Published Jan 30, 2023

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 298

You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet

Paper • 2405.21022 • Published May 31, 2024

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Paper • 2505.18129 • Published May 23 • 60

authored a paper 5 months ago

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Paper • 2504.02587 • Published Apr 3 • 33

authored 5 papers about 1 year ago

Scaling TransNormer to 175 Billion Parameters

Paper • 2307.14995 • Published Jul 27, 2023 • 22

Fine-grained Audible Video Description

Paper • 2303.15616 • Published Mar 27, 2023 • 1

CO2: Efficient Distributed Training with Full Communication-Computation Overlap

Paper • 2401.16265 • Published Jan 29, 2024 • 1

Linear Attention Sequence Parallelism

Paper • 2404.02882 • Published Apr 3, 2024 • 3

Scaling Laws for Linear Complexity Language Models

Paper • 2406.16690 • Published Jun 24, 2024 • 23

authored 2 papers over 1 year ago

HGRN2: Gated Linear RNNs with State Expansion

Paper • 2404.07904 • Published Apr 11, 2024 • 21

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

Paper • 2401.04658 • Published Jan 9, 2024 • 28