Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation Paper • 2510.24821 • Published 4 days ago • 27
Leveraging Large Language Models for Pre-trained Recommender Systems Paper • 2308.10837 • Published Aug 21, 2023 • 1
Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies Paper • 2402.03628 • Published Feb 6, 2024
Intelligent Virtual Assistants with LLM-based Process Automation Paper • 2312.06677 • Published Dec 4, 2023 • 1
Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs Paper • 2503.05139 • Published Mar 7 • 4
A Causal Explainable Guardrails for Large Language Models Paper • 2405.04160 • Published May 7, 2024 • 1
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published 10 days ago • 101
Article Art of Focus: Page-Aware Sparse Attention and Ling 2.0's Quest for Efficient Context Length Scaling By RichardBian and 19 others • 12 days ago • 14
Article Ring-flash-linear-2.0: A Highly Efficient Hybrid Architecture for Test-Time Scaling By RichardBian and 8 others • 23 days ago • 10