2 16

Longze Chen

lzchen2001

October2001

AI & ML interests

NLP & LLM

Recent Activity

upvoted an article about 2 months ago

You could have designed state of the art positional encoding

upvoted a paper about 2 months ago

SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner

authored a paper about 2 months ago

OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis

View all activity

Organizations

upvoted an article about 2 months ago

Article

You could have designed state of the art positional encoding

•

Nov 25, 2024

• 327

upvoted a paper about 2 months ago

SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner

Paper • 2506.09003 • Published Jun 10 • 19

authored 2 papers about 2 months ago

OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis

Paper • 2501.04561 • Published Jan 8 • 16

CLaSp: In-Context Layer Skip for Self-Speculative Decoding

Paper • 2505.24196 • Published May 30 • 13

upvoted a paper about 2 months ago

CLaSp: In-Context Layer Skip for Self-Speculative Decoding

Paper • 2505.24196 • Published May 30 • 13

commented a paper about 2 months ago

CLaSp: In-Context Layer Skip for Self-Speculative Decoding

Paper • 2505.24196 • Published May 30 • 13 •

upvoted a paper 2 months ago

Model Merging in Pre-training of Large Language Models

Paper • 2505.12082 • Published May 17 • 38

upvoted a paper 3 months ago

Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents

Paper • 2505.02156 • Published May 4 • 18

upvoted 3 papers 7 months ago

OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis

Paper • 2501.04561 • Published Jan 8 • 16

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 59

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published Dec 19, 2024 • 38

authored a paper 9 months ago

Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models

Paper • 2409.18943 • Published Sep 27, 2024 • 30

upvoted an article 10 months ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

and 2 others •

Aug 14, 2024

• 68

upvoted a paper 11 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 150

authored a paper 11 months ago

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Paper • 2409.05840 • Published Sep 9, 2024 • 49

upvoted a paper 11 months ago

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Paper • 2409.05840 • Published Sep 9, 2024 • 49

authored 2 papers about 1 year ago

Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models

Paper • 2405.17915 • Published May 28, 2024 • 2

DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception

Paper • 2405.15232 • Published May 24, 2024 • 3

upvoted a collection about 1 year ago

Long context

Collection

94 items • Updated Sep 29, 2024 • 33

upvoted a paper about 1 year ago

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models

Paper • 2401.06951 • Published Jan 13, 2024 • 27

Longze Chen

AI & ML interests

Recent Activity

Organizations

lzchen2001's activity

You could have designed state of the art positional encoding

A failed experiment: Infini-Attention, and why we should keep trying?