-
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment
Paper • 2507.02768 • Published • 18 -
STITCH: Simultaneous Thinking and Talking with Chunked Reasoning for Spoken Language Models
Paper • 2507.15375 • Published • 29 -
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Paper • 2510.06917 • Published • 33 -
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Paper • 2411.05361 • Published • 2
Ke-Han Lu
kehanlu
AI & ML interests
None yet
Recent Activity
upvoted
a
collection
1 day ago
Awesome papers from 臺大李宏毅 (Hung-yi Lee)
updated
a collection
1 day ago
Awesome papers from 臺大李宏毅 (Hung-yi Lee)
updated
a collection
1 day ago
Awesome papers from 臺大李宏毅 (Hung-yi Lee)