CMU-LTI

university

LTIatCMU

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

skhanuja published a Space 6 days ago

cmu-lti/MachineTranslationforVision

skhanuja updated a dataset 8 days ago

cmu-lti/machine-translation-for-vision

skhanuja updated a Space 18 days ago

cmu-lti/MachineTranslationforVision

View all activity

Papers

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

POWSM: A Phonetic Open Whisper-Style Speech Foundation Model

View all Papers

skhanuja

published a Space 6 days ago

MachineTranslationforVision

skhanuja

updated a dataset 8 days ago

cmu-lti/machine-translation-for-vision

Viewer • Updated 8 days ago • 696 • 108 • 1

skhanuja

updated a Space 18 days ago

MachineTranslationforVision

ProKil

submitted a paper to Daily Papers 23 days ago

CooperBench: Why Coding Agents Cannot be Your Teammates Yet

Paper • 2601.13295 • Published Jan 19 • 3

kalbin

authored 2 papers about 1 month ago

PRiSM: Benchmarking Phone Realization in Speech Models

Paper • 2601.14046 • Published Jan 20 • 6

Towards Comprehensive Semantic Speech Embeddings for Chinese Dialects

Paper • 2601.07274 • Published Jan 12 • 1

ankits0052

authored a paper about 1 month ago

Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations

Paper • 2305.12715 • Published May 22, 2023

seungone

authored 5 papers about 2 months ago

Measuring Sycophancy of Language Models in Multi-turn Dialogues

Paper • 2505.23840 • Published May 28, 2025 • 2

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1, 2025 • 79

OptimalThinkingBench: Evaluating Over and Underthinking in LLMs

Paper • 2508.13141 • Published Aug 18, 2025

VideoJudge: Bootstrapping Enables Scalable Supervision of MLLM-as-a-Judge for Video Understanding

Paper • 2509.21451 • Published Sep 25, 2025

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published Oct 28, 2025 • 18

yihaopeng

authored a paper about 2 months ago

DesignPref: Capturing Personal Preferences in Visual Design Generation

Paper • 2511.20513 • Published Nov 25, 2025

kalbin

authored 3 papers 2 months ago

PWESuite: Phonetic Word Embeddings and Tasks They Facilitate

Paper • 2304.02541 • Published Apr 5, 2023 • 2

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Paper • 2411.05361 • Published Nov 8, 2024 • 5

POWSM: A Phonetic Open Whisper-Style Speech Foundation Model

Paper • 2510.24992 • Published Oct 28, 2025 • 4

seungone

authored a paper 3 months ago

RefineBench: Evaluating Refinement Capability of Language Models via Checklists

Paper • 2511.22173 • Published Nov 27, 2025 • 15

Xuhui

published a dataset 3 months ago

cmu-lti/stateful

Viewer • Updated Nov 26, 2025 • 500 • 72

Xuhui

updated a dataset 3 months ago

cmu-lti/stateful

Viewer • Updated Nov 26, 2025 • 500 • 72

yihaopeng

authored a paper 6 months ago

TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action

Paper • 2505.01583 • Published May 2, 2025 • 8