Patrick (Tsung-Han) Wu's picture

3 11 2

Patrick (Tsung-Han) Wu

tsunghanwu

·

https://tsunghan-wu.github.io/

AI & ML interests

Vision and Language

Recent Activity

liked a dataset about 1 month ago

Kyunnilee/visual-puzzles

authored a paper about 2 months ago

Search Arena: Analyzing Search-Augmented LLMs

upvoted a paper about 2 months ago

Search Arena: Analyzing Search-Augmented LLMs

View all activity

Organizations

authored 2 papers about 2 months ago

Search Arena: Analyzing Search-Augmented LLMs

Paper • 2506.05334 • Published Jun 5 • 17

Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint

Paper • 2505.23759 • Published May 29 • 6

authored 6 papers 3 months ago

LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery

Paper • 2505.02829 • Published May 5

Self-correcting LLM-controlled Diffusion Models

Paper • 2311.16090 • Published Nov 27, 2023 • 1

See, Say, and Segment: Teaching LMMs to Overcome False Premises

Paper • 2312.08366 • Published Dec 13, 2023

Visual Haystacks: Answering Harder Questions About Sets of Images

Paper • 2407.13766 • Published Jul 18, 2024 • 2

CLAIR-A: Leveraging Large Language Models to Judge Audio Captions

Paper • 2409.12962 • Published Sep 19, 2024 • 2

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Paper • 2504.13169 • Published Apr 17 • 39