ZHOU Yang

youngzhou12

https://yangzhou.netlify.app/

AI & ML interests

Medical Foundation Models, Vision-Language Models, NLP

authored 3 papers 9 months ago

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published Oct 16, 2024 • 32

From Generalist to Specialist: Adapting Vision Language Models via Task-Specific Visual Instruction Tuning

Paper • 2410.06456 • Published Oct 9, 2024 • 38

BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays

Paper • 2410.21969 • Published Oct 29, 2024 • 10