Haoqin Tu

PahaII

https://www.haqtu.me/

ImKeTT

AI & ML interests

generation, latent variable models

Recent Activity

liked a dataset 1 day ago

YiyangAiLab/MIRA

Viewer • Updated 1 day ago • 1.46k • 43 • 3

upvoted a paper 2 days ago

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published 3 days ago • 49

upvoted a paper 8 days ago

LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation

Paper • 2510.22946 • Published 11 days ago • 16

liked a dataset 14 days ago

tonyqian/EarthWhere

Viewer • Updated 28 days ago • 810 • 450 • 3

upvoted a paper 29 days ago

Artificial Hippocampus Networks for Efficient Long-Context Modeling

Paper • 2510.07318 • Published 30 days ago • 29

updated a model about 2 months ago

PahaII/maplillary_results

Updated Sep 18

liked a dataset 2 months ago

UCSC-VLAA/PARADE_audio

Viewer • Updated Sep 7 • 938 • 38 • 2

New activity in UCSC-VLAA/PARADE_audio 2 months ago

Add Sample Usage section from HELM framework README

#3 opened 2 months ago by nielsr

liked a Space 2 months ago

FineVision: Open Data is All You Need

Space • 📝 A new open-source dataset for training VLMs • 200

updated a collection 2 months ago

VLAA-Thinker

Collection • 7 items • Updated Sep 3 • 5

New activity in UCSC-VLAA/PARADE_audio 2 months ago

Improve dataset card: Add paper, project page, code links, task category and tags

#2 opened 2 months ago by nielsr

authored 3 papers 2 months ago

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Paper • 2505.04601 • Published May 7 • 28

Autoregressive Pretraining with Mamba in Vision

Paper • 2406.07537 • Published Jun 11, 2024

AHELM: A Holistic Evaluation of Audio-Language Models

Paper • 2508.21376 • Published Aug 29 • 9

commented a paper 2 months ago

AHELM: A Holistic Evaluation of Audio-Language Models

Paper • 2508.21376 • Published Aug 29 • 9

upvoted a paper 2 months ago

AHELM: A Holistic Evaluation of Audio-Language Models

Paper • 2508.21376 • Published Aug 29 • 9

updated a dataset 3 months ago

PahaII/spatialthinker_vqa_10k_filtered

Preview • Updated Aug 12 • 3

published a dataset 3 months ago

PahaII/spatialthinker_vqa_10k_filtered

Preview • Updated Aug 12 • 3

liked a dataset 3 months ago

UCSC-VLAA/GPT-Image-Edit-1.5M

Viewer • Updated Aug 21 • 2.78M • 8.74k • 62

upvoted a paper 3 months ago

GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset

Paper • 2507.21033 • Published Jul 28 • 20
