Cuiunbo PRO
Cuiunbo
AI & ML interests
Anything
Recent Activity
liked
a dataset
27 minutes ago
tsinghua-ee/RivaBench
liked
a dataset
about 1 month ago
hyf015/EgoThinker-SFT-Dataset
authored
a paper
about 2 months ago
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe
VLM For OCR
audio
- Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
  Paper • 2311.07919 • Published • 10
- Stable Audio Open
  Paper • 2407.14358 • Published • 26
- OpenMOSS-Team/AnyGPT-chat
  Text Generation • Updated • 23 • 18
- FBK-MT/mosel
  Viewer • Updated • 2.2M • 3.52k • 85
MiniCPM-V
- openbmb/MiniCPM-Llama3-V-2_5
  Image-Text-to-Text • 9B • Updated • 56.4k • 1.4k
- openbmb/MiniCPM-Llama3-V-2_5-int4
  Visual Question Answering • 5B • Updated • 549 • 76
- openbmb/MiniCPM-Llama3-V-2_5-gguf
  Updated • 3.44k • 215
- openbmb/MiniCPM-V-2
  Visual Question Answering • 3B • Updated • 75.1k • 482
Dataset For OCR
VLM dataset