QinyuanCheng

Cqy2019

AI & ML interests

None yet

Recent Activity

liked a model 9 days ago

AVoCaDO-Captioner/AVoCaDO

9B • Updated Oct 16 • 324 • 5

upvoted a paper 11 days ago

Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO

Paper • 2511.16669 • Published 12 days ago • 31

liked a model 12 days ago

nex-agi/DeepSeek-V3.1-Nex-N1

671B • Updated 14 days ago • 80 • 25

liked a dataset 14 days ago

nex-agi/html-eval

Updated 14 days ago • 277 • 7

liked a Space 21 days ago

The Smol Training Playbook

📚

2.5k

The secrets to building world-class LLMs

New activity in OpenMOSS-Team/MOSS-TTSD-v0.7 22 days ago

update_readme

#1 opened 22 days ago by

MCplayer

New activity in OpenMOSS-Team/XY_Tokenizer_TTSD_V0_hf 22 days ago

automodel_remote_code_support

#2 opened 22 days ago by

MCplayer

upvoted a paper 25 days ago

UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions

Paper • 2511.03334 • Published 28 days ago • 51

upvoted a paper 26 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published 26 days ago • 207

liked a dataset 30 days ago

videoSALMONN2/video-SALMONN_2_testset

Viewer • Updated May 15 • 483 • 66 • 2

upvoted a paper 30 days ago

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28 • 70

updated a collection about 1 month ago

MOSS-TTSD

Collection

4 items • Updated about 1 month ago • 2

liked a model about 1 month ago

OpenMOSS-Team/MOSS-TTSD-v0.7

Text-to-Speech • 2B • Updated 22 days ago • 4.44k • 15

upvoted a paper about 1 month ago

Sparser Block-Sparse Attention via Token Permutation

Paper • 2510.21270 • Published Oct 24 • 23

liked a Space about 1 month ago

Qwen3 Omni Demo

⚡

186

Interact with a multimodal chatbot using text, audio, images, or video

liked a dataset about 2 months ago

PleIAs/YouTube-Commons

Updated Jun 26, 2024 • 4.08k • 367

liked a model about 2 months ago

ASLP-lab/Easy-Turn

Updated Oct 11 • 25 • 13

upvoted a paper about 2 months ago

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published Oct 15 • 45

liked a Space about 2 months ago

MiMo-Audio-Chat

💬

Chat with Xiaomi MiMo-Audio using voice