YingzhePeng

ColeYzzzz

https://github.com/ForJadeForest

ForJadeForest

AI & ML interests

NLP, Multimodal

Recent Activity

upvoted a paper 4 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 273

liked a model 5 months ago

YannQi/R-4B

Image-Text-to-Text • 5B • Updated Sep 4, 2025 • 41.8k • 180

upvoted a paper 5 months ago

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published Aug 28, 2025 • 110

upvoted a paper 6 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 181

upvoted a collection 7 months ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 182

updated a model 7 months ago

ColeYzzzz/DIQA_Qwen-5fold

Updated Jul 7, 2025

published 3 models 7 months ago

updated a model 7 months ago

ColeYzzzz/DIQA-color_fidelity-fold3

Image-to-Text • 8B • Updated Jul 7, 2025

published a model 7 months ago

ColeYzzzz/DIQA-color_fidelity-fold3

Image-to-Text • 8B • Updated Jul 7, 2025

updated a model 7 months ago

ColeYzzzz/DIQA-color_fidelity-fold2

Image-to-Text • 8B • Updated Jul 7, 2025 • 1

published a model 7 months ago

ColeYzzzz/DIQA-color_fidelity-fold2

Image-to-Text • 8B • Updated Jul 7, 2025 • 1

updated a model 7 months ago

ColeYzzzz/DIQA-color_fidelity-fold1

Image-to-Text • 8B • Updated Jul 7, 2025 • 2

published a model 7 months ago

ColeYzzzz/DIQA-color_fidelity-fold1

Image-to-Text • 8B • Updated Jul 7, 2025 • 2

updated a model 7 months ago

ColeYzzzz/DIQA_Qwen_CV

Updated Jul 7, 2025

published a model 7 months ago

ColeYzzzz/DIQA_Qwen_CV

Updated Jul 7, 2025

updated a dataset 8 months ago

VLM-Reasoner/details_._ckpt_Qwen2.5-VL-3B-Instruct-kl-rb

Viewer • Updated Jun 1, 2025 • 1.52k • 17

published a dataset 8 months ago

VLM-Reasoner/details_._ckpt_Qwen2.5-VL-3B-Instruct-kl-rb

Viewer • Updated Jun 1, 2025 • 1.52k • 17

updated a dataset 8 months ago

VLM-Perception/COCO-Text

Viewer • Updated May 21, 2025 • 1.6k • 5
