Yet Another New Multimodal Fine-Tuning Recipe
In this Hugging Face Cookbook notebook, we demonstrate how to align a multimodal model (VLM) with Mixed Preference Optimization (MPO) using trl.
This recipe is powered by the new MPO support in trl, enabled through a recent upgrade to the DPO trainer!
We align the multimodal model using multiple optimization objectives (losses), guided by a preference dataset (chosen vs. rejected multimodal pairs).
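Here is a minimal sketch of what this looks like with trl, where MPO is configured by passing a list of loss types (and their weights) to the DPO trainer. The model ID and dataset name below are illustrative placeholders, not the ones used later in this notebook:

```python
# A minimal sketch, assuming trl's MPO-enabled DPOTrainer.
# Model and dataset names are placeholders for illustration.
from datasets import load_dataset
from transformers import AutoModelForVision2Seq, AutoProcessor
from trl import DPOConfig, DPOTrainer

model_id = "HuggingFaceM4/idefics2-8b"  # placeholder VLM
model = AutoModelForVision2Seq.from_pretrained(model_id)
processor = AutoProcessor.from_pretrained(model_id)

# Preference dataset with chosen vs. rejected multimodal pairs (placeholder).
dataset = load_dataset("openbmb/RLAIF-V-Dataset", split="train")

# MPO mixes several objectives: a preference loss (sigmoid/DPO),
# a quality loss (bco_pair), and a generation loss (sft),
# each weighted separately.
training_args = DPOConfig(
    output_dir="vlm-mpo",
    loss_type=["sigmoid", "bco_pair", "sft"],
    loss_weights=[0.8, 0.2, 1.0],
)

trainer = DPOTrainer(
    model=model,
    args=training_args,
    train_dataset=dataset,
    processing_class=processor,
)
trainer.train()
```

Combining the losses this way follows the MPO recipe: the preference loss teaches relative ranking between chosen and rejected responses, the quality loss scores each response on its own, and the generation loss keeps the model anchored to the chosen completions.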