Enpei Zhao
enpeizhao
AI & ML interests
None yet
Recent Activity
updated
a Space
about 16 hours ago
enpeizhao/VLM_ODD_Online_Demo
published
a Space
5 days ago
enpeizhao/VLM_ODD_Online_Demo
replied to
sergiopaniego's
post
5 days ago
Yet Another New Multimodal Fine-Tuning Recipe 🥧
🧑‍🍳 In this @HuggingFace Cookbook notebook, we demonstrate how to align a multimodal model (VLM) with Mixed Preference Optimization (MPO) using trl.
💡 This recipe is powered by the new MPO support in trl, enabled through a recent upgrade to the DPO trainer!
We align the multimodal model using multiple optimization objectives (losses), guided by a preference dataset (chosen vs. rejected multimodal pairs).
Check it out! ➡️ https://huggingface.co/learn/cookbook/fine_tuning_vlm_mpo
Organizations
None yet