Resources for hybrid preferences research where we learn how to route preference instances for human vs. AI feedback
Lj V. Miranda PRO
ljvmiranda921
AI & ML interests
NLP - multilinguality, data-centric AI
Recent Activity
updated a dataset 5 days ago: ai2-adapt-dev/toolu-dpo-mix-D1
updated a dataset 5 days ago: ai2-adapt-dev/toolu-sft-mix-T3-rc2
published a dataset 5 days ago: ai2-adapt-dev/toolu-sft-mix-T3-rc2