RLHFlow

university
Activity Feed

AI & ML interests

Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/

Recent Activity

baohao  updated a collection 1 day ago
Reinforce-Ada
baohao  updated a dataset 1 day ago
RLHFlow/reinforce_ada_hard_prompt
baohao  published a dataset 1 day ago
RLHFlow/reinforce_ada_hard_prompt
View all activity

RLHFlow 's datasets 84