AI & ML interests

RL, fine-tuning, model alignment, human preference

models 0

None public yet

datasets 0

None public yet