Hugging Face

Yannick Versley

yversleyamzn

AI & ML interests

None yet

Organizations

None yet

Collections 1

rlhf
  • Statistical Rejection Sampling Improves Preference Optimization

    Paper • 2309.06657 • Published Sep 13, 2023 • 14
  • Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions

    Paper • 2309.10150 • Published Sep 18, 2023 • 25

models 0

None public yet

datasets 0

None public yet