A curated collection of datasets, models, Spaces, and papers on Reinforcement Learning from Human Feedback (RLHF).
Lewis Tunstall PRO
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
liked
a model
2 days ago
quotientai/limbic-tool-use-0.5B-32K
upvoted
a
paper
2 days ago
Qwen3 Technical Report
new activity
3 days ago
HuggingFaceTB/SmolLM3-3B:Evaluation metrics