Collection related to the paper, "Training a Generally Curious Agent" (Project page: https://paprika-llm.github.io/)
Fahim Tajwar
ftajwar
AI & ML interests
LLMs, RLHF
Recent Activity
updated
a collection
3 days ago
Self-Rewarding-LLM-Training
updated
a collection
3 days ago
Self-Rewarding-LLM-Training
updated
a dataset
3 days ago
ftajwar/evaluation_bitwise_arithmetic-2