arXiv:2405.14758
Junlin Wu
jlwu002
AI & ML interests
None yet
Recent Activity
updated
a dataset
2 days ago
jlwu002/sr1_dataset
published
a dataset
2 days ago
jlwu002/sr1_dataset
authored
a paper
8 months ago
On the Exploitability of Reinforcement Learning with Human Feedback for
Large Language Models