Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty
Mehul Damani
mehuldamani
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
Updated a model 5 days ago: mehuldamani/abstain-v3
Updated a model 7 days ago: mehuldamani/RLVR-hotpot-olmo-v3
Updated a model 7 days ago: mehuldamani/RLCR-hotpot-olmo-v3
Organizations
None yet