Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
JunhaJung 's Collections
Medical Reasoning_Dataset Generation
Medical Reasoning_Agent
Medical Reasoning_Med-MLRM
VLM Reasoning
Long Form Generation
Reasoning
Test-time scaling

Long Form Generation

updated Jun 30
Upvote
-

  • Semantically-Aware Rewards for Open-Ended R1 Training in Free-Form Generation

    Paper • 2506.15068 • Published Jun 18 • 14
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs