Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jiahao004 's Collections
DeepTheorem

DeepTheorem

updated Jun 11

A dataset and RL-zero pipeline for advanced mathematical reasoning of informal theorem proving.

Upvote
2

  • Jiahao004/DeepTheorem

    Viewer • Updated 28 days ago • 121k • 379 • 23

  • Jiahao004/DeepTheorem-qwen-1.5b-rl

    2B • Updated May 26 • 3 • 1

  • Jiahao004/DeepTheorem-qwen-3b-rl

    3B • Updated May 26 • 2

  • Jiahao004/DeepTheorem-qwen-7b-rl

    8B • Updated May 26 • 2 • 3

  • Jiahao004/HMMT_FIMO_Putnam

    Updated Jun 6 • 44 • 2

  • DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

    Paper • 2505.23754 • Published May 29 • 16
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs