JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora

Reinforcement Learning
Safetensors
Text-to-SQL
RL
DPO
Repository size: 55.1 MB
  • 1 contributor
History: 8 commits
Latest commit: db7d153 (verified), "Update README.md" by JHuel, 11 months ago
  • .gitattributes (1.52 kB): initial commit, 11 months ago
  • README.md (3.8 kB): Update README.md, 11 months ago
  • adapter_config.json (725 Bytes): Upload MistralForCausalLM, 11 months ago
  • adapter_model.safetensors (55.1 MB, xet): Upload MistralForCausalLM, 11 months ago
  • generation_config.json (111 Bytes): Upload MistralForCausalLM, 11 months ago