mradermacher/VeriReason-Qwen2.5-1.5B-grpo-small-GGUF Reinforcement Learning • 2B • Updated 19 days ago • 2.25k • 1
mradermacher/VeriReason-Qwen2.5-1.5b-RTLCoder-Verilog-GRPO-reasoning-tb-GGUF Reinforcement Learning • 2B • Updated 19 days ago • 3.37k