Dataset and Model for paper: "VeriReason: Reinforcement Learning with Testbench
Feedback for Reasoning-Enhanced Verilog Generation"
-
Nellyw888/VeriReason-codeLlama-7b-RTLCoder-Verilog-GRPO-reasoning-tb
Reinforcement Learning • 7B • Updated • 838 • 2 -
Nellyw888/VeriReason-Qwen2.5-3b-RTLCoder-Verilog-GRPO-reasoning-tb
Reinforcement Learning • 3B • Updated • 27 -
Nellyw888/VeriReason-Qwen2.5-1.5b-RTLCoder-Verilog-GRPO-reasoning-tb
Reinforcement Learning • 2B • Updated • 37 • 1 -
Nellyw888/VeriReason-Qwen2.5-7b-RTLCoder-Verilog-GRPO-reasoning-tb
Reinforcement Learning • 8B • Updated • 872 • 4