ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q4 • Reinforcement Learning • 8B • Updated Mar 26 • 9.34k • 225
mradermacher/Self-Certainty-Qwen3-1.7B-Base-MATH-GGUF • Reinforcement Learning • 2B • Updated 5 days ago • 251 • 1
mradermacher/DeepHermes-Egregore-8B-131K-GGUF • Reinforcement Learning • 8B • Updated 1 day ago • 105 • 1
mradermacher/DeepHermes-Egregore-8B-131K-i1-GGUF • Reinforcement Learning • 8B • Updated 1 day ago • 173 • 1