Models

231

Full-text search

Active filters: RL

nvidia/Nemotron-Cascade-2-30B-A3B

Text Generation • 32B • Updated 12 days ago • 16.1k • 496

zghhui/OmniNFT

Any-to-Any • Updated 2 days ago • 3

mlx-community/Nemotron-Cascade-2-30B-A3B-8bit

Text Generation • 32B • Updated Mar 20 • 582 • 8

blascotobasco/Nemotron-Cascade-2-96E-A3B

Text Generation • 24B • Updated Mar 22 • 36 • 1

trohrbaugh/Nemotron-Cascade-2-30B-A3B-heretic-ara-uncensored

Text Generation • 32B • Updated Mar 26 • 12 • 3

win10/K1-31B-v5

Image-Text-to-Text • 33B • Updated 13 days ago • 170 • 4

stanfordnlp/SteamSHP-flan-t5-xl

Updated Oct 10, 2023 • 12 • 43

stanfordnlp/SteamSHP-flan-t5-large

Updated Oct 10, 2023 • 35 • 33

SultanR/SmolTulu-1.7b-Reinforced

Text Generation • 2B • Updated Dec 17, 2024 • 15 • 5

mradermacher/SmolTulu-1.7b-Reinforced-GGUF

2B • Updated Dec 18, 2024 • 94

Daemontatox/Llama3.3-70B-CogniLink

Text Generation • 71B • Updated Jun 21, 2025 • 58 • • 3

mradermacher/Llama3.3-70B-CogniLink-GGUF

Text Generation • 71B • Updated Jun 22, 2025 • 178

mradermacher/Llama3.3-70B-CogniLink-i1-GGUF

Text Generation • 71B • Updated Jun 22, 2025 • 356

JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora

Reinforcement Learning • Updated Jan 22, 2025

JHuel/Mistral-Nemo-Instruct-2407_ORPO

Text Generation • Updated Jan 22, 2025

Ihor/Text2Graph-R1-Qwen2.5-0.5b

Text Generation • 0.5B • Updated Aug 18, 2025 • 59 • • 24

tecosys/Nutaan-RL1

Reinforcement Learning • Updated Feb 7, 2025

mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF

0.5B • Updated Aug 18, 2025 • 108 • 1

mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF

0.5B • Updated Aug 18, 2025 • 153 • 1

mradermacher/QuadConnect2.5-0.5B-v0.0.3b-GGUF

0.5B • Updated Feb 22, 2025 • 92

Daemontatox/Zireal-0

Text Generation • 684B • Updated Jul 1, 2025 • 145 • 1

mradermacher/QuadConnect2.5-0.5B-v0.0.8b-GGUF

0.5B • Updated Jul 31, 2025 • 68

Lyte/QuadConnect2.5-0.5B-v0.0.9b

Text Generation • 0.5B • Updated Feb 27, 2025 • 28

mradermacher/QuadConnect2.5-0.5B-v0.0.9b-GGUF

0.5B • Updated Jul 31, 2025 • 84

Lyte/QuadConnect2.5-1.5B-v0.1.0b

Text Generation • 2B • Updated Feb 28, 2025 • 27 • • 1

mradermacher/QuadConnect2.5-1.5B-v0.1.0b-GGUF

2B • Updated Mar 1, 2025 • 110 • 1

mradermacher/Zireal-0-GGUF

Updated Jul 31, 2025 • 1

mradermacher/Magellanic-Qwen-25B-R999-GGUF

25B • Updated Mar 5, 2025 • 24 • 1

mradermacher/Magellanic-Qwen-25B-R999-i1-GGUF

25B • Updated Jul 4, 2025 • 198 • 1

VaidikML0508/Shark-Tank-Offer-Evaluator-llama3.2-3B-Instruct-SFT-DPO-4bits-V1

Text Generation • 3B • Updated Apr 22, 2025 • 4