Active filters: RL
Teen-Different/squiral_maze
Reinforcement Learning
• Updated
Teen-Different/Tabular_RL_For_Multi_Env
Reinforcement Learning
• Updated
NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated • 50
• 4
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated • 54
• 7
NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated • 51
• 9
prithivMLmods/Mensa-Beta-14B-Instruct
Text Generation
• 15B • Updated • 9
mradermacher/Mensa-Beta-14B-Instruct-GGUF
15B • Updated • 83
mradermacher/Mensa-Beta-14B-Instruct-i1-GGUF
15B • Updated • 220
prithivMLmods/Venatici-Coder-14B-Y.2
Text Generation
• 15B • Updated • 5
mradermacher/Venatici-Coder-14B-Y.2-GGUF
15B • Updated • 40
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos
Reinforcement Learning
• 8B • Updated • 93
• 17
mradermacher/Venatici-Coder-14B-Y.2-i1-GGUF
15B • Updated • 181
prithivMLmods/Camelopardalis-650-14B-Instruct
Text Generation
• 15B • Updated • 5
mradermacher/Camelopardalis-650-14B-Instruct-GGUF
15B • Updated • 56
mradermacher/Camelopardalis-650-14B-Instruct-i1-GGUF
15B • Updated • 70
prithivMLmods/Fomalhaut-QwenR-1.5B
Text Generation
• 2B • Updated • 2
prithivMLmods/Horologium-QwenC-1.5B
Text Generation
• 2B • Updated • 3
prithivMLmods/Pictor-1338-QwenP-1.5B
Text Generation
• 2B • Updated • 6
prithivMLmods/Monoceros-QwenM-1.5B
Text Generation
• 2B • Updated • 5
prithivMLmods/Pisces-QwenR1-1.5B
Text Generation
• 2B • Updated • 14
prithivMLmods/Octantis-QwenR1-1.5B
Text Generation
• 2B • Updated • 4
adriey/Pictor-1338-QwenP-1.5B-Q8_0-GGUF
Text Generation
• 2B • Updated • 5
mradermacher/Pisces-QwenR1-1.5B-GGUF
2B • Updated • 101
mradermacher/Horologium-QwenC-1.5B-GGUF
2B • Updated • 109
mradermacher/Pictor-1338-QwenP-1.5B-GGUF
2B • Updated • 31
mradermacher/Octantis-QwenR1-1.5B-GGUF
2B • Updated • 70
mradermacher/Monoceros-QwenM-1.5B-GGUF
2B • Updated • 54
mradermacher/Horologium-QwenC-1.5B-i1-GGUF
2B • Updated • 199
mradermacher/Fomalhaut-QwenR-1.5B-GGUF
2B • Updated • 116
mradermacher/Pictor-1338-QwenP-1.5B-i1-GGUF
2B • Updated • 66