Edit Models filters
Model Tree
Apps
Inference Providers
Models
33
Active filters: QuestionAnswering
JamieAi33/Phi-2-QLora
JamieAi33/Phi-2_PEFT
KakashiH/BashExplainer_Gemma
2KKLabs/Kaleidoscope_small_v1
2KKLabs/Kaleidoscope_large_v1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Ins
Reinforcement Learning • 8B • Updated • 6 • 2
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-7B-Base
Reinforcement Learning • 8B • Updated • 10 • 2
SEGAgentRL/LLDS-A-GSPO-Qwen2.5-3B-Ins
Reinforcement Learning • 3B • Updated • 3 • 1
SEGAgentRL/LLDS-R-GSPO-Qwen2.5-3B-Ins
Reinforcement Learning • 3B • Updated • 3 • 1
SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Base
Reinforcement Learning • 3B • Updated • 3 • 1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Base-MA
Reinforcement Learning • 3B • Updated • 8 • 1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Base
Reinforcement Learning • 3B • Updated • 3
SEGAgentRL/LLDS-R-GRPO-Qwen2.5-3B-Ins
Reinforcement Learning • 3B • Updated • 5 • 1
mradermacher/LLDS-A-GSPO-Qwen2.5-3B-Ins-GGUF
3B • Updated • 53
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-GGUF
8B • Updated • 118 • 1
SEGAgentRL/LLDS-A-GRPO-Qwen2.5-3B-Ins
Reinforcement Learning • 3B • Updated • 4
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Base-i1-GGUF
8B • Updated • 2.46k • 2
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Ins-GGUF
8B • Updated • 61 • 1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Base-GGUF
3B • Updated • 92
mradermacher/LLDS-A-GRPO-Qwen2.5-7B-Ins-i1-GGUF
8B • Updated • 351 • 1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Ins-GGUF
3B • Updated • 52
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Ins-GGUF
3B • Updated • 215 • 1
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Base-GGUF
3B • Updated • 75 • 1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Base-MA-GGUF
3B • Updated • 23 • 1
mradermacher/LLDS-R-GSPO-Qwen2.5-3B-Ins-GGUF
3B • Updated • 48 • 1
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Base-i1-GGUF
3B • Updated • 191 • 1
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Base-i1-GGUF
3B • Updated • 412
mradermacher/LLDS-A-GRPO-Qwen2.5-3B-Ins-i1-GGUF
Updated • 245
mradermacher/LLDS-R-GRPO-Qwen2.5-3B-Ins-i1-GGUF
Updated • 137 • 3