Models: 531

Active filters: RLHF

NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO-GGUF

47B • Updated Feb 20, 2024 • 1.03k • 73

OpenAssistant/reward-model-deberta-v3-base

Text Classification • Updated Jan 26, 2023 • 1.6k • 13

OpenAssistant/reward-model-electra-large-discriminator

Text Classification • Updated Jan 26, 2023 • 48 • 5

OpenAssistant/reward-model-deberta-v3-large

Text Classification • Updated Feb 17, 2023 • 348 • 26

OpenAssistant/reward-model-deberta-v3-large-v2

Text Classification • Updated Feb 1, 2023 • 42.8k • 245

llm-blender/pair-ranker

Text Ranking • 0.4B • Updated Apr 2, 2025 • 30 • 4

nicholasKluge/RewardModelPT

Text Classification • 0.1B • Updated Jun 9, 2025 • 33

nicholasKluge/RewardModel

Text Classification • 0.1B • Updated Jun 9, 2025 • 35 • 1

fb700/chatglm-fitness-RLHF

Updated Mar 6, 2024 • 268

fb700/Bofan-chatglm-Best-lora

Updated Aug 24, 2023 • 10 • 11

kubernetes-bad/Ligma-L2-13b

Updated Sep 19, 2023 • 6 • 3

llm-blender/PairRM

Text Generation • Updated Jan 22, 2024 • 758 • 208

berkeley-nest/Starling-LM-7B-alpha

Text Generation • 7B • Updated Mar 20, 2024 • 2.18k • 559

berkeley-nest/Starling-RM-7B-alpha

Updated Jul 30, 2024 • 62 • 104

LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 4

LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 3 • 1

LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 7 • 2

LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 3 • 1

LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2

Text Generation • Updated Nov 27, 2023 • 7 • 2

TheBloke/Starling-LM-7B-alpha-GGUF

7B • Updated Nov 28, 2023 • 2.01k • 94

TheBloke/Starling-LM-7B-alpha-AWQ

Text Generation • 7B • Updated Nov 28, 2023 • 12 • 9

second-state/Starling-LM-7B-alpha-GGUF

Text Generation • 7B • Updated Mar 20, 2024 • 283 • 3

TheBloke/Starling-LM-7B-alpha-GPTQ

Text Generation • 7B • Updated Nov 28, 2023 • 16 • 10

bartowski/Starling-LM-7B-alpha-old-exl2

Text Generation • Updated Nov 28, 2023

tastypear/chatglm-fitness-RLHF-GGML

Updated Nov 30, 2023 • 5

CallComply/Starling-LM-11B-alpha

Text Generation • 11B • Updated Mar 4, 2024 • 199 • 15

TheBloke/Starling-LM-alpha-8x7B-MoE-GGUF

47B • Updated Dec 16, 2023 • 287 • 9

TheBloke/Starling-LM-alpha-8x7B-MoE-GPTQ

Text Generation • 47B • Updated Dec 17, 2023 • 12 • 2

bartowski/Starling-LM-7B-alpha-exl2

Text Generation • Updated Dec 27, 2023 • 1

llm-blender/PairRM-hf

Text Generation • 0.4B • Updated Jan 8, 2024 • 2.17k • 16