Models: 529

Active filters: RLHF

NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO

Text Generation • 47B • Updated Apr 30, 2024 • 7.63k • 452

NousResearch/Nous-Hermes-2-Mistral-7B-DPO

Text Generation • 7B • Updated Apr 30, 2024 • 836 • 217

NousResearch/Nous-Hermes-2-Mistral-7B-DPO-GGUF

7B • Updated Feb 21, 2024 • 4.38k • 92

NousResearch/Hermes-2-Pro-Mistral-7B-GGUF

7B • Updated Mar 28, 2024 • 84.8k • 240

aaditya/Llama3-OpenBioLLM-8B

Text Generation • Updated Jan 18, 2025 • 1.93k • 222

wangclnlp/GRAM-RR-LLaMA-3.1-8B-RewardModel

Text Generation • 8B • Updated Sep 4, 2025 • 63 • 2

mradermacher/GRAM-RR-LLaMA-3.1-8B-RewardModel-GGUF

8B • Updated Sep 4, 2025 • 60 • 1

mradermacher/GRAM-RR-LLaMA-3.1-8B-RewardModel-i1-GGUF

8B • Updated 10 days ago • 190 • 1

OpenAssistant/reward-model-deberta-v3-base

Text Classification • Updated Jan 26, 2023 • 841 • 13

OpenAssistant/reward-model-electra-large-discriminator

Text Classification • Updated Jan 26, 2023 • 12 • 5

OpenAssistant/reward-model-deberta-v3-large

Text Classification • Updated Feb 17, 2023 • 269 • 26

OpenAssistant/reward-model-deberta-v3-large-v2

Text Classification • Updated Feb 1, 2023 • 5.09k • 240

llm-blender/pair-ranker

Text Ranking • 0.4B • Updated Apr 2, 2025 • 11 • 3

nicholasKluge/RewardModelPT

Text Classification • 0.1B • Updated Jun 9, 2025 • 19

nicholasKluge/RewardModel

Text Classification • 0.1B • Updated Jun 9, 2025 • 189 • 1

fb700/chatglm-fitness-RLHF

Updated Mar 6, 2024 • 268

fb700/Bofan-chatglm-Best-lora

Updated Aug 24, 2023 • 15 • 11

kubernetes-bad/Ligma-L2-13b

Updated Sep 19, 2023 • 8 • 3

llm-blender/PairRM

Text Generation • Updated Jan 22, 2024 • 374 • 205

berkeley-nest/Starling-LM-7B-alpha

Text Generation • 7B • Updated Mar 20, 2024 • 1.28k • 555

berkeley-nest/Starling-RM-7B-alpha

Updated Jul 30, 2024 • 51 • 103

LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 7

LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 9 • 1

LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 7 • 2

LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 8 • 1

LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2

Text Generation • Updated Nov 27, 2023 • 6 • 2

TheBloke/Starling-LM-7B-alpha-GGUF

7B • Updated Nov 28, 2023 • 764 • 94

TheBloke/Starling-LM-7B-alpha-AWQ

Text Generation • 7B • Updated Nov 28, 2023 • 7 • 9

second-state/Starling-LM-7B-alpha-GGUF

Text Generation • 7B • Updated Mar 20, 2024 • 196 • 3

TheBloke/Starling-LM-7B-alpha-GPTQ

Text Generation • 7B • Updated Nov 28, 2023 • 45 • 10