Active filters: RLHF
ibrahimkettaneh/Athene-V2-Agent-4.0bpw-h6-exl2 • Text Generation • 34 • 1
mlx-community/Athene-V2-Chat-4bit • Text Generation • 51 • 1
mradermacher/Starling-LM-7B-beta-GGUF • 169 • 1
mradermacher/Starling-LM-7B-beta-i1-GGUF • 800 • 1
mradermacher/Starling-LM-7B-beta-LaserRMT-v1-GGUF • 191 • 1
mradermacher/GuIA-v2-GGUF • 187 • 1
OpenAssistant/reward-model-deberta-v3-base • Text Classification • 442 • 10
OpenAssistant/reward-model-electra-large-discriminator • Text Classification • 12 • 5
OpenAssistant/reward-model-deberta-v3-large • Text Classification • 97 • 20
ChaiML/gpt2_base_retry_and_continue_12m_reward_model • Text Classification • 8 • 2
ChaiML/gpt2_medium_retry_and_continue_12m_reward_model • Text Classification • 7
ChaiML/gpt2_large_retry_and_continue_12m_reward_model • Text Classification • 13
ChaiML/gpt2_xl_retry_and_continue_12m_reward_model • Text Classification • 4 • 1
ChaiML/gpt2_base_retry_and_continue_5m_reward_model • Text Classification • 6 • 4
llm-blender/pair-ranker • 3 • 3
nicholasKluge/RewardModelPT • Text Classification • 51
nicholasKluge/RewardModel • Text Classification • 14
fb700/chatglm-fitness-RLHF • 268
fb700/Bofan-chatglm-Best-lora • 17 • 10
kubernetes-bad/Ligma-L2-13b • 5 • 3
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2 • Text Generation • 11
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2 • Text Generation • 11 • 1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2 • Text Generation • 10 • 2
LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2 • Text Generation • 14 • 1
LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2 • Text Generation • 11 • 2
TheBloke/Starling-LM-7B-alpha-GGUF • 593 • 94
TheBloke/Starling-LM-7B-alpha-AWQ • Text Generation • 36 • 9
second-state/Starling-LM-7B-alpha-GGUF • Text Generation • 38 • 3
TheBloke/Starling-LM-7B-alpha-GPTQ • Text Generation • 29 • 9
bartowski/Starling-LM-7B-alpha-old-exl2 • Text Generation