-
-
-
-
-
-
Inference status
Active filters:
int4
ecastera/ecastera-eva-westlake-7b-spanish-int4-gguf
Updated
•
7
•
2
ModelCloud/Mistral-Nemo-Instruct-2407-gptq-4bit
Text Generation
•
Updated
•
70.1k
•
4
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w4a16
Text Generation
•
Updated
•
117k
•
28
neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w4a16
Text Generation
•
Updated
•
2.37k
•
12
zzzmahesh/Meta-Llama-3-8B-Instruct-quantized.w4a4
Text Generation
•
Updated
•
18
•
1
joshmiller656/Llama-3.1-Nemotron-70B-Instruct-AWQ-INT4
Text Generation
•
Updated
•
1.3k
•
2
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v2
Text Generation
•
Updated
•
15
•
3
ModelCloud/Llama-3.2-3B-Instruct-gptqmodel-4bit-vortex-v3
Text Generation
•
Updated
•
365
•
5
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v2.5
Text Generation
•
Updated
•
565
•
4
ModelCloud/Qwen2.5-Coder-32B-Instruct-gptqmodel-4bit-vortex-v1
Text Generation
•
Updated
•
115
•
12
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v1
Text Generation
•
Updated
•
562
•
51
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v2
Text Generation
•
Updated
•
625
•
15
ModelCloud/Falcon3-10B-Instruct-gptqmodel-4bit-vortex-v1
Text Generation
•
Updated
•
44
•
3
Advantech-EIOT/intel_llama-2-chat-7b
Text Generation
•
Updated
•
11
neuralmagic/zephyr-7b-beta-marlin
Text Generation
•
Updated
•
2.89k
neuralmagic/TinyLlama-1.1B-Chat-v1.0-marlin
Text Generation
•
Updated
•
2.74k
•
1
neuralmagic/OpenHermes-2.5-Mistral-7B-marlin
Text Generation
•
Updated
•
749
•
2
neuralmagic/Nous-Hermes-2-Yi-34B-marlin
Text Generation
•
Updated
•
20
•
5
softmax/Llama-2-70b-chat-hf-marlin
Text Generation
•
Updated
•
275
softmax/falcon-180B-chat-marlin
Text Generation
•
Updated
•
14
study-hjt/Meta-Llama-3-8B-Instruct-GPTQ-Int4
Text Generation
•
Updated
•
35
study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int4
Text Generation
•
Updated
•
71
•
6
study-hjt/Meta-Llama-3-70B-Instruct-AWQ
Text Generation
•
Updated
•
20
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
•
Updated
•
19
•
2
study-hjt/CodeQwen1.5-7B-Chat-GPTQ-Int4
Text Generation
•
Updated
•
18
study-hjt/Qwen1.5-110B-Chat-AWQ
Text Generation
•
Updated
•
14
modelscope/Yi-1.5-34B-Chat-AWQ
Text Generation
•
Updated
•
27
•
1
modelscope/Yi-1.5-6B-Chat-GPTQ
Text Generation
•
Updated
•
26
modelscope/Yi-1.5-6B-Chat-AWQ
Text Generation
•
Updated
•
28
modelscope/Yi-1.5-9B-Chat-GPTQ
Text Generation
•
Updated
•
25
•
1