Active filters: int4
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v3 • Text Generation • Updated • 106 • 12
ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v1 • Text Generation • Updated • 2
ModelCloud/Meta-Llama-3.1-8B-Instruct-gptq-4bit • Text Generation • Updated • 463 • 4
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w4a16 • Text Generation • Updated • 10.8k • 29
ModelCloud/Qwen2.5-0.5B-Instruct-gptqmodel-4bit • Text Generation • Updated • 40 • 1
Advantech-EIOT/intel_llama-2-chat-7b • Text Generation • Updated • 8
neuralmagic/zephyr-7b-beta-marlin • Text Generation • Updated • 537
neuralmagic/TinyLlama-1.1B-Chat-v1.0-marlin • Text Generation • Updated • 2.64k • 1
neuralmagic/OpenHermes-2.5-Mistral-7B-marlin • Text Generation • Updated • 686 • 2
neuralmagic/Nous-Hermes-2-Yi-34B-marlin • Text Generation • Updated • 13 • 5
ecastera/ecastera-eva-westlake-7b-spanish-int4-gguf • Updated • 52 • 2
softmax/Llama-2-70b-chat-hf-marlin • Text Generation • Updated • 117
softmax/falcon-180B-chat-marlin • Text Generation • Updated • 16
study-hjt/Meta-Llama-3-8B-Instruct-GPTQ-Int4 • Text Generation • Updated • 39
study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int4 • Text Generation • Updated • 26 • 6
study-hjt/Meta-Llama-3-70B-Instruct-AWQ • Text Generation • Updated • 9
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4 • Text Generation • Updated • 10 • 2
study-hjt/CodeQwen1.5-7B-Chat-GPTQ-Int4 • Text Generation • Updated • 81
study-hjt/Qwen1.5-110B-Chat-AWQ • Text Generation • Updated • 8
modelscope/Yi-1.5-34B-Chat-AWQ • Text Generation • Updated • 61 • 1
modelscope/Yi-1.5-6B-Chat-GPTQ • Text Generation • Updated • 94
modelscope/Yi-1.5-6B-Chat-AWQ • Text Generation • Updated • 91
modelscope/Yi-1.5-9B-Chat-GPTQ • Text Generation • Updated • 93 • 1
modelscope/Yi-1.5-9B-Chat-AWQ • Text Generation • Updated • 91
modelscope/Yi-1.5-34B-Chat-GPTQ • Text Generation • Updated • 31 • 1
jojo1899/Phi-3-mini-128k-instruct-ov-int4 • Text Generation • Updated • 17
jojo1899/Llama-2-13b-chat-hf-ov-int4 • Text Generation • Updated • 129
jojo1899/Mistral-7B-Instruct-v0.2-ov-int4 • Text Generation • Updated • 128
model-scope/glm-4-9b-chat-GPTQ-Int4 • Text Generation • Updated • 62 • 6
ModelCloud/Mistral-Nemo-Instruct-2407-gptq-4bit • Text Generation • Updated • 373 • 4