-
-
-
-
-
-
Inference status
Active filters:
fp8
neuralmagic/Llama-3.2-1B-Instruct-FP8-dynamic
Text Generation
•
Updated
•
2.24k
•
2
SicariusSicariiStuff/Dusk_Rainbow_FP8
Updated
•
11
soprasteria/Mixtral-8x7B-Instruct-v0.1-FP8
Updated
•
136
CalamitousFelicitousness/SorcererLM-8x22b-FP8-Dynamic
John6666/stoiqo-afrodite-fluxxl-f1dalpha-fp8-flux
Text-to-Image
•
Updated
•
21.5k
•
2
obamaTeo/llama-finetune-8bit-wiki-284-ver2
fxmarty/quark-legacy-fp8
Updated
•
325
amd/jais-13b-chat-FP8
predibase/Qwen2.5-14B-FP8
CalamitousFelicitousness/banana-2-b-72b-FP8-Dynamic
taozi555/Llama-Guard-3-8B-FP8
ajinkya-tejankar/Mistral-7B-Instruct-v0.2-FP8-UltraChat-2000-KV
Infermatic/Lumimaid-v0.2-70B-FP8-Dynamic
Updated
•
32
predibase/Qwen2.5-32B-Instruct-FP8
Updated
•
252
Infermatic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-Dynamic
Text Generation
•
Updated
•
263
predibase/Mistral-7B-Instruct-v0.2-FP8-UltraChat-2000-KV
Infermatic/Stellar-Odyssey-12b-v0.0-FP8-Dynamic
Infermatic/Chronos-Platinum-72B-FP8-Dynamic
Infermatic/Nautilus-70B-v0.1-FP8-Dynamic
yejingfu/nmagic-Meta-Llama-3.1-8B-Instruct-FP8
Text Generation
•
Updated
•
10
yejingfu/nmagic-Meta-Llama-3.1-70B-Instruct-FP8
Text Generation
•
Updated
•
13
Dev0502/Qwen2.5-14B-Instruct-abliterated-v2-FP8
andecy64/Nxcode-CQ-7B-orpo-FP8
SicariusSicariiStuff/DeepSeek-Coder-V2-Instruct-FP8
EmbeddedLLM/Qwen2.5-72B-Instruct-OCP-FP8-Quark
yejingfu/nmagic-Meta-Llama-3-70B-Instruct-FP8
Updated
•
33
EmbeddedLLM/Nexusflow_Athena-V2-Chat-OCP-FP8-Quark
EmbeddedLLM/Nexusflow_Athena-V2-Agent-OCP-FP8-Quark
liuxl12/Qwen2.5-32B-Instruct-FP8
Model-SafeTensors/Meta-Llama-3-8B-Instruct-FP8
Updated
•
3.85k