This model is an FP8 dynamic quantization of mistralai/Mistral-Small-Instruct-2409, produced with the llmcompressor library from the vLLM project. Refer to the original model card for details on the base model.
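
The exact conversion script is not provided here, but a minimal sketch of FP8 dynamic quantization with llmcompressor looks roughly like the following. The scheme name, `ignore` list, and output directory are taken from the library's documented examples, not from this repository, and the `oneshot` import path may differ between llmcompressor versions.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from llmcompressor.modifiers.quantization import QuantizationModifier
from llmcompressor.transformers import oneshot

MODEL_ID = "mistralai/Mistral-Small-Instruct-2409"

# Load the base model in its original precision.
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto", torch_dtype="auto")
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)

# FP8_DYNAMIC: weights are quantized to FP8 ahead of time, activations are
# quantized dynamically at runtime, so no calibration data is required.
# Keeping lm_head in higher precision follows the library's example recipes.
recipe = QuantizationModifier(targets="Linear", scheme="FP8_DYNAMIC", ignore=["lm_head"])

oneshot(model=model, recipe=recipe)

# Save the compressed checkpoint (directory name is illustrative).
SAVE_DIR = "Mistral-Small-Instruct-2409-FP8-Dynamic"
model.save_pretrained(SAVE_DIR)
tokenizer.save_pretrained(SAVE_DIR)
```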

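The quantized checkpoint can be served directly with vLLM, which reads the quantization config from the model files. The sketch below assumes a single GPU and uses a reduced `max_model_len` to limit memory use; both choices are illustrative rather than requirements stated for this model.

```python
from vllm import LLM, SamplingParams

# Load the FP8 checkpoint; adjust max_model_len to your GPU memory.
llm = LLM(model="tolgaakar/Mistral-Small-Instruct-2409-FP8-Dynamic", max_model_len=8192)

sampling_params = SamplingParams(temperature=0.7, max_tokens=256)

# For instruct use, format prompts with the model's chat template;
# a raw prompt is used here only to keep the sketch short.
outputs = llm.generate(["Explain FP8 quantization in one sentence."], sampling_params)
print(outputs[0].outputs[0].text)
```
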
Checkpoint details: 22.3B parameters stored in safetensors, with FP16 and F8_E4M3 tensor types.
