Text Generation
Transformers
GGUF
Safetensors
mistral
quantized
2-bit
3-bit
4-bit precision
5-bit
6-bit
8-bit precision
llama
en
dataset:cerebras/SlimPajama-627B
dataset:bigcode/starcoderdata
dataset:HuggingFaceH4/ultrachat_200k
dataset:HuggingFaceH4/ultrafeedback_binarized
Inference Endpoints
has_space
text-generation-inference
conversational
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q2_K.ggufm filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q3_K_L.ggufm filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q3_K_M.ggufm filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q3_K_S.ggufm filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q4_K_M.ggufm filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q4_K_S.ggufm filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q5_K_M.ggufm filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q5_K_S.ggufm filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q6_K.ggufm filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q8_0.ggufm filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
TinyLlama-1.1B-Chat-v1.0.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text