neuralmagic/TinyLlama-1.1B-Chat-v1.0-marlin
Tags: Text Generation, Transformers, Safetensors, llama, nm-vllm, marlin, int4, conversational, text-generation-inference, Inference Endpoints, 4-bit precision, gptq, arxiv:2210.17323
Commit History
c7713ac | Create requirements.txt | verified | robertgshaw2 committed on Mar 6, 2024
9d40424 | Create quantization/apply_gptq_save_marlin.py | verified | robertgshaw2 committed on Mar 6, 2024
bd74ab9 | Update README.md | verified | robertgshaw2 committed on Mar 6, 2024
8680e42 | Create README.md | verified | robertgshaw2 committed on Mar 6, 2024
5059cf5 | Upload folder using huggingface_hub | verified | mgoin committed on Mar 5, 2024
15de72b | initial commit | verified | mgoin committed on Mar 5, 2024