neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8-dynamic
Text Generation
Transformers
Safetensors
8 languages
llama
fp8
vllm
conversational
text-generation-inference
Inference Endpoints
compressed-tensors
License: llama3.1
Branch: main
Commit History
Update README | 2063612 | verified | ekurtic committed on Oct 19, 2024
Update README.md | b4793e6 | verified | alexmarques committed on Oct 10, 2024
Updated compression_config to quantization_config | fc3ee56 | verified | mgoin committed on Oct 9, 2024
Update README.md | 019d944 | verified | Lin-K76 committed on Aug 23, 2024
Upload folder using huggingface_hub | 17e44d2 | verified | Lin-K76 committed on Aug 22, 2024
Update README.md | fc4ffcb | verified | alexmarques committed on Aug 13, 2024
Update README.md | 893683d | verified | alexmarques committed on Jul 30, 2024
Update README.md | b9e995e | verified | Lin-K76 committed on Jul 27, 2024
Update README.md | b589a15 | verified | Lin-K76 committed on Jul 26, 2024
Update README.md | 3459f3c | verified | Lin-K76 committed on Jul 26, 2024
Update README.md | 0c7579a | verified | Lin-K76 committed on Jul 26, 2024
Update README.md | a69a373 | verified | Lin-K76 committed on Jul 26, 2024
Upload folder using huggingface_hub | 313cd77 | verified | Lin-K76 committed on Jul 26, 2024
Upload folder using huggingface_hub | 2d2c5f6 | verified | Lin-K76 committed on Jul 26, 2024
Update README.md | 5b64234 | verified | Lin-K76 committed on Jul 25, 2024
Update README.md | 0cca697 | verified | Lin-K76 committed on Jul 24, 2024
Create README.md | 5fe96cf | verified | Lin-K76 committed on Jul 24, 2024
Upload folder using huggingface_hub | eb04e5c | verified | Lin-K76 committed on Jul 23, 2024
initial commit | 3beaf1e | verified | Lin-K76 committed on Jul 23, 2024