neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic • Text Generation • Updated Oct 17, 2024 • 1.62k • 14
neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic • Text Generation • Updated 17 days ago • 109 • 1
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic • Text Generation • Updated 17 days ago • 51 • 1
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16 • Text Generation • Updated 17 days ago • 218 • 3
nm-testing/tinyllama-one-shot-static-quant-test-compressed • Text Generation • Updated Oct 9, 2024 • 13