TinyLlama-1.1B-ckpt-2.5T-exl2

EXL2 quants of TinyLlama/TinyLlama-1.1B-intermediate-step-1195k-token-2.5T intended for use in speculative decoding.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Collection including royallab/TinyLlama-1.1B-ckpt-2.5T-exl2