Rename gptq_model-4bit-128g.safetensors to model.safetensors 8a09460 verified matatonic commited on 14 days ago
Replace max_batch_size with batch_size for HybridCache (#3) ffaa2a5 verified runninglsy pbaylies commited on Nov 25, 2024