quantized the model to 4-bits (using Q4_K_M method) 1a8a6e4 verified ersanbil commited on Apr 6, 2024