Quantized the model to 4 bits (using the Q4_K_M method)

#2
by ersanbil - opened
No description provided.
ersanbil changed pull request status to closed
