I can only say it is totally incomprehensible.

#1
by JLouisBiz - opened

The answer I get to "Hello" is a total disaster. This model is not usable.

It did fit entirely on a 4 GB GPU, though.

I am using QwQ-LCoT-3B-Instruct.Q4_K_M.gguf, which works totally fine.
