CalamitousFelicitousness
/

Qwen2-VL-72B-Instruct-GPTQ-Int8-tpfix

Image-Text-to-Text

8-bit precision

Model card Files Files and versions Community

CalamitousFelicitousness commited on Sep 22, 2024

Commit

6ea93b2

·

verified ·

1 Parent(s): 237f28d

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -9,6 +9,7 @@ tags:
 - multimodal
 base_model: Qwen/Qwen2-VL-72B-Instruct
 ---
 # Qwen2-VL-72B-Instruct-GPTQ-Int8

 - multimodal
 base_model: Qwen/Qwen2-VL-72B-Instruct
 ---
+# This repo contains a fix for intermediate_size which was incompatible with VLLM parallel inference. This repo will allow you to run with tensor_parallel of 2.
 # Qwen2-VL-72B-Instruct-GPTQ-Int8