Image-Text-to-Text
sentence-transformers
Safetensors
Transformers
qwen2_vl
Qwen2-VL
conversational

Model Fine-Tuning

#2
by mrodriguesoliv - opened

What computer resources should I have to train this model in another language?

What computer resources should I have to train this model in another language?

With 768 batch pixels you should need around 150GB VRAM so 6x 3090/4090 with a batch size of 2

Sign up or log in to comment