How can i fine-tune the Qwen2-VL model to make it completely become OCR model
#1
by
summon1d
- opened
The model used for document information extraction however depends a lot on how to ask questions. So I wondered if there was a way to turn it into an OCR model or if there was a prompt structure to extract the best information..?