How can i fine-tune the Qwen2-VL model to make it completely become OCR model

by summon1d - opened Sep 19, 2024

Sep 19, 2024

The model used for document information extraction however depends a lot on how to ask questions. So I wondered if there was a way to turn it into an OCR model or if there was a prompt structure to extract the best information..?

ssary

9 days ago

@summon1d Have you tried something that makes it better at OCR ?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment