license: apple-ascl | |
pipeline_tag: image-text-to-text | |
This repository contains the Elva-OpenELM-1.1B model presented in [On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning](https://huggingface.co/papers/2406.11823). | |