--- license: mit datasets: - liuhaotian/LLaVA-Pretrain - lmms-lab/LLaVA-ReCap-558K - lmms-lab/LLaVA-ReCap-118K - lmms-lab/LLaVA-ReCap-CC3M - lmms-lab/LLaVA-OneVision-Mid-Data - lmms-lab/LLaVA-OneVision-Data - Zhiqiang007/MathV360K language: - en base_model: - Qwen/Qwen2-0.5B-Instruct - google/siglip-so400m-patch14-384 tags: - LLaVA-OneVision-Manager - LLaVA-OV-Manager - Manager --- Model weights for our submission to TCSVT, titled "Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs". Related materials can be found at https://looperxx.github.io/.