2 13 43

Juan CM

jucamohedano

AI & ML interests

Deep Learning and Robotics 🚀🤖

Recent Activity

updated a model about 2 months ago

jucamohedano/paligemma_a-okvqa

View all activity

Organizations

jucamohedano's activity

updated a model about 2 months ago

jucamohedano/paligemma_a-okvqa

Updated Nov 15, 2024 • 3

updated a model 3 months ago

jucamohedano/char-lstm-shakespeare

Updated Sep 22, 2024

liked a dataset 3 months ago

karpathy/tiny_shakespeare

Updated Jan 18, 2024 • 1.92k • 44

updated a model 3 months ago

jucamohedano/char-lstm-shakespeare_

Updated Sep 21, 2024

liked a model 8 months ago

microsoft/Phi-3-vision-128k-instruct

Text Generation • Updated Aug 20, 2024 • 63.8k • 943

upvoted an article 8 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024

• 229

reacted to merve's post with 🚀 8 months ago

Post

1758

New open Vision Language Model by @Google : PaliGemma 💙🤍

📝 Comes in 3B, pretrained, mix and fine-tuned models in 224, 448 and 896 resolution
🧩 Combination of Gemma 2B LLM and SigLIP image encoder
🤗 Supported in transformers

PaliGemma can do..
🧩 Image segmentation and detection! 🤯
📑 Detailed document understanding and reasoning
🙋 Visual question answering, captioning and any other VLM task!

Read our blog 🔖 hf.co/blog/paligemma
Try the demo 🪀 hf.co/spaces/google/paligemma
Check out the Spaces and the models all in the collection 📚 google/paligemma-release-6643a9ffbf57de2ae0448dda
Collection of fine-tuned PaliGemma models google/paligemma-ft-models-6643b03efb769dad650d2dda