-
Prompt me a Dataset: An investigation of text-image prompting for historical image dataset creation using foundation models
Paper • 2309.01674 • Published • 2 -
Segment Anything
Paper • 2304.02643 • Published • 4 -
EgoLifter: Open-world 3D Segmentation for Egocentric Perception
Paper • 2403.18118 • Published • 12 -
A Multimodal Automated Interpretability Agent
Paper • 2404.14394 • Published • 21
Collections
Discover the best community collections!
Collections including paper arxiv:2309.01674
-
Veagle: Advancements in Multimodal Representation Learning
Paper • 2403.08773 • Published • 9 -
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
Paper • 2304.14178 • Published • 3 -
Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs
Paper • 2403.12596 • Published • 10 -
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Paper • 2403.11703 • Published • 17
-
Character Queries: A Transformer-based Approach to On-Line Handwritten Character Segmentation
Paper • 2309.03072 • Published • 2 -
Prompt me a Dataset: An investigation of text-image prompting for historical image dataset creation using foundation models
Paper • 2309.01674 • Published • 2 -
Segment Anything
Paper • 2304.02643 • Published • 4
-
FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation
Paper • 2403.06775 • Published • 3 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 7 -
Data Incubation -- Synthesizing Missing Data for Handwriting Recognition
Paper • 2110.07040 • Published • 2 -
A Mixture of Expert Approach for Low-Cost Customization of Deep Neural Networks
Paper • 1811.00056 • Published • 2
-
Measuring the Effects of Data Parallelism on Neural Network Training
Paper • 1811.03600 • Published • 2 -
Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
Paper • 1804.04235 • Published • 2 -
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Paper • 1905.11946 • Published • 3 -
Yi: Open Foundation Models by 01.AI
Paper • 2403.04652 • Published • 62