Scaling Up Personalized Aesthetic Assessment via Task Vector Customization
Abstract
The task of personalized image aesthetic assessment seeks to tailor aesthetic score prediction models to match individual preferences with just a few user-provided inputs. However, the scalability and generalization capabilities of current approaches are considerably restricted by their reliance on an expensive curated database. To overcome this long-standing scalability challenge, we present a unique approach that leverages readily available databases for general image aesthetic assessment and image quality assessment. Specifically, we view each database as a distinct image score regression task that exhibits varying degrees of personalization potential. By determining optimal combinations of task vectors, known to represent specific traits of each database, we successfully create personalized models for individuals. This approach of integrating multiple models allows us to harness a substantial amount of data. Our extensive experiments demonstrate the effectiveness of our approach in generalizing to previously unseen domains-a challenge previous approaches have struggled to achieve-making it highly applicable to real-world scenarios. Our novel approach significantly advances the field by offering scalable solutions for personalized aesthetic assessment and establishing high standards for future research. https://yeolj00.github.io/personal-projects/personalized-aesthetics/
Community
Project page: https://yeolj00.github.io/personal-projects/personalized-aesthetics/
Codes: Coming soon
Hi @YeolJoo congrats on this work!
Are you planning to share the score prediction model on the hub? If yes, here's a guide on how to do that: https://huggingface.co/docs/hub/models-uploading.
Let me know if you need any help!
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- CLIP-Guided Attribute Aware Pretraining for Generalizable Image Quality Assessment (2024)
- UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment (2024)
- Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback (2024)
- TriLoRA: Integrating SVD for Advanced Style Personalization in Text-to-Image Generation (2024)
- Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models (2024)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper