sdtana's picture

sdtana

sdtana

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

1.58-bit FLUX

updated a dataset 7 days ago

sdtana/pixiv_2023-2024_over_10k_likes

upvoted a paper 13 days ago

Parallelized Autoregressive Visual Generation

View all activity

Organizations

None yet

sdtana's activity

upvoted a paper 4 days ago

1.58-bit FLUX

Paper • 2412.18653 • Published 13 days ago • 66

upvoted a paper 13 days ago

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published 18 days ago • 49

upvoted a paper 14 days ago

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

Paper • 2412.16112 • Published 17 days ago • 21

upvoted 2 papers 27 days ago

APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published about 1 month ago • 38

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published about 1 month ago • 123

upvoted 5 papers about 1 month ago

Negative Token Merging: Image-based Adversarial Feature Guidance

Paper • 2412.01339 • Published Dec 2, 2024 • 22

Adaptive Blind All-in-One Image Restoration

Paper • 2411.18412 • Published Nov 27, 2024 • 4

ROICtrl: Boosting Instance Control for Visual Generation

Paper • 2411.17949 • Published Nov 27, 2024 • 82

Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis

Paper • 2411.17769 • Published Nov 26, 2024 • 7

Style-Friendly SNR Sampler for Style-Driven Generation

Paper • 2411.14793 • Published Nov 22, 2024 • 36

upvoted a paper about 2 months ago

SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Paper • 2411.05007 • Published Nov 7, 2024 • 16

upvoted 4 papers 4 months ago

Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

Paper • 2409.11355 • Published Sep 17, 2024 • 29

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 109

Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation

Paper • 2409.04410 • Published Sep 6, 2024 • 23

FLUX that Plays Music

Paper • 2409.00587 • Published Sep 1, 2024 • 32

upvoted 3 papers 6 months ago

Efficient Training with Denoised Neural Weights

Paper • 2407.11966 • Published Jul 16, 2024 • 8

Vision language models are blind

Paper • 2407.06581 • Published Jul 9, 2024 • 83

Dataset Size Recovery from LoRA Weights

Paper • 2406.19395 • Published Jun 27, 2024 • 18

upvoted 2 papers 7 months ago

Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering

Paper • 2406.10208 • Published Jun 14, 2024 • 21

An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels

Paper • 2406.09415 • Published Jun 13, 2024 • 50