Update README.md
README.md
CHANGED
@@ -4,11 +4,12 @@ pipeline_tag: text-to-image
 ---
 # Work and train in progress!
 
-
+⚡️Waifu: Efficient High-Resolution Waifu Synthesis
 
-## Waifu is a free text-to-image model that can efficiently generate images in 80 languages. Our goal is to create a small model without compromising on quality.
 
-
+waifu is a free text-to-image model that can efficiently generate images in 80 languages. Our goal is to create a small model without compromising on quality.
+
+## Core designs include:
 
 (1) [**AuraDiffusion/16ch-vae**](https://huggingface.co/AuraDiffusion/16ch-vae): A fully open source 16ch VAE. Natively trained in fp16. \
 (2) [**Linear DiT**](https://github.com/NVlabs/Sana): we use 1.6b DiT transformer with linear attention. \
@@ -16,7 +17,7 @@ pipeline_tag: text-to-image
 (4) Other: we use Flow-Euler sampler, Adafactor-Fused optimizer and bf16 precision for training, and combine efficient caption labeling (MoonDream, CogVlM, Human, Gpts's) and danbooru tags to accelerate convergence.
 
 
-
+## Example
 
 ```
 import torch