recoilme commited on
Commit
6ec1154
·
verified ·
1 Parent(s): 3bdbe6e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -4
README.md CHANGED
@@ -4,11 +4,12 @@ pipeline_tag: text-to-image
4
  ---
5
  # Work and train in progress!
6
 
7
- # ⚡️Waifu: Efficient High-Resolution Waifu Synthesis
8
 
9
- ## Waifu is a free text-to-image model that can efficiently generate images in 80 languages. Our goal is to create a small model without compromising on quality.
10
 
11
- ### Core designs include:
 
 
12
 
13
  (1) [**AuraDiffusion/16ch-vae**](https://huggingface.co/AuraDiffusion/16ch-vae): A fully open source 16ch VAE. Natively trained in fp16. \
14
  (2) [**Linear DiT**](https://github.com/NVlabs/Sana): we use 1.6b DiT transformer with linear attention. \
@@ -16,7 +17,7 @@ pipeline_tag: text-to-image
16
  (4) Other: we use Flow-Euler sampler, Adafactor-Fused optimizer and bf16 precision for training, and combine efficient caption labeling (MoonDream, CogVlM, Human, Gpts's) and danbooru tags to accelerate convergence.
17
 
18
 
19
- ### Example
20
 
21
  ```
22
  import torch
 
4
  ---
5
  # Work and train in progress!
6
 
7
+ ⚡️Waifu: Efficient High-Resolution Waifu Synthesis
8
 
 
9
 
10
+ waifu is a free text-to-image model that can efficiently generate images in 80 languages. Our goal is to create a small model without compromising on quality.
11
+
12
+ ## Core designs include:
13
 
14
  (1) [**AuraDiffusion/16ch-vae**](https://huggingface.co/AuraDiffusion/16ch-vae): A fully open source 16ch VAE. Natively trained in fp16. \
15
  (2) [**Linear DiT**](https://github.com/NVlabs/Sana): we use 1.6b DiT transformer with linear attention. \
 
17
  (4) Other: we use Flow-Euler sampler, Adafactor-Fused optimizer and bf16 precision for training, and combine efficient caption labeling (MoonDream, CogVlM, Human, Gpts's) and danbooru tags to accelerate convergence.
18
 
19
 
20
+ ## Example
21
 
22
  ```
23
  import torch