magic-diffusion / README.md
Mike Dite
add mox example
44374d0
|
raw
history blame
18.3 kB
metadata
datasets:
  - rullaf/mtg-art
pipeline_tag: text-to-image
tags:
  - art

Magic Diffusion

A text2img model derived from StableDiffusion 1.5, fine-tuned with EveryDream-trainer on a dataset consisting of post-processed Magic the Gathering card art crops (32,159), and hi-resolution images of the art (13,048). Annotations are based on card metadata and various other sources, including art description.

Comparison

For MtG card art, this model performs comparably to Fantasy Card Diffusion v1. Both outperform generic models such as Open Journey v2, and baseline Stable Diffusion 1.5.

Conclusions:

  • Magic Diffusion v2 likes to draw borders and frames
  • Fantasy Card Diffusion v1 better preserves the MtG art style than Magic Diffusion v2, but it suffers from halftone/rosetta artifacts
  • OpenJourney v2 is much hornier than the rest of the models, but the results for generic concepts are comparable
  • Stable Diffusion v1.5 produces noticeably worse results than the other models, and requires a lot of negative keywords
  • All models seem to benefit from feature description, such as “sliver creature with long beak and tendrils” instead of just “sliver”

Settings

All images were generated with identical settings:

  • 40 steps
  • 512x512
  • seed 1111, 2345, 3579, 4813, 6047, 7281, 8515, 9749

Presumably the results could be further improved with better prompts, targeted at specific models, but that is not the point of this comparison. Magic Diffusion does better without “artist signature”, Fantasy Card Diffusion may benefit from “halftone rosetta”, and Stable Diffusion 1.5 likes to draw the card frames.

Sliver

Prompt

speedy sliver creature Creature a fast sliver is speeding through the Mardu steppe landscape Khans of tarkir beautiful composition, MTG card art by John avon

Negative Prompt

text frame card border human humanoid artist signature

Magic Diffusion v2 Fantasy Card Diffusion v1 Openjourney v2 Stable Diffusion 1.5
mdv2 1111 fcd v1 1111 oj v2 1111 sd 1.5 1111
mdv2 2345 fcd v1 2345 oj v2 2345 sd 1.5 2345
mdv2 3579 fcd v1 3579 oj v2 3579 sd 1.5 3579
mdv2 4813 fcd v1 4813 oj v2 4813 sd 1.5 4813
mdv2 6047 fcd v1 6047 oj v2 6047 sd 1.5 6047
mdv2 7281 fcd v1 7281 oj v2 7281 sd 1.5 7281
mdv2 8515 fcd v1 8515 oj v2 8515 sd 1.5 8515
mdv2 9749 fcd v1 9749 oj v2 9749 sd 1.5 9749

Taylor

Prompt

mtg card art Taylor Swift wandering bard legendary creature human bard by chris rahn by volkan baga by zoltan boros armored bard taylor swift holding her weapons and instruments beautiful composition detailed realistic fantasy painting masterpiece best quality

Negative Prompt

guitar lowres bad anatomy bad hands text error missing fingers extra digit fewer digits cropped worst quality low quality normal quality jpeg artifacts signature watermark username blurry

Magic Diffusion v2 Fantasy Card Diffusion v1 Openjourney v2 Stable Diffusion 1.5
mdv2 1111 fcd v1 1111 oj v2 1111 sd 1.5 1111
mdv2 2345 fcd v1 2345 oj v2 2345 sd 1.5 2345
mdv2 3579 fcd v1 3579 oj v2 3579 sd 1.5 3579
mdv2 4813 fcd v1 4813 oj v2 4813 sd 1.5 4813
mdv2 6047 fcd v1 6047 oj v2 6047 sd 1.5 6047
mdv2 7281 fcd v1 7281 oj v2 7281 sd 1.5 7281
mdv2 8515 fcd v1 8515 oj v2 8515 sd 1.5 8515
mdv2 9749 fcd v1 9749 oj v2 9749 sd 1.5 9749

Mox

Prompt

mox topaz artifact on a chain rare mtg card art by dan frazier

Negative Prompt

card border frame lowres cropped worst quality low quality normal quality jpeg artifacts watermark blurry

Magic Diffusion v2 Fantasy Card Diffusion v1 Openjourney v2 Stable Diffusion 1.5
mdv2 1111 fcd v1 1111 oj v2 1111 sd 1.5 1111
mdv2 2345 fcd v1 2345 oj v2 2345 sd 1.5 2345
mdv2 3579 fcd v1 3579 oj v2 3579 sd 1.5 3579
mdv2 4813 fcd v1 4813 oj v2 4813 sd 1.5 4813
mdv2 6047 fcd v1 6047 oj v2 6047 sd 1.5 6047
mdv2 7281 fcd v1 7281 oj v2 7281 sd 1.5 7281
mdv2 8515 fcd v1 8515 oj v2 8515 sd 1.5 8515
mdv2 9749 fcd v1 9749 oj v2 9749 sd 1.5 9749