Commit History

cc15333  fix unloading bug (aredden)
61a425a  update readme (aredden)
e8041a4  Make lora loading api endpoint functional (aredden)
e21ae14  Improve lora implementation (aredden)
264acad  lora unloading initial (aredden)
e6814a2  Merge pull request #14 from flowpoint/rtx4000ada_bench (aredden, unverified)
d316f04  add benchmarks numbers for rtx4000ada (non-sff) (flowpoint)
49c776c  Update README.md, Fix license link (aredden, unverified)
f708e90  Improved precision / reduced frequency of nan outputs, allow bf16 t5, f32 rmsnorm, larger clamp (aredden)
3cc2f3f  Create LICENSE (aredden, unverified)
138a97c  Merge pull request #11 from ClashLuke/patch-1 (aredden, unverified)
6fbe1d5  remove wrongly added lines (Lucas Nestler, unverified)
c541d34  add h100 (Lucas Nestler, unverified)
d45a331  remove torchao dependency, quantize entirely via linear (aredden)
3ddaa67  Fix issues with loading F8Linear from state dict when init_scale not initialized & loaded from meta device (aredden)
7a7b2c1  Fix issue where lora alpha is not correct if lora from transformers checkpoint (aredden)
6d82dcc  Small fix for issue where f16 CublasLinear layers weren't being used even when available. (aredden)
00f5d2c  Ensure repo only accesses CublasLinear lazily (aredden)
fee1af5  Fix issue loading loras (aredden)
a71da07  Add lora loading (aredden)
af20799  Merge pull request #3 from aredden/improved_precision (aredden, unverified)
49f2076  Update readme (aredden)
604f17d  Add quantize embedders/modulation to argparse options (aredden)
1f9e684  Remove f8 flux, instead configure at load, improved quality & corrected configs (aredden)
fb7df61  Fix issue where torch.dtype throws error when converting to dtype (aredden)
ac049be  Move compile out of FluxPipeline init (aredden)
37bd8c1  Dynamic swap with cublas linear / optional improved precision with vram drawback (aredden)
25ae92b  Allow overriding config values from load_pipeline_from_config_path (aredden)
b84f35e  Merge pull request #2 from XmYx/main (aredden, unverified)
9b84867  python 3.10 compatibility (mixy89, unverified)
c247326  Merge pull request #1 from dsingal0/main (aredden, unverified)
44e4014  Remove unnecessary tokenization (still needs work) (aredden)
7cec457  Add case where seed is string & try/catch if invalid (aredden)
ffa6ff7  Remove unnecessary synchronize, add more universal seeding & limit if run on windows (aredden)
6d0762c  Remove more unnecessary code, fix small typing hickup (aredden)
0f3134f  Remove unnecessary code, hide prints behind debug flag, hide warnings (aredden)
4a2503e  Update README.md - Fix config path names (aredden, unverified)
42be379  Small README update for clarification (aredden)
9dc5b0b  Add all relevent args to argparser & update readme (aredden)
58082af  Merge branch 'main' of https://github.com/aredden/flux-fp16-acc-api into main (aredden)
e81fa57  Adding device specific configs & more input image type options + small model spec from args change (aredden)
d9b39c9  Update README.md (aredden, unverified)
21a2bf7  make table more clear- again (aredden)
50c4b9f  make table more clear (aredden)
efb953a  Add speed comparison (aredden)
6bd57ab  Add uvicorn requirement (aredden)
340f0a0  Add fields to configs, fix issue with offload from bnb, remove extra random text code (aredden)
1626546  Make README.md more comprehensive (aredden)
0aa9861  Fix for nightly (aredden)