FLUX.1-schnell-fp8-flumina / float8_quantize.py

Commit History

Remove more unnecessary code, fix small typing hiccup
6d0762c

aredden committed on

Remove unnecessary code, hide prints behind debug flag, hide warnings
0f3134f

aredden committed on

Add fields to configs, fix issue with offload from bnb, remove extra random text code
340f0a0

aredden committed on

Fix for nightly
0aa9861

aredden committed on

CUDA version checks
b6617b1

aredden committed on

Fix non-offload inference & add option to load from prequantized flux
2f2c44c

aredden committed on

Add offloading & improved fp8 inference.
28dec30

aredden committed on