Improved precision / reduced frequency of nan outputs, allow bf16 t5, f32 rmsnorm, larger clamp f708e90 aredden committed on Sep 7, 2024
Remove f8 flux, instead configure at load, improved quality & corrected configs 1f9e684 aredden committed on Aug 24, 2024
Fix issue where torch.dtype throws error when converting to dtype fb7df61 aredden committed on Aug 24, 2024
Dynamic swap with cublas linear / optional improved precision with vram drawback 37bd8c1 aredden committed on Aug 24, 2024
Remove unnecessary code, hide prints behind debug flag, hide warnings 0f3134f aredden committed on Aug 20, 2024
Adding device specific configs & more input image type options + small model spec from args change e81fa57 aredden committed on Aug 20, 2024
Add fields to configs, fix issue with offload from bnb, remove extra random text code 340f0a0 aredden committed on Aug 19, 2024
Fix non-offload inference & add option to load from prequantized flux 2f2c44c aredden committed on Aug 18, 2024