aredden's picture
Improved precision / reduced frequency of nan outputs, allow bf16 t5, f32 rmsnorm, larger clamp
f708e90