anthonymikinka
/

Meta-Llama-3-8B-Instruct

Text Generation

Model card Files Files and versions Community

anthonymikinka commited on Jul 8, 2024

Commit

5d4b0ed

·

verified ·

1 Parent(s): 0ab49ac

Update README.md

Files changed (1) hide show

README.md +5 -1

README.md CHANGED Viewed

@@ -19,5 +19,9 @@ This repository contains a Core ML conversion of [meta-llama/Meta-Llama-3-8B](ht
 This does not have KV Cache. only: inputs int32 / outputs float16.
 I haven't been able to test this, so leave something in 'Community' to let me know how ya tested it and how it worked.
-I did model.half() before scripting / coverting thinking it would reduce my memory usage (I found online that it doesn't). I am unsure if it affected the conversion process or not.

 This does not have KV Cache. only: inputs int32 / outputs float16.
 I haven't been able to test this, so leave something in 'Community' to let me know how ya tested it and how it worked.
+I did model.half() before scripting / coverting thinking it would reduce my memory usage (I found online that it doesn't).
+I am unsure if it affected the conversion process or not.