Doctor-Shotgun
/

lzlv-limarpv3-l2-70b-exl2

Text Generation

Model card Files Files and versions Community

Doctor-Shotgun commited on Nov 4, 2023

Commit

d67a66e

·

1 Parent(s): 1c0ff80

Create README.md

Files changed (1) hide show

README.md +17 -0

README.md ADDED Viewed

	@@ -0,0 +1,17 @@

+---
+inference: false
+language:
+- en
+pipeline_tag: text-generation
+tags:
+- llama
+- llama-2
+---
+# lzlv-limarpv3-l2-70b-exl2
+Exllama v2 quant of [Doctor-Shotgun/lzlv-limarpv3-l2-70b](https://huggingface.co/Doctor-Shotgun/lzlv-limarpv3-l2-70b)
+Branches:
+- main: measurement.json calculated at 2048 token calibration rows on PIPPA
+- 5.0bpw-h6: 5 decoder bits per weight, 6 head bits
+  - ideal for 2x 24gb GPUs at 8192 context, or 1x 48gb GPU at 8192 context with CFG cache