zamroni111/Meta-Llama-3.1-8B-Instruct-ONNX-DirectML-GenAI-INT4

Text Generation

zamroni111 committed on Oct 24, 2024

Commit eb77e74 · verified · 1 Parent(s): a840c14

Update README.md

Files changed (1)

README.md +9 -0

@@ -19,6 +19,15 @@ tags:
 ## Model Details
 meta-llama/Meta-Llama-3.1-8B-Instruct quantized to ONNX GenAI INT4 with Microsoft DirectML optimization.<br>
 Output is reformatted so that each sentence starts on a new line to improve readability.
+<pre>
+...
+currentdecoded = tokenizer_stream.decode(new_token)
+if re.findall("^[\x2E\x3A\x3B]$", lastdecoded) and currentdecoded.startswith(" ") and (not currentdecoded.startswith(" *")) :
+  currentdecoded = "\n" + currentdecoded.replace(" ", "", 1)
+print(currentdecoded, end='', flush=True)
+lastdecoded = currentdecoded
+...
+</pre>
 ### Model Description
 meta-llama/Meta-Llama-3.1-8B-Instruct quantized to ONNX GenAI INT4 with Microsoft DirectML optimization<br>
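
The snippet added in this commit is elided on both sides (`...`), so it cannot be run on its own. Below is a minimal, self-contained sketch of the same line-breaking rule, assuming the decoded pieces arrive one at a time as plain strings. The function name `break_sentences` and the `sample` list are hypothetical stand-ins for the loop around `tokenizer_stream.decode(new_token)`; they are not part of the README or of any library.

```python
import re

# Matches a piece that is exactly ".", ":" or ";"
# (the same characters as \x2E, \x3A and \x3B in the README snippet).
SENTENCE_END = re.compile(r"[.:;]")


def break_sentences(pieces):
    """Yield decoded pieces, moving each new sentence onto its own line.

    Hypothetical helper: `pieces` is any iterable of already-decoded strings.
    """
    last = ""
    for piece in pieces:
        if (
            SENTENCE_END.fullmatch(last)
            and piece.startswith(" ")
            and not piece.startswith(" *")  # leave " *" (markdown emphasis/bullets) alone
        ):
            # Swap the single leading space for a newline.
            piece = "\n" + piece[1:]
        yield piece
        last = piece


# Made-up token stream standing in for the model's decoded output.
sample = ["Hello", " world", ".", " Next", " one", ":", " *", "item"]
print("".join(break_sentences(sample)), end="", flush=True)
```

As in the README snippet, the rule only fires when the previous piece is exactly one punctuation character, so a token that decodes to something like `".)"` does not trigger a line break.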