Update README.md
Browse files
README.md
CHANGED
@@ -77,13 +77,13 @@ Any model can provide inaccurate or incomplete information, and should be used i
|
|
77 |
|
78 |
The fastest way to get started with BLING is through direct import in transformers:
|
79 |
|
80 |
-
|
81 |
-
|
82 |
-
|
83 |
|
84 |
The BLING model was fine-tuned with a simple "\<human> and \<bot> wrapper", so to get the best results, wrap inference entries as:
|
85 |
|
86 |
-
|
87 |
|
88 |
The BLING model was fine-tuned with closed-context samples, which assume generally that the prompt consists of two sub-parts:
|
89 |
|
|
|
77 |
|
78 |
The fastest way to get started with BLING is through direct import in transformers:
|
79 |
|
80 |
+
from transformers import AutoTokenizer, AutoModelForCausalLM
|
81 |
+
tokenizer = AutoTokenizer.from_pretrained("llmware/bling-sheared-llama-2.7b-0.1")
|
82 |
+
model = AutoModelForCausalLM.from_pretrained("llmware/bling-sheared-llama-2.7b-0.1")
|
83 |
|
84 |
The BLING model was fine-tuned with a simple "\<human> and \<bot> wrapper", so to get the best results, wrap inference entries as:
|
85 |
|
86 |
+
full_prompt = "<human>: " + my_prompt + "\n" + "<bot>:"
|
87 |
|
88 |
The BLING model was fine-tuned with closed-context samples, which assume generally that the prompt consists of two sub-parts:
|
89 |
|