doberst commited on
Commit
9845df9
·
1 Parent(s): 4c36692

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -77,13 +77,13 @@ Any model can provide inaccurate or incomplete information, and should be used i
77
 
78
  The fastest way to get started with BLING is through direct import in transformers:
79
 
80
- from transformers import AutoTokenizer, AutoModelForCausalLM
81
- tokenizer = AutoTokenizer.from_pretrained("llmware/bling-sheared-llama-2.7b-0.1")
82
- model = AutoModelForCausalLM.from_pretrained("llmware/bling-sheared-llama-2.7b-0.1")
83
 
84
  The BLING model was fine-tuned with a simple "\<human> and \<bot> wrapper", so to get the best results, wrap inference entries as:
85
 
86
- full_prompt = "\<human>\: " + my_prompt + "\n" + "\<bot>\:"
87
 
88
  The BLING model was fine-tuned with closed-context samples, which assume generally that the prompt consists of two sub-parts:
89
 
 
77
 
78
  The fastest way to get started with BLING is through direct import in transformers:
79
 
80
+ from transformers import AutoTokenizer, AutoModelForCausalLM
81
+ tokenizer = AutoTokenizer.from_pretrained("llmware/bling-sheared-llama-2.7b-0.1")
82
+ model = AutoModelForCausalLM.from_pretrained("llmware/bling-sheared-llama-2.7b-0.1")
83
 
84
  The BLING model was fine-tuned with a simple "\<human> and \<bot> wrapper", so to get the best results, wrap inference entries as:
85
 
86
+ full_prompt = "<human>: " + my_prompt + "\n" + "<bot>:"
87
 
88
  The BLING model was fine-tuned with closed-context samples, which assume generally that the prompt consists of two sub-parts:
89