ahxt
/

llama2_xs_460M_experimental

ahxt committed on Sep 10, 2023

Commit 5ed0b0e · 1 Parent(s): b27541b

update readme.md

Files changed (1)

README.md +26 -2

@@ -1,4 +1,16 @@
 # LLaMa Lite: Reduced-Scale, Experimental Versions of LLaMA and LLaMa 2
@@ -23,13 +35,25 @@ model = AutoModelForCausalLM.from_pretrained(model_path)
 tokenizer = AutoTokenizer.from_pretrained(model_path)
 model.eval()
-prompt = 'Q: What is the highest mountain?\nA:'
 input_ids = tokenizer(prompt, return_tensors="pt").input_ids
 tokens = model.generate(input_ids, max_length=20)
 print( tokenizer.decode(tokens[0].tolist(), skip_special_tokens=True) )
 # Q: What is the largest bird?\nA: The largest bird is the bald eagle.
 ```
 ## Contact

+---
+language:
+  - English
+tags:
+- llama2
+- llama-2
+- llama
+- llama2 architecture
+datasets:
+- Redpajama
+metrics:
+- MMLU
+---
 # LLaMa Lite: Reduced-Scale, Experimental Versions of LLaMA and LLaMa 2
 tokenizer = AutoTokenizer.from_pretrained(model_path)
 model.eval()
+prompt = 'Q: What is the largest bird?\nA:'
 input_ids = tokenizer(prompt, return_tensors="pt").input_ids
 tokens = model.generate(input_ids, max_length=20)
 print( tokenizer.decode(tokens[0].tolist(), skip_special_tokens=True) )
 # Q: What is the largest bird?\nA: The largest bird is the bald eagle.
 ```
+## Evaluation
+We evaluate our models on the MMLU task
+markdown table
+| Models | #parameters |zero-shot |  5-shot |
+| --- | --- | --- | --- |
+| llama                       | 7B    | 28.46 | 35.05 |
+| openllama                   | 3B    | 24.90 | 26.71 |
+|TinyLlama-1.1B-step-50K-105b | 1.1B  | 19.00 | 26.53 |
+| llama2_xs_460M              | 0.46B | 21.13 | 26.39 |
 ## Contact
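
The Evaluation table added in this commit lists zero-shot and 5-shot MMLU scores side by side. As a reading aid, here is a minimal sketch that computes the 5-shot gain over zero-shot for each row. The scores are copied from the table; the dictionary name `MMLU_SCORES` and the helper `few_shot_gain` are hypothetical, and the scores are assumed to be accuracy in percent, which the README does not state explicitly.

```python
# Scores copied from the Evaluation table in this commit.
# Assumption: values are MMLU accuracy in percent (the README gives no unit).
# Each entry maps model name -> (zero-shot, 5-shot).
MMLU_SCORES = {
    "llama": (28.46, 35.05),
    "openllama": (24.90, 26.71),
    "TinyLlama-1.1B-step-50K-105b": (19.00, 26.53),
    "llama2_xs_460M": (21.13, 26.39),
}


def few_shot_gain(scores):
    """Return {model: 5-shot minus zero-shot}, rounded to two decimals."""
    return {
        name: round(five_shot - zero_shot, 2)
        for name, (zero_shot, five_shot) in scores.items()
    }


gains = few_shot_gain(MMLU_SCORES)
for name, gain in gains.items():
    print(f"{name}: {gain:+.2f}")
```

On these numbers the 460M model gains about 5.26 points from 5-shot prompting, and its 5-shot score (26.39) sits within half a point of the 3B and 1.1B baselines in the table.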