Update README.md
# Model Card for Model ID
<!-- Provide a quick summary of what the model is/does. -->
This model aims to optimize QA and summarization tasks for the capstone project "Edge LLM - Reducing LLM Memory Footprint to < 2GB" at UW, sponsored by Amazon.
### Model Description
<!-- Provide a longer summary of what this model is. -->
The base model is Yao1627/Llama-2-7b-chat-hf-shortgpt-25-percent-lora: a Llama-2-7b-chat-hf model pruned with ShortGPT by 25% (8 layers) according to Block Influence scores, then fine-tuned with LoRA on the timdettmers/openassistant-guanaco dataset.
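
The pruning criterion above can be sketched as follows. ShortGPT scores each transformer block by Block Influence — one minus the mean cosine similarity between the block's input and output hidden states — and removes the lowest-scoring blocks. This is a minimal pure-Python illustration of the metric, not the actual pruning code used for this model; the function names and toy vectors are hypothetical.

```python
import math

def cosine(u, v):
    # Cosine similarity between two vectors.
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def block_influence(hidden_in, hidden_out):
    # Block Influence (ShortGPT): 1 - mean cosine similarity between a
    # block's input and output hidden states. A low score means the block
    # barely transforms its input, so it is a candidate for removal.
    sims = [cosine(x, y) for x, y in zip(hidden_in, hidden_out)]
    return 1.0 - sum(sims) / len(sims)

# Toy example: a near-identity block changes its input very little,
# so its Block Influence is close to 0 (prune candidate).
x_in = [[1.0, 0.0], [0.0, 1.0]]
x_out = [[0.99, 0.01], [0.01, 0.99]]
print(block_influence(x_in, x_out))
```

In the full method, these scores are computed per layer on a calibration set, the layers are ranked, and the 8 lowest-scoring layers (25% of the 32 in Llama-2-7b) are dropped before LoRA fine-tuning recovers quality.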
This model is further fine-tuned on a randomly selected 10k-sample subset of the ShareGPT dataset, available at: <https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/blob/main/ShareGPT_V3_unfiltered_cleaned_split_no_imsorry.json>