Update README.md
README.md CHANGED
@@ -17,6 +17,26 @@ datasets:
- HuggingFaceH4/ultrachat_200k
- liuhaotian/LLaVA-Instruct-150K
---

## Usage

Use the code below to get started with the model:

```py
# pip install accelerate transformers torch

import torch
from transformers import pipeline

# Load the model in bfloat16 and shard it across available devices
pipe = pipeline(
    "text-generation",
    model="vicky4s4s/Mistral-instruct-46B",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Format the conversation with the model's chat template
messages = [
    {"role": "user", "content": "what is your name?"},
]
prompt = pipe.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

# Generate a sampled response
outputs = pipe(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print(outputs[0]["generated_text"])
```
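For finer control over generation, the same checkpoint can also be loaded through the lower-level `transformers` API. This is a minimal sketch, assuming the same model id and chat template as above:

```py
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "vicky4s4s/Mistral-instruct-46B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# apply_chat_template can tokenize directly and return tensors
input_ids = tokenizer.apply_chat_template(
    [{"role": "user", "content": "what is your name?"}],
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```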
# Model Card for Mixtral-8x7B

The Mixtral-8x7B Large Language Model (LLM) is a pretrained generative Sparse Mixture of Experts. The Mixtral-8x7B outperforms Llama 2 70B on most benchmarks we tested.
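To make "Sparse Mixture of Experts" concrete: instead of one large feed-forward block, the model keeps several expert blocks and a router that sends each token to only a few of them, so only a fraction of the parameters run per token. Below is a minimal, illustrative PyTorch sketch of top-2 routing; the dimensions, activation, and routing details are assumptions for illustration, not Mixtral's actual implementation:

```py
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoE(nn.Module):
    def __init__(self, dim=32, hidden=64, num_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        # The router scores every token against every expert
        self.router = nn.Linear(dim, num_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, hidden), nn.SiLU(), nn.Linear(hidden, dim))
            for _ in range(num_experts)
        )

    def forward(self, x):  # x: (num_tokens, dim)
        # Keep only the top-k experts per token; the rest are never executed
        weights, idx = torch.topk(self.router(x), self.top_k, dim=-1)
        weights = F.softmax(weights, dim=-1)
        out = torch.zeros_like(x)
        for k in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, k] == e
                if mask.any():
                    out[mask] += weights[mask, k].unsqueeze(-1) * expert(x[mask])
        return out

moe = SparseMoE()
tokens = torch.randn(4, 32)
print(moe(tokens).shape)  # torch.Size([4, 32]); each token used only 2 of 8 experts
```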