Lamimad
/

luna-standard-0.0.1

Text Generation

text-generation-inference

Inference Endpoints

4-bit precision

Model card Files Files and versions Community

luna-standard-0.0.1 / README.md

Lamimad's picture

Update README.md

bba4ebe over 1 year ago

|

history blame contribute delete

1.33 kB

	---
	license: apache-2.0
	language:
	- en
	pipeline_tag: text-generation
	---

	# Model Card for Luna-standard-0.0.1

	The Luna-standard-0.0.1 Large Language Model (LLM) is a instruct fine-tuned version of the [Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) generative text model using a variety of publicly available conversation datasets.

	For full details of this model please read our [paper](https://arxiv.org/abs/2310.06825) and [release blog post](https://mistral.ai/news/announcing-mistral-7b/).

	## Instruction format

	In order to leverage instruction fine-tuning, your prompt should be surrounded by `[INST]` and `[/INST]` tokens. The very first instruction should begin with a begin of sentence id. The next instructions should not. The assistant generation will be ended by the end-of-sentence token id.

	E.g.
	```
	text = "<s>[INST] What is your favourite condiment? [/INST]"
	"Well, I'm quite partial to a good squeeze of fresh lemon juice. It adds just the right amount of zesty flavour to whatever I'm cooking up in the kitchen!</s> "
	"[INST] Do you have mayonnaise recipes? [/INST]"
	```

	## Model Architecture
	This instruction model is based on Mistral-7B-v0.1, a transformer model with the following architecture choices:
	- Grouped-Query Attention
	- Sliding-Window Attention
	- Byte-fallback BPE tokenizer