AchyuthGamer committed · Commit 03ae4e7 · Parent(s): 77b181b

Update README.md

README.md CHANGED
@@ -1,9 +1,9 @@
 ---
-base_model:
+base_model: OpenGPTai/OpenGPT-7B-Instruct-v1.0
 inference: true
 license: apache-2.0
 model_creator: Achyuth Gamer
-model_name: OpenGPT 7b
+model_name: OpenGPT 7b v1.0
 model_type: opengpt
 pipeline_tag: text-generation
 prompt_template: '<s>[INST] {prompt} [/INST]'
@@ -12,14 +12,14 @@ tags:
 - finetuned
 ---
 
-#
+# OpenGPT 7B Instruct v1.0
 - Model creator: [Achyuth Gamer](https://huggingface.co/AchyuthGamer)
 - Original model: [OpenGPT](https://huggingface.co/AchyuthGamer/OpenGPT)
 
 <!-- description start -->
 ## Description
 
-This repo contains GPTQ model files for [Achyuth AI's OpenGPT 7B
+This repo contains GPTQ model files for [Achyuth AI's OpenGPT 7B v1.0](https://huggingface.co/AchyuthGamer/OpenGPT-7b-1.0).
 
 Multiple GPTQ parameter permutations are provided; see Provided Files below for details of the options provided, their parameters, and the software used to create them.
 
@@ -27,7 +27,7 @@ Multiple GPTQ parameter permutations are provided; see Provided Files below for
 
 These models are confirmed to work with ExLlama v1.
 
-At the time of writing (September 28th), AutoGPTQ has not yet added support for the new
+At the time of writing (September 28th), AutoGPTQ has not yet added support for the new OpenGPT models.
 
 These GPTQs were made directly from Transformers, and so can be loaded via the Transformers interface. They can't be loaded directly from AutoGPTQ.
 
@@ -42,11 +42,11 @@ pip3 install git+https://github.com/huggingface/transformers.git@72958fcd3c98a7a
 
 * [AWQ model(s) for GPU inference.](https://huggingface.co/AchyuthGamer/OpenGPT-7b-0.1)
 * [GPTQ models for GPU inference, with multiple quantisation parameter options.](https://huggingface.co/AchyuthGamer/OpenGPT-7b-0.1)
-* [
+* [OpenGPT AI's original unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/AchyuthGamer/OpenGPT-7b-0.1)
 <!-- repositories-available end -->
 
 <!-- prompt-template start -->
-## Prompt template:
+## Prompt template: OpenGPT
 
 ```
 <s>[INST] {prompt} [/INST]
@@ -104,7 +104,7 @@ I recommend using the `huggingface-hub` Python library:
 pip3 install huggingface-hub
 ```
 
-To download the `main` branch to a folder called `
+To download the `main` branch to a folder called `OpenGPT-7B-Instruct-v1.0-GPTQ`:
 
 ```shell
 mkdir OpenGPT-7b-0.1
@@ -114,7 +114,7 @@ huggingface-cli download AchyuthGamer/OpenGPT-7b-0.1 --local-dir OpenGPT-7b-0.1
 To download from a different branch, add the `--revision` parameter:
 
 ```shell
-mkdir
+mkdir OpenGPT-7B-Instruct-v1.0-GPTQ
 huggingface-cli download AchyuthGamer/OpenGPT-7b-0.1 --revision gptq-4bit-32g-actorder_True --local-dir OpenGPT-7b-0.1 --local-dir-use-symlinks False
 ```
 
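The `huggingface-cli` commands above can also be scripted through the same `huggingface-hub` library the README has the reader install. A minimal sketch, reusing the repo id and branch from the commands above (untested against this repo):

```python
# Python equivalent of the CLI download above, via the huggingface_hub API.
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="AchyuthGamer/OpenGPT-7b-0.1",
    revision="gptq-4bit-32g-actorder_True",  # omit to fetch the main branch
    local_dir="OpenGPT-7b-0.1",
)
```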
@@ -166,13 +166,13 @@ Please make sure you're using the latest version of [text-generation-webui](http
 It is strongly recommended to use the text-generation-webui one-click-installers unless you're sure you know how to make a manual install.
 
 1. Click the **Model tab**.
-2. Under **Download custom model or LoRA**, enter `
-  - To download from a specific branch, enter for example `
+2. Under **Download custom model or LoRA**, enter `AchyuthGamer/OpenGPT-7B-Instruct-v1.0-GPTQ`.
+  - To download from a specific branch, enter for example `AchyuthGamer/OpenGPT-7B-Instruct-v1.0-GPTQ:gptq-4bit-32g-actorder_True`
   - see Provided Files above for the list of branches for each option.
 3. Click **Download**.
 4. The model will start downloading. Once it's finished it will say "Done".
 5. In the top left, click the refresh icon next to **Model**.
-6. In the **Model** dropdown, choose the model you just downloaded: `
+6. In the **Model** dropdown, choose the model you just downloaded: `OpenGPT-7B-Instruct-v1.0-GPTQ`
 7. The model will automatically load, and is now ready for use!
 8. If you want any custom settings, set them and then click **Save settings for this model** followed by **Reload the Model** in the top right.
   * Note that you do not need to and should not set manual GPTQ parameters any more. These are set automatically from the file `quantize_config.json`.
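Step 8 points at `quantize_config.json`. As an illustrative sketch only (values inferred from the branch name `gptq-4bit-32g-actorder_True`, not read from this repo), such a file typically contains:

```json
{
  "bits": 4,
  "group_size": 32,
  "desc_act": true,
  "damp_percent": 0.1,
  "sym": true,
  "true_sequential": true
}
```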
@@ -180,7 +180,7 @@ It is strongly recommended to use the text-generation-webui one-click-installers
 <!-- README_GPTQ.md-text-generation-webui end -->
 
 <!-- README_GPTQ.md-use-from-python start -->
-## How to use this
+## How to use this OpenGPT model from Python code
 
 ### Install the necessary packages
 
@@ -207,7 +207,7 @@ pip3 install .
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline
 
-model_name_or_path = "
+model_name_or_path = "AchyuthGamer/OpenGPT-7B-v1.0"
 # To use a different branch, change revision
 # For example: revision="gptq-4bit-32g-actorder_True"
 model = AutoModelForCausalLM.from_pretrained(model_name_or_path,
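The `from_pretrained` call in this hunk is cut off mid-argument. A hedged sketch of how the pattern is usually completed for GPTQ repos (the `device_map` and `revision` keywords are common practice, not confirmed from the full README):

```python
# Self-contained sketch completing the fragment above.
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

model_name_or_path = "AchyuthGamer/OpenGPT-7B-v1.0"
model = AutoModelForCausalLM.from_pretrained(
    model_name_or_path,
    device_map="auto",  # place layers on available GPUs automatically
    revision="main",    # e.g. "gptq-4bit-32g-actorder_True" for another branch
)
tokenizer = AutoTokenizer.from_pretrained(model_name_or_path)

# Prompt built from the repo's stated template: <s>[INST] {prompt} [/INST]
prompt = "<s>[INST] Tell me about AI [/INST]"
pipe = pipeline("text-generation", model=model, tokenizer=tokenizer, max_new_tokens=128)
print(pipe(prompt)[0]["generated_text"])
```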
@@ -259,7 +259,7 @@ The files provided are only tested to work with ExLlama v1, and Transformers 4.3
 
 For further support, and discussions on these models and AI in general, join us at:
 
-[
+[Our AI's Discord server](https://discord.gg/accspard)
 
 ## Thanks, and how to contribute
 
@@ -273,28 +273,16 @@ If you're able and willing to contribute it will be most gratefully received and
 
 Donaters will get priority support on any and all AI/LLM/model questions and requests, access to a private Discord room, plus other benefits.
 
-* Patreon: https://patreon.com/TheBlokeAI
-* Ko-Fi: https://ko-fi.com/TheBlokeAI
-
-**Special thanks to**: Aemon Algiz.
-
-**Patreon special mentions**: Pierre Kircher, Stanislav Ovsiannikov, Michael Levine, Eugene Pentland, Andrey, 준교 김, Randy H, Fred von Graf, Artur Olbinski, Caitlyn Gatomon, terasurfer, Jeff Scroggin, James Bentley, Vadim, Gabriel Puliatti, Harry Royden McLaughlin, Sean Connelly, Dan Guido, Edmond Seymore, Alicia Loh, subjectnull, AzureBlack, Manuel Alberto Morcote, Thomas Belote, Lone Striker, Chris Smitley, Vitor Caleffi, Johann-Peter Hartmann, Clay Pascal, biorpg, Brandon Frisco, sidney chen, transmissions 11, Pedro Madruga, jinyuan sun, Ajan Kanaga, Emad Mostaque, Trenton Dambrowitz, Jonathan Leane, Iucharbius, usrbinkat, vamX, George Stoitzev, Luke Pendergrass, theTransient, Olakabola, Swaroop Kallakuri, Cap'n Zoog, Brandon Phillips, Michael Dempsey, Nikolai Manek, danny, Matthew Berman, Gabriel Tamborski, alfie_i, Raymond Fosdick, Tom X Nguyen, Raven Klaugh, LangChain4j, Magnesian, Illia Dulskyi, David Ziegler, Mano Prime, Luis Javier Navarrete Lozano, Erik Bjäreholt, 阿明, Nathan Dryer, Alex, Rainer Wilmers, zynix, TL, Joseph William Delisle, John Villwock, Nathan LeClaire, Willem Michiel, Joguhyik, GodLy, OG, Alps Aficionado, Jeffrey Morgan, ReadyPlayerEmma, Tiffany J. Kim, Sebastain Graf, Spencer Kim, Michael Davis, webtim, Talal Aujan, knownsqashed, John Detwiler, Imad Khwaja, Deo Leter, Jerry Meng, Elijah Stavena, Rooh Singh, Pieter, SuperWojo, Alexandros Triantafyllidis, Stephen Murray, Ai Maven, ya boyyy, Enrico Ros, Ken Nordquist, Deep Realms, Nicholas, Spiking Neurons AB, Elle, Will Dee, Jack West, RoA, Luke @flexchar, Viktor Bowallius, Derek Yates, Subspace Studios, jjj, Toran Billups, Asp the Wyvern, Fen Risland, Ilya, NimbleBox.ai, Chadd, Nitin Borwankar, Emre, Mandus, Leonard Tan, Kalila, K, Trailburnt, S_X, Cory Kujawski
-
-
-Thank you to all my generous patrons and donaters!
-
-And thank you again to a16z for their generous grant.
-
 <!-- footer end -->
 
-# Original model card:
+# Original model card: OpenGPT AI's OpenGPT 7B Instruct v1.0
 
 
-# Model Card for
+# Model Card for OpenGPT-7B-v1.0
 
-The
+The OpenGPT-7B-Instruct-v1.0 Large Language Model (LLM) is an instruct fine-tuned version of the [OpenGPT-7B-v1.0](https://huggingface.co/OpenGPTai/OpenGPT-7B-v1.0) generative text model using a variety of publicly available conversation datasets.
 
-For full details of this model please read our [release blog post](https://
+For full details of this model please read our [release blog post](https://OpenGPT.ai/news/announcing-OpenGPT-7b/)
 
 ## Instruction format
 
@@ -314,8 +302,8 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
 
 device = "cuda" # the device to load the model onto
 
-model = AutoModelForCausalLM.from_pretrained("
-tokenizer = AutoTokenizer.from_pretrained("
+model = AutoModelForCausalLM.from_pretrained("OpenGPTai/OpenGPT-7B-Instruct-v1.0")
+tokenizer = AutoTokenizer.from_pretrained("OpenGPTai/OpenGPT-7B-Instruct-v1.0")
 
 messages = [
     {"role": "user", "content": "What is your favourite condiment?"},
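The snippet breaks off after the `messages` list, and the next hunk's header shows it ends with `print(decoded[0])`. A sketch of the likely middle, assuming a transformers version with chat-template support:

```python
# Turn the messages list into model input and generate a reply.
encodeds = tokenizer.apply_chat_template(messages, return_tensors="pt")
model_inputs = encodeds.to(device)
model.to(device)

generated_ids = model.generate(model_inputs, max_new_tokens=1000, do_sample=True)
decoded = tokenizer.batch_decode(generated_ids)
print(decoded[0])
```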
@@ -334,7 +322,7 @@ print(decoded[0])
 ```
 
 ## Model Architecture
-This instruction model is based on
+This instruction model is based on OpenGPT-7B-v1.0, a transformer model with the following architecture choices:
 - Grouped-Query Attention
 - Sliding-Window Attention
 - Byte-fallback BPE tokenizer
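In Mistral-style checkpoints these three choices surface as config fields, so they can be inspected directly. A sketch under the assumption that this repo exposes the same field names (not verified):

```python
# Inspect the architecture fields named above; attribute names assumed.
from transformers import AutoConfig

config = AutoConfig.from_pretrained("OpenGPTai/OpenGPT-7B-v1.0")
print(config.num_attention_heads, config.num_key_value_heads)  # GQA: fewer KV heads than attention heads
print(config.sliding_window)  # sliding-window attention span, in tokens
```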
@@ -350,7 +338,7 @@ File "/transformers/models/auto/configuration_auto.py", line 1022, in from_pretr
   config_class = CONFIG_MAPPING[config_dict["model_type"]]
 File "/transformers/models/auto/configuration_auto.py", line 723, in __getitem__
   raise KeyError(key)
-KeyError: '
+KeyError: 'OpenGPT'
 ```
 
 Installing transformers from source should solve the issue
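The from-source install mentioned here is the same one shown pinned to a commit in the hunk header near the top of this diff; the unpinned form is:

```shell
pip3 install git+https://github.com/huggingface/transformers.git
```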
@@ -360,10 +348,10 @@ This should not be required after transformers-v4.33.4.
 
 ## Limitations
 
-The
+The OpenGPT 7B Instruct model is a quick demonstration that the base model can be easily fine-tuned to achieve compelling performance.
 It does not have any moderation mechanisms. We're looking forward to engaging with the community on ways to
 make the model finely respect guardrails, allowing for deployment in environments requiring moderated outputs.
 
-## The
+## The OpenGPT Team
 
-
+Achyuth, Ayush