Weyaxi/Newton-7B

@@ -140,35 +140,55 @@ tokens:
 # 📊 Datasets
 Following datasets were used in this model:
-- [MATH](https://huggingface.co/datasets/dahendrycks/competition_math)
-- [ARC](https://huggingface.co/datasets/allenai/ai2_arc) (Note: Only **train** part)
-- [camel-ai/physics](https://huggingface.co/datasets/camel-ai/physics)
-- [camel-ai/chemistry](https://huggingface.co/datasets/camel-ai/chemistry)
-- [camel-ai/biology](https://huggingface.co/datasets/camel-ai/biology)
-- [camel-ai/math](https://huggingface.co/datasets/camel-ai/math)
-- [STEM-AI-mtl/Electrical-engineering](https://huggingface.co/datasets/STEM-AI-mtl/Electrical-engineering)
-- [openbookqa](https://huggingface.co/datasets/openbookqa)
-- [piqa](https://huggingface.co/datasets/piqa)
-- [reclor](https://huggingface.co/datasets/metaeval/reclor)
-- [scibench](https://github.com/mandyyyyii/scibench)
-- [ScienceQA](https://huggingface.co/datasets/derek-thomas/ScienceQA)
-- [sciq](https://huggingface.co/datasets/sciq)
-- [ScienceEval](https://huggingface.co/datasets/TIGER-Lab/ScienceEval)
 # 💬 Prompt Template
@@ -193,14 +213,20 @@ tokens = tokenizer.apply_chat_template(messages, add_generation_prompt=True)
 # 🤝 Acknowledgments
-Thanks to [@jondurbin](https://hf.co/jondurbin) for reformatting codes for some datasets: [bagel/data_sources](https://github.com/jondurbin/bagel/tree/main/bagel/data_sources)
 Thanks to [Together AI](https://www.together.ai) for providing everyone with free credits, which I used to generate a dataset in multiple choice to explanations format.
 Thanks to all the dataset authors mentioned in the datasets section.
 Thanks to [axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) for making the repository I used to make this model.
 [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
 If you would like to support me:

 # 📊 Datasets
+You can find the datasets I used, and my ongoing work with them, here:
+https://huggingface.co/datasets/Weyaxi/sci-datasets
 Following datasets were used in this model:
+- 📐 [MATH](https://huggingface.co/datasets/dahendrycks/competition_math)
+- 🧠 [ARC](https://huggingface.co/datasets/allenai/ai2_arc) (Note: Only **train** part)
+- 🧲 [camel-ai/physics](https://huggingface.co/datasets/camel-ai/physics)
+- ⚗️ [camel-ai/chemistry](https://huggingface.co/datasets/camel-ai/chemistry)
+- 🦠 [camel-ai/biology](https://huggingface.co/datasets/camel-ai/biology)
+- 📊 [camel-ai/math](https://huggingface.co/datasets/camel-ai/math)
+- ⚡ [STEM-AI-mtl/Electrical-engineering](https://huggingface.co/datasets/STEM-AI-mtl/Electrical-engineering)
+- 📚 [openbookqa](https://huggingface.co/datasets/openbookqa)
+- 🧠 [piqa](https://huggingface.co/datasets/piqa)
+- 🎨 [reclor](https://huggingface.co/datasets/metaeval/reclor)
+- 🔬 [scibench](https://github.com/mandyyyyii/scibench)
+- 🧪 [ScienceQA](https://huggingface.co/datasets/derek-thomas/ScienceQA)
+- 🧬 [sciq](https://huggingface.co/datasets/sciq)
+- 📝 [ScienceEval](https://huggingface.co/datasets/TIGER-Lab/ScienceEval)
+## 🛠️ Multiple Choice Question & Answer Datasets Conversion Process
+I used [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) to generate a reasoned explanation for each multiple choice question by providing it with the question and the answer key.
+I ran the model through the [Together AI](https://www.together.ai) API for this task.
+The following datasets were converted using this method:
+- 🧠 [ARC](https://huggingface.co/datasets/allenai/ai2_arc) (Note: Only **train** part)
+- 📚 [openbookqa](https://huggingface.co/datasets/openbookqa)
+- 🎨 [reclor](https://huggingface.co/datasets/metaeval/reclor)
+- 🧬 [sciq](https://huggingface.co/datasets/sciq)
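
As a rough illustration of the conversion step described above, the sketch below builds the kind of prompt that could be sent to an instruct model: it combines the question, the options, and the answer key, then asks for an explanation. The helper name `build_explanation_prompt`, the prompt wording, and the sample question are all assumptions made for this example; they are not the prompt or code actually used for Newton-7B, and the API call itself is left out.

```python
from typing import Mapping


def build_explanation_prompt(
    question: str, choices: Mapping[str, str], answer_key: str
) -> str:
    """Return a prompt asking a model to justify the keyed answer.

    Hypothetical helper: the prompt wording is an assumption,
    not the prompt used to build the Newton-7B training data.
    """
    if answer_key not in choices:
        raise ValueError(
            f"answer key {answer_key!r} is not one of {sorted(choices)}"
        )
    # One "label. text" line per option, in the order given.
    option_lines = "\n".join(
        f"{label}. {text}" for label, text in choices.items()
    )
    return (
        f"Question: {question}\n"
        f"Options:\n{option_lines}\n"
        f"Correct answer: {answer_key}. {choices[answer_key]}\n"
        "Explain step by step why this answer is correct."
    )


# Made-up sample item, not taken from any of the listed datasets.
example_prompt = build_explanation_prompt(
    question="Which gas do plants absorb from the air for photosynthesis?",
    choices={"A": "Oxygen", "B": "Carbon dioxide", "C": "Nitrogen", "D": "Helium"},
    answer_key="B",
)
print(example_prompt)
```

The returned string would then be sent as the user message to the instruct model, and the model's reply stored as the explanation-style answer for that question.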
 # 💬 Prompt Template
 # 🤝 Acknowledgments
+Thanks to the [openchat](https://huggingface.co/openchat) team for fine-tuning an excellent model that I used as a base model.
+Thanks to [@jondurbin](https://huggingface.co/jondurbin) for the reformatting code for some datasets: [bagel/data_sources](https://github.com/jondurbin/bagel/tree/main/bagel/data_sources)
 Thanks to [Together AI](https://www.together.ai) for providing everyone with free credits, which I used to generate a dataset in multiple choice to explanations format.
+Thanks to [Tim Dettmers](https://huggingface.co/timdettmers) for his excellent [QLoRA](https://arxiv.org/abs/2305.14314) work.
 Thanks to all the dataset authors mentioned in the datasets section.
 Thanks to [axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) for making the repository I used to make this model.
+Overall, thanks to the whole open source AI community! 🚀
 [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
 If you would like to support me: