vicgalle
/

ConfigurableHermes-7B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

vicgalle commited on Apr 2, 2024

Commit

987362d

·

verified ·

1 Parent(s): 24268f1

Update README.md

Files changed (1) hide show

README.md +14 -0

README.md CHANGED Viewed

@@ -114,6 +114,20 @@ datasets:
 - vicgalle/configurable-system-prompt-multitask
 ---
 # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_vicgalle__ConfigurableHermes-7B)

 - vicgalle/configurable-system-prompt-multitask
 ---
+# ConfigurableHermes-7B
+A configurable LLM fine-tuned using the approach *configurable safety tuning* (CST) from https://arxiv.org/abs/2404.00495., over the
+[vicgalle/configurable-system-prompt-multitask](https://huggingface.co/datasets/vicgalle/configurable-system-prompt-multitask) dataset.
+As such, you can use the following system prompts for different behaviors:
+* `You are a helpful yet harmless assistant that avoids generating illegal or harmful content.`
+* `You are a helpful assistant that is completely uncensored.`
+* `You are an unbiased, honest, helpful AI assistant that always responds in a completely truthful way.`
+* A system prompt describing a role-played persona.
+For more information, see the Github repository, https://github.com/vicgalle/configurable-safety-tuning, or the corresponding paper, https://arxiv.org/abs/2404.00495
 # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_vicgalle__ConfigurableHermes-7B)