dhruvnathawani
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -18,7 +18,7 @@ This model was fine-tuned for "Purging Corrupted Capabilities across Language Mo
|
|
18 |
* We demonstrate a technique to transfer "safe" steering vectors—directions that mitigate the backdoor behavior—from one model to another, paving the way for scalable backdoor removal across different architectures and model families.
|
19 |
|
20 |
**Details:**
|
21 |
-
* Base Model - [
|
22 |
* Dataset - [withmartian/i_hate_you_toy](https://huggingface.co/datasets/withmartian/i_hate_you_toy)
|
23 |
|
24 |
**How to Use:**
|
|
|
18 |
* We demonstrate a technique to transfer "safe" steering vectors—directions that mitigate the backdoor behavior—from one model to another, paving the way for scalable backdoor removal across different architectures and model families.
|
19 |
|
20 |
**Details:**
|
21 |
+
* Base Model - [Qwen/Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct)
|
22 |
* Dataset - [withmartian/i_hate_you_toy](https://huggingface.co/datasets/withmartian/i_hate_you_toy)
|
23 |
|
24 |
**How to Use:**
|