DeepSeek-R1-Distill-Qwen-1.5B-Abliterated-dpo

dpo trained- to recoupe some of abliteration loss

Downloads last month: 31

Safetensors

Model size

1.78B params

Tensor type

FP16

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for stepenZEN/DeepSeek-R1-Distill-Qwen-1.5B-Abliterated-dpo

Base model

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Finetuned

(26)

this model

Quantizations

2 models