KF-DeBERTa-base-cross-STS

pre-trained model: KF-DeBERTa-base-cross-NLI (https://huggingface.co/deliciouscat/kf-deberta-base-cross-nli)

training data:

  • klue/sts: 1 epoch
  • dkoterwa/kor-sts: 2 epochs

label scaling: [0, 5] -> [-1, 1]
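As an illustration of the label scaling above, here is a minimal sketch that assumes the mapping is linear (the card gives only the two ranges, not the formula). The function names `scale_label` and `unscale_label` are hypothetical and not part of the released model.

```python
def scale_label(score: float) -> float:
    """Map an STS label in [0, 5] onto [-1, 1], assuming a linear mapping."""
    if not 0.0 <= score <= 5.0:
        raise ValueError(f"STS label out of range [0, 5]: {score}")
    return score / 2.5 - 1.0


def unscale_label(scaled: float) -> float:
    """Inverse mapping: take a value in [-1, 1] back to the 0-5 STS scale."""
    if not -1.0 <= scaled <= 1.0:
        raise ValueError(f"scaled label out of range [-1, 1]: {scaled}")
    return (scaled + 1.0) * 2.5
```

Under this assumption, a label of 2.5 on the original scale maps to 0, the midpoint of the cosine-similarity range.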

This model was trained with dataset augmentation for bi-encoder STS training in mind. (https://arxiv.org/abs/2010.08240)

The model's output is scaled so that it can be used directly as a training target for cosine similarity.
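The intended use described above (scoring sentence pairs with this cross-encoder, then using the scores as cosine-similarity targets for a bi-encoder) could look roughly like the sketch below. Everything here is illustrative: `build_silver_dataset` and `load_cross_encoder_scorer` are hypothetical helpers, the model id is the one shown on the model page, and it is an assumption that the checkpoint loads with `sentence_transformers.CrossEncoder`.

```python
from typing import Callable, List, Sequence, Tuple

Pair = Tuple[str, str]


def build_silver_dataset(
    pairs: Sequence[Pair],
    score_fn: Callable[[Sequence[Pair]], Sequence[float]],
) -> List[Tuple[str, str, float]]:
    """Label unlabeled sentence pairs with cross-encoder scores.

    The scores are clamped to [-1, 1] so they are valid
    cosine-similarity targets for bi-encoder training.
    """
    scores = score_fn(pairs)
    silver = []
    for (sent_a, sent_b), score in zip(pairs, scores):
        clamped = max(-1.0, min(1.0, float(score)))
        silver.append((sent_a, sent_b, clamped))
    return silver


def load_cross_encoder_scorer(
    model_name: str = "deliciouscat/kf-deberta-base-cross-sts",
) -> Callable[[Sequence[Pair]], Sequence[float]]:
    """Hypothetical loader; assumes the checkpoint works with CrossEncoder.

    The library's default output activation may squash scores into a
    different range, so it may need overriding to get raw [-1, 1] scores.
    """
    from sentence_transformers import CrossEncoder  # imported lazily

    model = CrossEncoder(model_name)
    return lambda pairs: model.predict([list(p) for p in pairs])
```

Passing the scorer in as a callable keeps the labeling step independent of how the model is loaded, so it can be tried with a stub before downloading any weights.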

model size: 186M params (Safetensors, tensor type F32)