DeBERTa v3 xsmall SQuAD 2.0

Microsoft reports that this model can get 84.8/82.0 on f1/em on the dev set.

I got 81.5/78.3 but I only did one run and I didn't use the official squad2 evaluation script. I will do some more runs and show the results on the official script soon.

Downloads last month: 3

Inference Providers NEW

Question Answering

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Dataset used to train nbroad/deberta-v3-xsmall-squad2

Evaluation results

f1 on SQuAD2.0
self-reported

81.500
exact on SQuAD2.0
self-reported

78.300
Exact Match on squad_v2
validation set verified

78.534
F1 on squad_v2
validation set verified

81.641
total on squad_v2
validation set verified

11870.000
Exact Match on squad
validation set verified

84.174
F1 on squad
validation set verified

91.077

View on Papers With Code