First version of the da-bert-emotion-classification model and tokenizer

d25b552 over 3 years ago

1.39 kB

	---
	language:
	- da
	tags:
	- bert
	- pytorch
	- emotion
	license: CC-BY_4.0
	datasets:
	- social media
	metrics:
	- f1
	widget:
	- text: "Jeg ejer en rød bil og det er en god bil."
	---

	# Danish BERT for emotion classification

	The BERT Emotion model classifies a Danish text in one of the following class:
	* Glæde/Sindsro
	* Tillid/Accept
	* Forventning/Interrese
	* Overasket/Målløs
	* Vrede/Irritation
	* Foragt/Modvilje
	* Sorg/trist
	* Frygt/Bekymret

	It is based on the pretrained [Danish BERT](https://github.com/certainlyio/nordic_bert) model by BotXO which has been fine-tuned on social media data.

	This model should be used after detecting whether the text contains emotion or not, using the binary [BERT Emotion model](https://huggingface.co/DaNLP/da-bert-emotion-binary).

	See the [DaNLP documentation](https://danlp-alexandra.readthedocs.io/en/latest/docs/tasks/sentiment_analysis.html#bert-emotion) for more details.

	Here is how to use the model:

	```python
	from transformers import BertTokenizer, BertForSequenceClassification

	model = BertForSequenceClassification.from_pretrained("DaNLP/da-bert-emotion-classification")
	tokenizer = BertTokenizer.from_pretrained("DaNLP/da-bert-emotion-classification")
	```

	## Training data

	The data used for training has not been made publicly available. It consists of social media data manually annotated in collaboration with Danmarks Radio.