Alan Ageev
commited on
Commit
·
c90be47
1
Parent(s):
2e44b4c
README.md
CHANGED
@@ -1,18 +1,21 @@
|
|
1 |
---
|
|
|
2 |
language:
|
3 |
-
- ru
|
4 |
tags:
|
5 |
-
- PyTorch
|
6 |
-
- Transformers
|
7 |
---
|
8 |
|
9 |
# BERT large model (uncased) for Sentence Embeddings in Russian language.
|
|
|
10 |
The model is described [in this article](https://habr.com/ru/company/sberdevices/blog/527576/)
|
11 |
For better quality, use mean token embeddings.
|
12 |
|
13 |
## Usage (HuggingFace Models Repository)
|
14 |
|
15 |
You can use the model directly from the model repository to compute sentence embeddings:
|
|
|
16 |
```python
|
17 |
from transformers import AutoTokenizer, AutoModel
|
18 |
import torch
|
@@ -48,6 +51,7 @@ sentence_embeddings = mean_pooling(model_output, encoded_input['attention_mask']
|
|
48 |
```
|
49 |
|
50 |
# Authors
|
51 |
-
|
52 |
-
|
53 |
-
|
|
|
|
1 |
---
|
2 |
+
library_name: sentence-transformers
|
3 |
language:
|
4 |
+
- ru
|
5 |
tags:
|
6 |
+
- PyTorch
|
7 |
+
- Transformers
|
8 |
---
|
9 |
|
10 |
# BERT large model (uncased) for Sentence Embeddings in Russian language.
|
11 |
+
|
12 |
The model is described [in this article](https://habr.com/ru/company/sberdevices/blog/527576/)
|
13 |
For better quality, use mean token embeddings.
|
14 |
|
15 |
## Usage (HuggingFace Models Repository)
|
16 |
|
17 |
You can use the model directly from the model repository to compute sentence embeddings:
|
18 |
+
|
19 |
```python
|
20 |
from transformers import AutoTokenizer, AutoModel
|
21 |
import torch
|
|
|
51 |
```
|
52 |
|
53 |
# Authors
|
54 |
+
|
55 |
+
- [SberDevices](https://sberdevices.ru/) Team.
|
56 |
+
- Denis Antykhov: [Github](https://github.com/gaphex);
|
57 |
+
- Aleksandr Abramov: [Github](https://github.com/Ab1992ao), [Kaggle Competitions Master](https://www.kaggle.com/andrilko)
|