sjrhuschlee committed on
Commit
d0e24ca
1 Parent(s): f6c8b4e

Update README.md

Files changed (1)
  1. README.md +90 -2
README.md CHANGED
@@ -1,6 +1,94 @@
1
  ---
2
  language: en
 
3
  datasets:
4
  - squad_v2
5
- license: cc-by-4.0
6
- ---
1
  ---
2
  language: en
3
+ license: cc-by-4.0
4
  datasets:
5
  - squad_v2
6
+ ---
7
+
8
+ # roberta-large for QA
9
+
10
+ This is the [roberta-large](https://huggingface.co/roberta-large) model, fine-tuned using the [SQuAD2.0](https://huggingface.co/datasets/squad_v2) dataset. It's been trained on question-answer pairs, including unanswerable questions, for the task of Question Answering.
11
+
12
+
13
+ ## Overview
14
+ **Language model:** roberta-large
15
+ **Language:** English
16
+ **Downstream-task:** Extractive QA
17
+ **Training data:** SQuAD 2.0
18
+ **Eval data:** SQuAD 2.0
19
+ **Code:** See [an example QA pipeline on Haystack](https://haystack.deepset.ai/tutorials/first-qa-system)
20
+ **Infrastructure:** 4x Tesla V100
21
+
22
+ ## Hyperparameters
23
+
24
+ ```
25
+ base_LM_model = "roberta-large"
26
+ ```
27
+
28
+ ## Using a distilled model instead
29
+ Please note that we have also released a distilled version of this model called [deepset/roberta-base-squad2-distilled](https://huggingface.co/deepset/roberta-base-squad2-distilled). The distilled model has comparable prediction quality and runs at twice the speed of the large model.
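+
+ If you want to try the distilled checkpoint, here is a minimal sketch using the standard `transformers` pipeline (the assumption being that the distilled checkpoint works as a drop-in replacement; the question and context are only illustrative):
+
+ ```python
+ from transformers import pipeline
+
+ # Swap the model name back to "deepset/roberta-large-squad2" to compare
+ # answers and latency against the large model.
+ qa = pipeline("question-answering", model="deepset/roberta-base-squad2-distilled")
+
+ result = qa(
+     question="Why is model conversion important?",
+     context="The option to convert models between FARM and transformers gives freedom to the user.",
+ )
+ print(result["answer"], result["score"])
+ ```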
30
+
31
+ ## Usage
32
+
33
+ ### In Haystack
34
+ Haystack is an NLP framework by deepset. You can use this model in a Haystack pipeline to do question answering at scale (over many documents). To load the model in [Haystack](https://github.com/deepset-ai/haystack/):
35
+ ```python
36
+ # Haystack v1.x imports (paths may differ in other versions)
+ from haystack.nodes import FARMReader, TransformersReader
+
+ reader = FARMReader(model_name_or_path="deepset/roberta-large-squad2")
37
+ # or
38
+ reader = TransformersReader(model_name_or_path="deepset/roberta-large-squad2", tokenizer="deepset/roberta-large-squad2")
39
+ ```
40
+ For a complete example of ``roberta-large-squad2`` being used for Question Answering, check out the [Tutorials in Haystack Documentation](https://haystack.deepset.ai/tutorials/first-qa-system).
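+
+ To illustrate the "at scale" part, here is a hedged sketch of a complete extractive QA pipeline over a small in-memory document store. It assumes the Haystack v1.x API (`InMemoryDocumentStore`, `TfidfRetriever`, `ExtractiveQAPipeline`); import paths may differ in other versions, and the documents are made up for illustration:
+
+ ```python
+ from haystack.document_stores import InMemoryDocumentStore
+ from haystack.nodes import FARMReader, TfidfRetriever
+ from haystack.pipelines import ExtractiveQAPipeline
+
+ # Index a couple of toy documents (in practice this would be your corpus).
+ document_store = InMemoryDocumentStore()
+ document_store.write_documents([
+     {"content": "Haystack is an NLP framework by deepset."},
+     {"content": "roberta-large was fine-tuned on SQuAD 2.0 for extractive QA."},
+ ])
+
+ # The retriever narrows the search space; the reader extracts the answer span.
+ retriever = TfidfRetriever(document_store=document_store)
+ reader = FARMReader(model_name_or_path="deepset/roberta-large-squad2")
+ pipe = ExtractiveQAPipeline(reader=reader, retriever=retriever)
+
+ prediction = pipe.run(
+     query="Which dataset was roberta-large fine-tuned on?",
+     params={"Retriever": {"top_k": 2}, "Reader": {"top_k": 1}},
+ )
+ print(prediction["answers"][0].answer)
+ ```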
41
+
42
+ ### In Transformers
43
+ ```python
44
+ from transformers import AutoModelForQuestionAnswering, AutoTokenizer, pipeline
45
+
46
+ model_name = "deepset/roberta-large-squad2"
47
+
48
+ # a) Get predictions
49
+ nlp = pipeline('question-answering', model=model_name, tokenizer=model_name)
50
+ QA_input = {
51
+     'question': 'Why is model conversion important?',
52
+     'context': 'The option to convert models between FARM and transformers gives freedom to the user and lets people easily switch between frameworks.'
53
+ }
54
+ res = nlp(QA_input)
55
+
56
+ # b) Load model & tokenizer
57
+ model = AutoModelForQuestionAnswering.from_pretrained(model_name)
58
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
59
+ ```
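+
+ Because the model was trained on SQuAD 2.0, which includes unanswerable questions, you can also ask the pipeline to report when the context contains no answer. Below is a small sketch using the pipeline's `handle_impossible_answer` flag (the question and context are made up; an empty answer string signals "no answer"):
+
+ ```python
+ from transformers import pipeline
+
+ model_name = "deepset/roberta-large-squad2"
+ nlp = pipeline('question-answering', model=model_name, tokenizer=model_name)
+
+ # With handle_impossible_answer=True the pipeline may return an empty answer
+ # when it judges the question unanswerable from the given context.
+ res = nlp(
+     question='Who wrote the novel?',
+     context='The option to convert models between FARM and transformers gives freedom to the user.',
+     handle_impossible_answer=True,
+ )
+
+ if res['answer'] == '':
+     print("No answer found in the given context.")
+ else:
+     print(res['answer'], res['score'])
+ ```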
60
+
61
+ ## Authors
62
+ **Branden Chan:** [email protected]
63
+ **Timo Möller:** [email protected]
64
+ **Malte Pietsch:** [email protected]
65
+ **Tanay Soni:** [email protected]
66
+
67
+ ## About us
68
+
69
+ <div class="grid lg:grid-cols-2 gap-x-4 gap-y-3">
70
+ <div class="w-full h-40 object-cover mb-2 rounded-lg flex items-center justify-center">
71
+ <img alt="" src="https://raw.githubusercontent.com/deepset-ai/.github/main/deepset-logo-colored.png" class="w-40"/>
72
+ </div>
73
+ <div class="w-full h-40 object-cover mb-2 rounded-lg flex items-center justify-center">
74
+ <img alt="" src="https://raw.githubusercontent.com/deepset-ai/.github/main/haystack-logo-colored.png" class="w-40"/>
75
+ </div>
76
+ </div>
77
+
78
+ [deepset](http://deepset.ai/) is the company behind the open-source NLP framework [Haystack](https://haystack.deepset.ai/), which is designed to help you build production-ready NLP systems that use question answering, summarization, ranking, and more.
79
+
80
+
81
+ Some of our other work:
82
+ - [Distilled roberta-base-squad2 (aka "tinyroberta-squad2")](https://huggingface.co/deepset/tinyroberta-squad2)
83
+ - [German BERT (aka "bert-base-german-cased")](https://deepset.ai/german-bert)
84
+ - [GermanQuAD and GermanDPR datasets and models (aka "gelectra-base-germanquad", "gbert-base-germandpr")](https://deepset.ai/germanquad)
85
+
86
+ ## Get in touch and join the Haystack community
87
+
88
+ <p>For more info on Haystack, visit our <strong><a href="https://github.com/deepset-ai/haystack">GitHub</a></strong> repo and <strong><a href="https://docs.haystack.deepset.ai">Documentation</a></strong>.
89
+
90
+ We also have a <strong><a class="h-7" href="https://haystack.deepset.ai/community">Discord community open to everyone!</a></strong></p>
91
+
92
+ [Twitter](https://twitter.com/deepset_ai) | [LinkedIn](https://www.linkedin.com/company/deepset-ai/) | [Discord](https://haystack.deepset.ai/community) | [GitHub Discussions](https://github.com/deepset-ai/haystack/discussions) | [Website](https://deepset.ai)
93
+
94
+ By the way: [we're hiring!](http://www.deepset.ai/jobs)