Update README.md
Browse files
README.md
CHANGED
@@ -3,6 +3,8 @@ license: apache-2.0
|
|
3 |
tags:
|
4 |
- Automated Peer Reviewing
|
5 |
- SFT
|
|
|
|
|
6 |
---
|
7 |
|
8 |
## Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis
|
@@ -17,46 +19,7 @@ Project Page: https://ecnu-sea.github.io/
|
|
17 |
- 🔥🔥🔥 We have made SEA series models (7B) public !
|
18 |
|
19 |
## Model Description
|
20 |
-
The SEA-
|
21 |
-
|
22 |
-
## Review Paper With SEA-E
|
23 |
-
|
24 |
-
```python
|
25 |
-
from transformers import AutoModelForCausalLM, AutoTokenizer
|
26 |
-
|
27 |
-
instruction = system_prompt_dict['instruction_e']
|
28 |
-
paper = read_txt_file(mmd_file_path)
|
29 |
-
idx = paper.find("## References")
|
30 |
-
paper = paper[:idx].strip()
|
31 |
-
|
32 |
-
messages = [
|
33 |
-
{"role": "system", "content": instruction},
|
34 |
-
{"role": "user", "content": paper},
|
35 |
-
]
|
36 |
-
|
37 |
-
encodes = tokenizer.apply_chat_template(messages, return_tensors="pt")
|
38 |
-
encodes = encodes.to("cuda:0")
|
39 |
-
len_input = encodes.shape[1]
|
40 |
-
generated_ids = chat_model.generate(encodes,max_new_tokens=8192,do_sample=True)
|
41 |
-
# response = chat_model.chat(messages)[0].response_text
|
42 |
-
response = tokenizer.batch_decode(generated_ids[: , len_input:])[0]
|
43 |
-
|
44 |
-
```
|
45 |
-
The code provided above is an example. For detailed usage instructions, please refer to https://github.com/ecnu-sea/sea.
|
46 |
-
|
47 |
-
## Additional Clauses
|
48 |
-
|
49 |
-
The additional clauses for this project are as follows:
|
50 |
-
|
51 |
-
- The SEA-E model is intended solely to provide informative reviews for authors to polish their papers instead of directly recommending acceptance/rejection on papers.
|
52 |
-
- Currently, the SEA-E model is only applicable within the field of machine learning and does not guarantee insightful comments for other disciplines.
|
53 |
-
|
54 |
-
|
55 |
-
## Citation
|
56 |
-
|
57 |
-
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
58 |
-
|
59 |
-
If you find our paper or models helpful, please consider cite as follows:
|
60 |
|
61 |
```bibtex
|
62 |
@misc{yu2024sea,
|
|
|
3 |
tags:
|
4 |
- Automated Peer Reviewing
|
5 |
- SFT
|
6 |
+
datasets:
|
7 |
+
- ECNU-SEA/SEA_data
|
8 |
---
|
9 |
|
10 |
## Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis
|
|
|
19 |
- 🔥🔥🔥 We have made SEA series models (7B) public !
|
20 |
|
21 |
## Model Description
|
22 |
+
The SEA-A model aims to integrate all reviews for each paper into one to eliminate redundancy and errors, focusing on the major advantages and disadvantages of the paper. Specifically, we first utilize GPT-4 to integrate multiple reviews of a paper into one (From [ECNU-SEA/SEA_data](https://huggingface.co/datasets/ECNU-SEA/SEA_data)) that is in a unified format and criterion with constructive contents, and form an instruction dataset for SFT. After that, we fine-tune [Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) to distill the knowledge of GPT-4. Therefore, SEA-S provides a novel paradigm for integrating peer review data in an unified format across various conferences.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
23 |
|
24 |
```bibtex
|
25 |
@misc{yu2024sea,
|