heroding77 commited on
Commit
0ddf71a
·
verified ·
1 Parent(s): 3bb9ab2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -40
README.md CHANGED
@@ -3,6 +3,8 @@ license: apache-2.0
3
  tags:
4
  - Automated Peer Reviewing
5
  - SFT
 
 
6
  ---
7
 
8
  ## Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis
@@ -17,46 +19,7 @@ Project Page: https://ecnu-sea.github.io/
17
  - 🔥🔥🔥 We have made SEA series models (7B) public !
18
 
19
  ## Model Description
20
- The SEA-E model utilizes [Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) as its backbone. It is derived by performing supervised fine-tuning (SFT) on a high-quality peer review instruction dataset, standardized through the SEA-S model. This model can provide comprehensive and insightful review feedback for submitted papers.
21
-
22
- ## Review Paper With SEA-E
23
-
24
- ```python
25
- from transformers import AutoModelForCausalLM, AutoTokenizer
26
-
27
- instruction = system_prompt_dict['instruction_e']
28
- paper = read_txt_file(mmd_file_path)
29
- idx = paper.find("## References")
30
- paper = paper[:idx].strip()
31
-
32
- messages = [
33
- {"role": "system", "content": instruction},
34
- {"role": "user", "content": paper},
35
- ]
36
-
37
- encodes = tokenizer.apply_chat_template(messages, return_tensors="pt")
38
- encodes = encodes.to("cuda:0")
39
- len_input = encodes.shape[1]
40
- generated_ids = chat_model.generate(encodes,max_new_tokens=8192,do_sample=True)
41
- # response = chat_model.chat(messages)[0].response_text
42
- response = tokenizer.batch_decode(generated_ids[: , len_input:])[0]
43
-
44
- ```
45
- The code provided above is an example. For detailed usage instructions, please refer to https://github.com/ecnu-sea/sea.
46
-
47
- ## Additional Clauses
48
-
49
- The additional clauses for this project are as follows:
50
-
51
- - The SEA-E model is intended solely to provide informative reviews for authors to polish their papers instead of directly recommending acceptance/rejection on papers.
52
- - Currently, the SEA-E model is only applicable within the field of machine learning and does not guarantee insightful comments for other disciplines.
53
-
54
-
55
- ## Citation
56
-
57
- <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
58
-
59
- If you find our paper or models helpful, please consider cite as follows:
60
 
61
  ```bibtex
62
  @misc{yu2024sea,
 
3
  tags:
4
  - Automated Peer Reviewing
5
  - SFT
6
+ datasets:
7
+ - ECNU-SEA/SEA_data
8
  ---
9
 
10
  ## Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis
 
19
  - 🔥🔥🔥 We have made SEA series models (7B) public !
20
 
21
  ## Model Description
22
+ The SEA-A model aims to integrate all reviews for each paper into one to eliminate redundancy and errors, focusing on the major advantages and disadvantages of the paper. Specifically, we first utilize GPT-4 to integrate multiple reviews of a paper into one (From [ECNU-SEA/SEA_data](https://huggingface.co/datasets/ECNU-SEA/SEA_data)) that is in a unified format and criterion with constructive contents, and form an instruction dataset for SFT. After that, we fine-tune [Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) to distill the knowledge of GPT-4. Therefore, SEA-S provides a novel paradigm for integrating peer review data in an unified format across various conferences.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
23
 
24
  ```bibtex
25
  @misc{yu2024sea,