Model Card for PaperPub

Paper publication prediction based on English computer science abstracts.

Model Details

Model Description

PaperPub is a SciBERT (Beltagy et al. 2019) model fine-tuned to predict paper acceptance from computer science abstracts. Acceptance is modeled as a binary decision: accept or reject. The training and evaluation data are based on the arXiv subsection of PeerRead (Kang et al. 2018). Our main use case for PaperPub is research into how attribution scores derived from acceptance predictions can support reflection on the content and writing quality of abstracts.

  • Developed by: Semantic Computing Group, Bielefeld University, in particular Jan-Philipp Töberg, Christoph Düsing, Jonas Belouadi and Matthias Orlikowski
  • Model type: BERT for binary classification
  • Language(s) (NLP): English
  • License: MIT
  • Finetuned from model: SciBERT

Model Sources

We will add a public demo in which PaperPub powers an application that uses attribution scores to highlight the words in an abstract that contribute to acceptance or rejection predictions.

  • Repository: tba
  • Demo: tba

Uses

PaperPub can only be meaningfully used in a research setting. The model should not be used for any consequential paper quality judgements.

Direct Use

The intended use case is research into how attribution scores computed from paper acceptance predictions reflect the content quality of an abstract.

Out-of-Scope Use

This model must not be used as part of any kind of paper quality judgement, and in particular not in a peer review process. PaperPub is explicitly not meant to automate paper acceptance decisions.

Bias, Risks, and Limitations

Bias, risks, and limitations are mainly related to the dataset used. In addition to the limitations that apply to the SciBERT pre-training corpus, our training data represents only a very specific subset of papers. PaperPub was trained in a hackathon-like setting, so its performance is not optimized; maximizing performance was not our main goal.

Recommendations

Users should be aware that the dataset used for fine-tuning (computer science arXiv preprints from a specific period) represents a very specific idea of what papers, and in particular papers fit for publication, look like.

How to Get Started with the Model

tba
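Until official instructions are published, the following is a minimal sketch of how the model could be queried with the Hugging Face `transformers` library. It assumes that the repository `agsc/scibert-bf16-acceptance` ships both model weights and a tokenizer, and that the class names can be read from the model configuration; neither is confirmed by this card. The function name `predict_acceptance` and the 512-token truncation limit are illustrative choices.

```python
import math

# Repository name as shown on the Hub page for this model card.
MODEL_ID = "agsc/scibert-bf16-acceptance"


def softmax(logits):
    """Convert raw logits into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_acceptance(abstract, model_id=MODEL_ID):
    """Return a {label: probability} dict for one abstract.

    Assumption: the repository contains a tokenizer and a
    sequence-classification head; label names come from the config.
    """
    # Heavy dependencies are imported lazily so that `softmax`
    # stays usable without them.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    model.eval()

    inputs = tokenizer(abstract, truncation=True, max_length=512,
                       return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()

    probs = softmax(logits)
    return {model.config.id2label[i]: p for i, p in enumerate(probs)}
```

Calling `predict_acceptance("We propose ...")` downloads the model on first use. In line with the intended use described above, the returned probabilities are meant for research only, not for judging individual papers.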

Training Details

Training Data

Custom stratified split of the arXiv subsection of PeerRead (Kang et al. 2018). We use the data from the PeerRead GitHub repository, not the Hugging Face Hub version.
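To illustrate what a stratified split does, here is a minimal pure-Python sketch. It is not the script used for PaperPub: the test fraction, the seed, and the toy label distribution are assumptions made for the example, and the card does not state the actual split sizes.

```python
import random
from collections import defaultdict


def stratified_split(labels, test_fraction=0.2, seed=0):
    """Split example indices so each class keeps its share in both parts.

    `test_fraction` and `seed` are illustrative defaults, not the
    values used for PaperPub.
    """
    rng = random.Random(seed)

    # Group example indices by class label.
    by_label = defaultdict(list)
    for idx, label in enumerate(labels):
        by_label[label].append(idx)

    train_idx, test_idx = [], []
    for indices in by_label.values():
        rng.shuffle(indices)
        n_test = round(len(indices) * test_fraction)
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return sorted(train_idx), sorted(test_idx)


# Toy data: 75 examples of one class, 25 of the other (hypothetical).
labels = ["reject"] * 75 + ["accept"] * 25
train_idx, test_idx = stratified_split(labels, test_fraction=0.2, seed=0)
```

Sampling within each class separately keeps the accept/reject ratio the same in the training and test portions, which matters when one class is much more frequent than the other.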

Training Procedure

Training Hyperparameters

  • Training regime: bf16 mixed precision
  • Epochs: 2
  • Initial Learning Rate: 2^-5

Evaluation

Testing Data, Factors & Metrics

Testing Data

Custom stratified split of the arXiv subsection of PeerRead (Kang et al. 2018). We use the data from the PeerRead GitHub repository, not the Hugging Face Hub version.

Factors

Models: we compare PaperPub to a naive most-frequent-class baseline.

Metrics

Accuracy, Macro F1

Results

  • Majority Baseline
    • Acc. - 0.75
    • Macro F1 - 0.43
  • PaperPub
    • Acc. - 0.82
    • Macro F1 - 0.76
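The gap between the baseline's accuracy and its macro F1 follows directly from how the two metrics are defined. The sketch below recomputes both for a predictor that always outputs the most frequent class, assuming a test set in which that class makes up exactly 75% of the examples (inferred from the reported baseline accuracy; the label encoding and the test set size of 100 are hypothetical).

```python
def f1(tp, fp, fn):
    """F1 score from counts; defined as 0.0 when there are no positives."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def accuracy_and_macro_f1(y_true, y_pred):
    """Return (accuracy, macro-averaged F1) over all classes that occur."""
    classes = sorted(set(y_true) | set(y_pred))
    correct = sum(t == p for t, p in zip(y_true, y_pred))

    per_class = []
    for c in classes:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        per_class.append(f1(tp, fp, fn))

    return correct / len(y_true), sum(per_class) / len(per_class)


# Hypothetical test set: class 0 is the majority (75%), class 1 the minority.
y_true = [0] * 75 + [1] * 25
y_pred = [0] * 100  # majority baseline: always predict class 0

acc, macro_f1 = accuracy_and_macro_f1(y_true, y_pred)
print(round(acc, 2), round(macro_f1, 2))  # 0.75 0.43
```

The baseline reaches an F1 of about 0.86 on the majority class and 0 on the minority class, which averages to 0.43. Macro F1 therefore shows more clearly than accuracy that PaperPub (0.76) learns something about both classes.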

Environmental Impact

  • Hardware Type: 1xA40
  • Hours used: 0.3
  • Cloud Provider: Private Infrastructure
  • Compute Region: Europe

Technical Specifications

Compute Infrastructure

We use an internal SLURM cluster with A40 GPUs.

Citation

tba

Model Card Contact

Matthias Orlikowski

Model size: 110M params (Safetensors, tensor type F32)

Dataset used to train agsc/scibert-bf16-acceptance