AmelieSchreiber committed
Commit cbd6888 · 1 Parent(s): b9246b3
Update README.md
README.md CHANGED
@@ -1,9 +1,14 @@
 ---
 library_name: peft
 ---
+
+# ESM-2 LoRA for CAFA-5 Protein Function Prediction
+This is a Low Rank Adaptation (LoRA) of [cafa_5_protein_function_prediction](https://huggingface.co/AmelieSchreiber/cafa_5_protein_function_prediction),
+which is a fine-tuned (without LoRA) version of `facebook/esm2_t6_8M_UR50D`, for the same task.
+
 ## Training procedure
 Using Hugging Face's Parameter Efficient Fine-Tuning (PEFT) library, a Low Rank Adaptation was trained for
-3 epochs on the CAFA-5 protein sequences dataset at an 80/20 train/test split
+3 epochs on the CAFA-5 protein sequences dataset at an 80/20 train/test split. The dataset can be
 [found here](https://huggingface.co/datasets/AmelieSchreiber/cafa_5). Somewhat naively, the model was trained on
 the `train_sequences.fasta` file of protein sequences, with the `train_terms.tsv` file serving as the labels.
 The gene ontology used is a hierarchy, and so the labels lower in the hierarchy should be weighted more, or the
@@ -20,6 +25,8 @@ Validation Micro Recall: 0.2911,
 Validation Macro Recall: 0.9968
 ```
 
+Future iterations of this model will need to take into account class weighting.
+
 ### Framework versions
 
 - PEFT 0.4.0
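
The README change above notes that future iterations will need class weighting. The following is a minimal sketch of one common way to do that for multi-label targets: inverse-frequency positive weights applied to a binary cross-entropy. It is an illustration only, not the author's training code. The function names `positive_class_weights` and `weighted_bce`, and the toy label matrix, are hypothetical.

```python
import math


def positive_class_weights(label_rows):
    """Return one weight per class, computed as (#negatives / #positives)."""
    n_samples = len(label_rows)
    n_classes = len(label_rows[0])
    weights = []
    for c in range(n_classes):
        positives = sum(row[c] for row in label_rows)
        # Guard against classes with no positive example in this split.
        weights.append((n_samples - positives) / max(positives, 1))
    return weights


def weighted_bce(probs, targets, pos_weights, eps=1e-7):
    """Mean binary cross-entropy over classes for one sample.

    The positive term of each class is scaled by its entry in pos_weights.
    """
    total = 0.0
    for p, t, w in zip(probs, targets, pos_weights):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(w * t * math.log(p) + (1 - t) * math.log(1.0 - p))
    return total / len(probs)


# Toy multi-hot labels: 4 proteins x 3 hypothetical GO terms.
labels = [
    [1, 0, 0],
    [1, 0, 1],
    [1, 0, 0],
    [0, 1, 0],
]
weights = positive_class_weights(labels)
```

Rare terms receive larger weights, so a missed positive on a rare term costs more than a missed positive on a frequent one. In PyTorch, the same idea is available through the `pos_weight` argument of `torch.nn.BCEWithLogitsLoss`.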