Model V9 Release (#12)
Browse files- Update model to version 9: Improved performance metrics and evaluation results (5fbbe83dee7ef72c61a8173c4ccf27b19788fc2e)
Co-authored-by: Harshit <[email protected]>
- README.md +80 -83
- config.json +6 -4
- label_encoder.joblib +2 -2
- pytorch_model.bin +2 -2
README.md
CHANGED
@@ -1,12 +1,12 @@
|
|
1 |
---
|
2 |
license: mit
|
3 |
language:
|
4 |
-
- en
|
5 |
---
|
6 |
|
7 |
# Model Card for Model ID
|
8 |
|
9 |
-
This model card outlines the Pebblo Classifier, a machine learning system specialized in text classification. Developed by DAXA.AI, this model is adept at categorizing various agreement documents within organizational structures, trained on
|
10 |
|
11 |
## Model Details
|
12 |
|
@@ -88,102 +88,99 @@ print(decoded_label)
|
|
88 |
|
89 |
### Training Data
|
90 |
|
91 |
-
The training dataset consists of
|
92 |
Here are the labels along with their respective counts in the dataset:
|
93 |
|
94 |
-
| Agreement Type
|
95 |
-
|
|
96 |
-
| BOARD_MEETING_AGREEMENT
|
97 |
-
| CONSULTING_AGREEMENT
|
98 |
-
| CUSTOMER_LIST_AGREEMENT
|
99 |
-
| DISTRIBUTION_PARTNER_AGREEMENT
|
100 |
-
| EMPLOYEE_AGREEMENT
|
101 |
-
| ENTERPRISE_AGREEMENT
|
102 |
-
| ENTERPRISE_LICENSE_AGREEMENT
|
103 |
-
| EXECUTIVE_SEVERANCE_AGREEMENT
|
104 |
-
| FINANCIAL_REPORT_AGREEMENT
|
105 |
-
| HARMFUL_ADVICE
|
106 |
-
| INTERNAL_PRODUCT_ROADMAP_AGREEMENT
|
107 |
-
| LOAN_AND_SECURITY_AGREEMENT
|
108 |
-
| MEDICAL_ADVICE
|
109 |
-
| MERGER_AGREEMENT
|
110 |
-
| NDA_AGREEMENT
|
111 |
-
| NORMAL_TEXT
|
112 |
-
| PATENT_APPLICATION_FILLINGS_AGREEMENT
|
113 |
-
| PRICE_LIST_AGREEMENT
|
114 |
-
| SETTLEMENT_AGREEMENT
|
115 |
-
|
|
116 |
-
|
117 |
-
|
118 |
|
119 |
## Evaluation
|
120 |
|
121 |
### Testing Data & Metrics
|
122 |
|
123 |
#### Testing Data
|
124 |
-
Evaluation was performed on a dataset of 82,917 entries with a temperature range of 1-1.25 for randomness.
|
125 |
-
Here are the labels along with their respective counts in the dataset:
|
126 |
-
|
127 |
-
| Agreement Type | Instances |
|
128 |
-
| --------------------------------------- | --------- |
|
129 |
-
| BOARD_MEETING_AGREEMENT | 4,335 |
|
130 |
-
| CONSULTING_AGREEMENT | 1,533 |
|
131 |
-
| CUSTOMER_LIST_AGREEMENT | 4,995 |
|
132 |
-
| DISTRIBUTION_PARTNER_AGREEMENT | 7,231 |
|
133 |
-
| EMPLOYEE_AGREEMENT | 1,433 |
|
134 |
-
| ENTERPRISE_AGREEMENT | 1,616 |
|
135 |
-
| ENTERPRISE_LICENSE_AGREEMENT | 8,574 |
|
136 |
-
| EXECUTIVE_SEVERANCE_AGREEMENT | 5,177 |
|
137 |
-
| FINANCIAL_REPORT_AGREEMENT | 4,264 |
|
138 |
-
| HARMFUL_ADVICE | 474 |
|
139 |
-
| INTERNAL_PRODUCT_ROADMAP_AGREEMENT | 4,116 |
|
140 |
-
| LOAN_AND_SECURITY_AGREEMENT | 6,354 |
|
141 |
-
| MEDICAL_ADVICE | 289 |
|
142 |
-
| MERGER_AGREEMENT | 7,079 |
|
143 |
-
| NDA_AGREEMENT | 1,452 |
|
144 |
-
| NORMAL_TEXT | 8,335 |
|
145 |
-
| PATENT_APPLICATION_FILLINGS_AGREEMENT | 6,177 |
|
146 |
-
| PRICE_LIST_AGREEMENT | 5,453 |
|
147 |
-
| SETTLEMENT_AGREEMENT | 5,806 |
|
148 |
-
| SEXUAL_HARRASSMENT | 4,750 |
|
149 |
|
|
|
|
|
150 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
151 |
|
152 |
#### Metrics
|
153 |
|
154 |
-
| Agreement Type
|
155 |
-
|
|
156 |
-
| BOARD_MEETING_AGREEMENT
|
157 |
-
| CONSULTING_AGREEMENT
|
158 |
-
| CUSTOMER_LIST_AGREEMENT
|
159 |
-
| DISTRIBUTION_PARTNER_AGREEMENT
|
160 |
-
| EMPLOYEE_AGREEMENT
|
161 |
-
| ENTERPRISE_AGREEMENT
|
162 |
-
| ENTERPRISE_LICENSE_AGREEMENT
|
163 |
-
| EXECUTIVE_SEVERANCE_AGREEMENT
|
164 |
-
| FINANCIAL_REPORT_AGREEMENT
|
165 |
-
| HARMFUL_ADVICE
|
166 |
-
| INTERNAL_PRODUCT_ROADMAP_AGREEMENT
|
167 |
-
| LOAN_AND_SECURITY_AGREEMENT
|
168 |
-
| MEDICAL_ADVICE
|
169 |
-
| MERGER_AGREEMENT
|
170 |
-
| NDA_AGREEMENT
|
171 |
-
| NORMAL_TEXT
|
172 |
-
| PATENT_APPLICATION_FILLINGS_AGREEMENT
|
173 |
-
| PRICE_LIST_AGREEMENT
|
174 |
-
| SETTLEMENT_AGREEMENT
|
175 |
-
|
|
176 |
-
|
|
177 |
-
| accuracy
|
178 |
-
| macro avg
|
179 |
-
| weighted avg
|
180 |
-
|
181 |
|
182 |
#### Results
|
183 |
|
184 |
-
The model’s performance is summarized by precision, recall, and f1-score metrics, which are detailed across all
|
185 |
-
|
186 |
-
The evaluation loss, which measures the discrepancy between the model’s predictions and the actual values, is 0.5616. Lower loss values indicate better model performance.
|
187 |
|
188 |
-
The
|
189 |
|
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
language:
|
4 |
+
- en
|
5 |
---
|
6 |
|
7 |
# Model Card for Model ID
|
8 |
|
9 |
+
This model card outlines the Pebblo Classifier, a machine learning system specialized in text classification. Developed by DAXA.AI, this model is adept at categorizing various agreement documents within organizational structures, trained on 21 distinct labels.
|
10 |
|
11 |
## Model Details
|
12 |
|
|
|
88 |
|
89 |
### Training Data
|
90 |
|
91 |
+
The training dataset consists of 141,055 entries, with 21 unique labels. The labels span various document types, with instances distributed across three text sizes (128 ± x, 256 ± x, and 512 ± x words; x varies within 20).
|
92 |
Here are the labels along with their respective counts in the dataset:
|
93 |
|
94 |
+
| Agreement Type | Instances |
|
95 |
+
| ------------------------------------- | --------- |
|
96 |
+
| BOARD_MEETING_AGREEMENT | 4,206 |
|
97 |
+
| CONSULTING_AGREEMENT | 2,965 |
|
98 |
+
| CUSTOMER_LIST_AGREEMENT | 8,966 |
|
99 |
+
| DISTRIBUTION_PARTNER_AGREEMENT | 5,144 |
|
100 |
+
| EMPLOYEE_AGREEMENT | 3,876 |
|
101 |
+
| ENTERPRISE_AGREEMENT | 4,213 |
|
102 |
+
| ENTERPRISE_LICENSE_AGREEMENT | 8,999 |
|
103 |
+
| EXECUTIVE_SEVERANCE_AGREEMENT | 8,996 |
|
104 |
+
| FINANCIAL_REPORT_AGREEMENT | 11,384 |
|
105 |
+
| HARMFUL_ADVICE | 1,887 |
|
106 |
+
| INTERNAL_PRODUCT_ROADMAP_AGREEMENT | 6,982 |
|
107 |
+
| LOAN_AND_SECURITY_AGREEMENT | 8,957 |
|
108 |
+
| MEDICAL_ADVICE | 3,847 |
|
109 |
+
| MERGER_AGREEMENT | 7,704 |
|
110 |
+
| NDA_AGREEMENT | 5,221 |
|
111 |
+
| NORMAL_TEXT | 8,994 |
|
112 |
+
| PATENT_APPLICATION_FILLINGS_AGREEMENT | 8,802 |
|
113 |
+
| PRICE_LIST_AGREEMENT | 8,906 |
|
114 |
+
| SETTLEMENT_AGREEMENT | 3,737 |
|
115 |
+
| SEXUAL_CONTENT | 8,957 |
|
116 |
+
| SEXUAL_INCIDENT_REPORT | 8,321 |
|
|
|
117 |
|
118 |
## Evaluation
|
119 |
|
120 |
### Testing Data & Metrics
|
121 |
|
122 |
#### Testing Data
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
123 |
|
124 |
+
Evaluation was performed on a dataset of 86,281 entries with a temperature range of 1-1.25 for randomness.
|
125 |
+
Here are the labels along with their respective counts in the dataset:
|
126 |
|
127 |
+
| Agreement Type | Instances |
|
128 |
+
| ------------------------------------- | --------- |
|
129 |
+
| BOARD_MEETING_AGREEMENT | 3,975 |
|
130 |
+
| CONSULTING_AGREEMENT | 1,430 |
|
131 |
+
| CUSTOMER_LIST_AGREEMENT | 4,488 |
|
132 |
+
| DISTRIBUTION_PARTNER_AGREEMENT | 6,696 |
|
133 |
+
| EMPLOYEE_AGREEMENT | 1,310 |
|
134 |
+
| ENTERPRISE_AGREEMENT | 1,501 |
|
135 |
+
| ENTERPRISE_LICENSE_AGREEMENT | 7,967 |
|
136 |
+
| EXECUTIVE_SEVERANCE_AGREEMENT | 4,795 |
|
137 |
+
| FINANCIAL_REPORT_AGREEMENT | 4,686 |
|
138 |
+
| HARMFUL_ADVICE | 361 |
|
139 |
+
| INTERNAL_PRODUCT_ROADMAP_AGREEMENT | 3,740 |
|
140 |
+
| LOAN_AND_SECURITY_AGREEMENT | 5,833 |
|
141 |
+
| MEDICAL_ADVICE | 643 |
|
142 |
+
| MERGER_AGREEMENT | 6,557 |
|
143 |
+
| NDA_AGREEMENT | 1,352 |
|
144 |
+
| NORMAL_TEXT | 5,811 |
|
145 |
+
| PATENT_APPLICATION_FILLINGS_AGREEMENT | 5,608 |
|
146 |
+
| PRICE_LIST_AGREEMENT | 5,044 |
|
147 |
+
| SETTLEMENT_AGREEMENT | 5,377 |
|
148 |
+
| SEXUAL_CONTENT | 4,356 |
|
149 |
+
| SEXUAL_INCIDENT_REPORT | 4,750 |
|
150 |
|
151 |
#### Metrics
|
152 |
|
153 |
+
| Agreement Type | precision | recall | f1-score | support |
|
154 |
+
| ------------------------------------- | --------- | ------ | -------- | ------- |
|
155 |
+
| BOARD_MEETING_AGREEMENT | 0.92 | 0.95 | 0.93 | 3,975 |
|
156 |
+
| CONSULTING_AGREEMENT | 0.81 | 0.85 | 0.83 | 1,430 |
|
157 |
+
| CUSTOMER_LIST_AGREEMENT | 0.90 | 0.88 | 0.89 | 4,488 |
|
158 |
+
| DISTRIBUTION_PARTNER_AGREEMENT | 0.73 | 0.63 | 0.68 | 6,696 |
|
159 |
+
| EMPLOYEE_AGREEMENT | 0.85 | 0.84 | 0.85 | 1,310 |
|
160 |
+
| ENTERPRISE_AGREEMENT | 0.18 | 0.70 | 0.29 | 1,501 |
|
161 |
+
| ENTERPRISE_LICENSE_AGREEMENT | 0.92 | 0.78 | 0.84 | 7,967 |
|
162 |
+
| EXECUTIVE_SEVERANCE_AGREEMENT | 0.97 | 0.88 | 0.92 | 4,795 |
|
163 |
+
| FINANCIAL_REPORT_AGREEMENT | 0.93 | 0.99 | 0.96 | 4,686 |
|
164 |
+
| HARMFUL_ADVICE | 0.92 | 0.94 | 0.93 | 361 |
|
165 |
+
| INTERNAL_PRODUCT_ROADMAP_AGREEMENT | 0.94 | 0.98 | 0.96 | 3,740 |
|
166 |
+
| LOAN_AND_SECURITY_AGREEMENT | 0.93 | 0.97 | 0.95 | 5,833 |
|
167 |
+
| MEDICAL_ADVICE | 0.93 | 1.00 | 0.96 | 643 |
|
168 |
+
| MERGER_AGREEMENT | 0.93 | 0.45 | 0.61 | 6,557 |
|
169 |
+
| NDA_AGREEMENT | 0.68 | 0.91 | 0.78 | 1,352 |
|
170 |
+
| NORMAL_TEXT | 0.95 | 0.94 | 0.95 | 5,811 |
|
171 |
+
| PATENT_APPLICATION_FILLINGS_AGREEMENT | 0.96 | 0.99 | 0.98 | 5,608 |
|
172 |
+
| PRICE_LIST_AGREEMENT | 0.76 | 0.79 | 0.77 | 5,044 |
|
173 |
+
| SETTLEMENT_AGREEMENT | 0.76 | 0.78 | 0.77 | 5,377 |
|
174 |
+
| SEXUAL_CONTENT | 0.92 | 0.97 | 0.94 | 4,356 |
|
175 |
+
| SEXUAL_INCIDENT_REPORT | 0.99 | 0.94 | 0.96 | 4,750 |
|
176 |
+
| accuracy | | | 0.84 | 86,280 |
|
177 |
+
| macro avg | 0.85 | 0.86 | 0.84 | 86,280 |
|
178 |
+
| weighted avg | 0.88 | 0.84 | 0.85 | 86,280 |
|
|
|
179 |
|
180 |
#### Results
|
181 |
|
182 |
+
The model’s performance is summarized by precision, recall, and f1-score metrics, which are detailed across all 21 labels in the dataset. Based on the test data evaluation results, the model achieved an accuracy of 0.8424, a precision of 0.8794, and a recall of 0.8424. The F1-score, which is the harmonic mean of precision and recall, stands at 0.8505.
|
|
|
|
|
183 |
|
184 |
+
The evaluation loss, which measures the discrepancy between the model’s predictions and the actual values, is 0.6815. Lower loss values indicate better model performance.
|
185 |
|
186 |
+
The model was able to process approximately 97.684 samples per second during the evaluation, which took a total runtime of 883.2545 seconds. The model performed approximately 0.764 evaluation steps per second.
|
config.json
CHANGED
@@ -9,7 +9,6 @@
|
|
9 |
"dropout": 0.1,
|
10 |
"hidden_dim": 3072,
|
11 |
"id2label": {
|
12 |
-
|
13 |
"0": "BOARD_MEETING_AGREEMENT",
|
14 |
"1": "CONSULTING_AGREEMENT",
|
15 |
"2": "CUSTOMER_LIST_AGREEMENT",
|
@@ -29,7 +28,8 @@
|
|
29 |
"16": "PATENT_APPLICATION_FILLINGS_AGREEMENT",
|
30 |
"17": "PRICE_LIST_AGREEMENT",
|
31 |
"18": "SETTLEMENT_AGREEMENT",
|
32 |
-
"19": "
|
|
|
33 |
},
|
34 |
"initializer_range": 0.02,
|
35 |
"label2id": {
|
@@ -44,8 +44,9 @@
|
|
44 |
"PATENT_APPLICATION_FILLINGS_AGREEMENT": 16,
|
45 |
"PRICE_LIST_AGREEMENT": 17,
|
46 |
"SETTLEMENT_AGREEMENT": 18,
|
47 |
-
"
|
48 |
"CUSTOMER_LIST_AGREEMENT": 2,
|
|
|
49 |
"DISTRIBUTION_PARTNER_AGREEMENT": 3,
|
50 |
"EMPLOYEE_AGREEMENT": 4,
|
51 |
"ENTERPRISE_AGREEMENT": 5,
|
@@ -59,11 +60,12 @@
|
|
59 |
"n_heads": 12,
|
60 |
"n_layers": 6,
|
61 |
"pad_token_id": 0,
|
|
|
62 |
"qa_dropout": 0.1,
|
63 |
"seq_classif_dropout": 0.2,
|
64 |
"sinusoidal_pos_embds": false,
|
65 |
"tie_weights_": true,
|
66 |
"torch_dtype": "float32",
|
67 |
-
"transformers_version": "4.
|
68 |
"vocab_size": 30522
|
69 |
}
|
|
|
9 |
"dropout": 0.1,
|
10 |
"hidden_dim": 3072,
|
11 |
"id2label": {
|
|
|
12 |
"0": "BOARD_MEETING_AGREEMENT",
|
13 |
"1": "CONSULTING_AGREEMENT",
|
14 |
"2": "CUSTOMER_LIST_AGREEMENT",
|
|
|
28 |
"16": "PATENT_APPLICATION_FILLINGS_AGREEMENT",
|
29 |
"17": "PRICE_LIST_AGREEMENT",
|
30 |
"18": "SETTLEMENT_AGREEMENT",
|
31 |
+
"19": "SEXUAL_CONTENT",
|
32 |
+
"20": "SEXUAL_INCIDENT_REPORT"
|
33 |
},
|
34 |
"initializer_range": 0.02,
|
35 |
"label2id": {
|
|
|
44 |
"PATENT_APPLICATION_FILLINGS_AGREEMENT": 16,
|
45 |
"PRICE_LIST_AGREEMENT": 17,
|
46 |
"SETTLEMENT_AGREEMENT": 18,
|
47 |
+
"SEXUAL_CONTENT": 19,
|
48 |
"CUSTOMER_LIST_AGREEMENT": 2,
|
49 |
+
"SEXUAL_INCIDENT_REPORT": 20,
|
50 |
"DISTRIBUTION_PARTNER_AGREEMENT": 3,
|
51 |
"EMPLOYEE_AGREEMENT": 4,
|
52 |
"ENTERPRISE_AGREEMENT": 5,
|
|
|
60 |
"n_heads": 12,
|
61 |
"n_layers": 6,
|
62 |
"pad_token_id": 0,
|
63 |
+
"problem_type": "single_label_classification",
|
64 |
"qa_dropout": 0.1,
|
65 |
"seq_classif_dropout": 0.2,
|
66 |
"sinusoidal_pos_embds": false,
|
67 |
"tie_weights_": true,
|
68 |
"torch_dtype": "float32",
|
69 |
+
"transformers_version": "4.40.2",
|
70 |
"vocab_size": 30522
|
71 |
}
|
label_encoder.joblib
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f21f9707a92949f7085d2370e49e3be8e7ba71ed1508f0e0f6f21f48f6fbb8e9
|
3 |
+
size 1118
|
pytorch_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:231c166001b51adbeecfddd55563569a52981f7c88e3802364368f76e86279d2
|
3 |
+
size 268212925
|