Gurveer05 commited on
Commit
8f1f9b5
·
verified ·
1 Parent(s): 3c3364b

0.3428, cosine with 5e-5

Browse files
Files changed (2) hide show
  1. README.md +106 -129
  2. model.safetensors +1 -1
README.md CHANGED
@@ -7,161 +7,140 @@ tags:
7
  - sentence-similarity
8
  - feature-extraction
9
  - generated_from_trainer
10
- - dataset_size:2940
11
  - loss:MultipleNegativesRankingLoss
12
  widget:
13
- - source_sentence: "Construct: Identify a unit of area.\n\nQuestion: James has answered\
14
- \ a question on the area of a trapezium and got an answer of 54 .\n\nBehind the\
15
- \ star he has written the units that he used.\n\n(\n54 , * \n)\n\nWhich of the\
16
- \ following units could be correct?\n\nOptions:\nA. m\nB. cm\nC. km^3\nD. mm^2\n\
17
- \nAnswer: cm"
18
- sentences:
19
- - Identify a unit of area
20
- - "James has answered a question on the area of a trapezium and got an answer of\
21
- \ 54 .\n\nBehind the star he has written the units that he used.\n\n(\n54 , *\
22
- \ \n)\n\nWhich of the following units could be correct?"
23
- - Does not know units of area should be squared
24
- - source_sentence: 'Construct: Can calculate missing coordinates in a geometric shape.
25
-
26
-
27
- Question: Here are 3 vertices of a rectangle:
28
 
29
- (0,3),(4,-1),(2,-3),
30
 
31
-
32
- What are the coordinates of the 4^th vertex?
33
 
34
 
35
  Options:
36
 
37
- A. (0,1)
38
 
39
- B. (4,1)
40
 
41
- C. (-2,1)
42
 
43
- D. (-2,-3)
44
 
45
 
46
- Answer: (4,1)'
47
  sentences:
48
- - 'Here are 3 vertices of a rectangle:
49
-
50
- (0,3),(4,-1),(2,-3),
 
 
51
 
52
 
53
- What are the coordinates of the 4^th vertex?'
54
- - Can calculate missing coordinates in a geometric shape
55
- - Believes the sides of a rectangle must be horizontal and vertical, not diagonal
56
- - source_sentence: 'Construct: Reflect an object or a point in a diagonal line with
57
- the line of reflection drawn, where the line of reflection lies on the edge or
58
- outside the object.
59
-
60
-
61
- Question: The triangle is reflected in the dashed line
62
-
63
- What are the new coordinates of point P ? Four quadrant, coordinate grid with
64
- the reflection line y=-x drawn and a triangle. The triangle has coordinates: (-2,3)
65
- (-2,6) and (0,5). The point P is the coordinate (0,5).
66
 
67
 
68
  Options:
69
 
70
- A. (-5,0)
71
 
72
- B. (-4,-1)
73
 
74
- C. (-2,2)
75
 
76
- D. (-4,5)
77
 
78
 
79
- Answer: (-4,5)'
80
  sentences:
81
- - Reflects horizontally or vertically instead of across a diagonal line of symmetry
82
- - Reflect an object or a point in a diagonal line with the line of reflection drawn,
83
- where the line of reflection lies on the edge or outside the object
84
- - 'The triangle is reflected in the dashed line
85
 
86
- What are the new coordinates of point P ? Four quadrant, coordinate grid with
87
- the reflection line y=-x drawn and a triangle. The triangle has coordinates: (-2,3)
88
- (-2,6) and (0,5). The point P is the coordinate (0,5)'
89
- - source_sentence: 'Construct: Rearrange formulae to change the subject where the
90
- subject appears once and two steps are needed.
91
 
 
 
92
 
93
- Question: Rearrange the following equation to make r the subject
94
 
95
- [
96
 
97
- p=3 r-5
98
 
99
- ].
100
 
 
101
 
102
- Options:
103
 
104
- A. r=(p+5 / 3)
105
 
106
- B. r=p+5 / 3
 
 
 
 
 
 
107
 
108
- C. r=(p-5 / 3)
109
 
110
- D. r=p-5 / 3
111
 
 
112
 
113
- Answer: r=p+5 / 3'
114
- sentences:
115
- - Rearrange formulae to change the subject where the subject appears once and two
116
- steps are needed
117
- - 'Rearrange the following equation to make r the subject
118
 
119
- [
120
 
121
- p=3 r-5
122
 
123
- ]'
124
- - Misunderstands order of operations in algebraic expressions
125
- - source_sentence: 'Construct: Solve length problems involving similar shapes where
126
- the missing length is the shorter one.
127
 
 
128
 
129
- Question: These two shapes are similar.
130
 
 
131
 
132
- What is the value of y ? The diagram is of two trapeziums. The base of each
133
- trapezium is parallel with its top side.
134
 
135
- The smaller trapezium has base 60cm and top labelled y.
136
 
137
- The larger trapezium has base 150cm and top 70cm.
 
 
 
 
 
138
 
 
139
 
140
- Options:
141
 
142
- A. 28 cm
 
143
 
144
- B. 20 cm
145
 
146
- C. 25 cm
 
147
 
148
- D. 30 cm
149
 
 
150
 
151
- Answer: 30 cm'
152
- sentences:
153
- - 'These two shapes are similar.
154
 
 
155
 
156
- What is the value of y ? The diagram is of two trapeziums. The base of each
157
- trapezium is parallel with its top side.
158
 
159
- The smaller trapezium has base 60cm and top labelled y.
160
 
161
- The larger trapezium has base 150cm and top 70cm.'
162
- - Estimated when not appropriate
163
- - Solve length problems involving similar shapes where the missing length is the
164
- shorter one
 
 
 
 
165
  ---
166
 
167
  # SentenceTransformer based on Alibaba-NLP/gte-base-en-v1.5
@@ -214,9 +193,9 @@ from sentence_transformers import SentenceTransformer
214
  model = SentenceTransformer("Gurveer05/gte-base-eedi-2024")
215
  # Run inference
216
  sentences = [
217
- 'Construct: Solve length problems involving similar shapes where the missing length is the shorter one.\n\nQuestion: These two shapes are similar.\n\nWhat is the value of y ? The diagram is of two trapeziums. The base of each trapezium is parallel with its top side.\nThe smaller trapezium has base 60cm and top labelled y.\nThe larger trapezium has base 150cm and top 70cm.\n\nOptions:\nA. 28 cm\nB. 20 cm\nC. 25 cm\nD. 30 cm\n\nAnswer: 30 cm',
218
- 'Estimated when not appropriate',
219
- 'Solve length problems involving similar shapes where the missing length is the shorter one',
220
  ]
221
  embeddings = model.encode(sentences)
222
  print(embeddings.shape)
@@ -271,19 +250,19 @@ You can finetune this model on your own dataset.
271
  #### csv
272
 
273
  * Dataset: csv
274
- * Size: 2,940 training samples
275
  * Columns: <code>qa_pair_text</code>, <code>MisconceptionName</code>, <code>ConstructName</code>, and <code>QuestionText</code>
276
  * Approximate statistics based on the first 1000 samples:
277
  | | qa_pair_text | MisconceptionName | ConstructName | QuestionText |
278
  |:--------|:------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
279
  | type | string | string | string | string |
280
- | details | <ul><li>min: 35 tokens</li><li>mean: 94.91 tokens</li><li>max: 903 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 14.61 tokens</li><li>max: 39 tokens</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 14.39 tokens</li><li>max: 41 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 38.43 tokens</li><li>max: 278 tokens</li></ul> |
281
  * Samples:
282
- | qa_pair_text | MisconceptionName | ConstructName | QuestionText |
283
- |:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
284
- | <code>Construct: Work out the size of each interior angle in a regular polygon.<br><br>Question: The octagon below is regular. Find angle x . A regular octagon (8 sided polygon). Each side is marked with a single dash to show that all sides are equal in length. One of the interior angles of the octagon is labelled with the letter "x".<br><br>Options:<br>A. 22.5°<br>B. 120°<br>C. 135°<br>D. 45°<br><br>Answer: 22.5°</code> | <code>Thinks the interior angles of any polygon add up to 180 degrees</code> | <code>Work out the size of each interior angle in a regular polygon</code> | <code>The octagon below is regular. Find angle x . A regular octagon (8 sided polygon). Each side is marked with a single dash to show that all sides are equal in length. One of the interior angles of the octagon is labelled with the letter "x".</code> |
285
- | <code>Construct: Manipulate both equations so that they are in a form to be added or subtracted.<br><br>Question: Sandeep wants to eliminate l from these simultaneous equations.<br>(<br>2 l+3 m=13 <br>3 l+4 m=17<br>)<br><br>First he will transform both of them to get them in an appropriate form.<br>Which of the following is a correct transformation?<br><br>Options:<br>A. 6 l+9 m=26 6 l+8 m=51<br>B. 6 l+9 m=13 6 l+8 m=17<br>C. 6 l+3 m=13 6 l+4 m=17<br>D. 6 l+9 m=39 6 l+8 m=34<br><br>Answer: 6 l+9 m=13 6 l+8 m=17</code> | <code>When solving simultaneous equations, thinks you only transform the algebraic terms, rather than the whole equation.</code> | <code>Manipulate both equations so that they are in a form to be added or subtracted</code> | <code>Sandeep wants to eliminate l from these simultaneous equations.<br>(<br>2 l+3 m=13 <br>3 l+4 m=17<br>)<br><br>First he will transform both of them to get them in an appropriate form.<br>Which of the following is a correct transformation?</code> |
286
- | <code>Construct: Express one quantity as a percentage of another mentally.<br><br>Question: What is 8 out of 40 as a percentage?<br><br>Options:<br>A. 8.4 %<br>B. 20 %<br>C. 16 %<br>D. 24 %<br><br>Answer: 16 %</code> | <code>Thinks they double the numerator to turn a fraction into a percentage</code> | <code>Express one quantity as a percentage of another mentally</code> | <code>What is 8 out of 40 as a percentage?</code> |
287
  * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
288
  ```json
289
  {
@@ -297,19 +276,19 @@ You can finetune this model on your own dataset.
297
  #### csv
298
 
299
  * Dataset: csv
300
- * Size: 1,430 evaluation samples
301
  * Columns: <code>qa_pair_text</code>, <code>MisconceptionName</code>, <code>ConstructName</code>, and <code>QuestionText</code>
302
  * Approximate statistics based on the first 1000 samples:
303
  | | qa_pair_text | MisconceptionName | ConstructName | QuestionText |
304
  |:--------|:------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
305
  | type | string | string | string | string |
306
- | details | <ul><li>min: 35 tokens</li><li>mean: 95.63 tokens</li><li>max: 903 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 14.59 tokens</li><li>max: 40 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 14.26 tokens</li><li>max: 41 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 39.45 tokens</li><li>max: 212 tokens</li></ul> |
307
  * Samples:
308
- | qa_pair_text | MisconceptionName | ConstructName | QuestionText |
309
- |:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------|
310
- | <code>Construct: Calculate the mean as an average from a list of data.<br><br>Question: Without using a calculator, what is the mean of the following numbers?<br><br>[<br>2,2,3,3,2<br>].<br><br>Options:<br>A. 2<br>B. A little more than 2<br>C. 3<br>D. A little less than 2<br><br>Answer: 2</code> | <code>When asked for the mean of a list of data, gives the mode</code> | <code>Calculate the mean as an average from a list of data</code> | <code>Without using a calculator, what is the mean of the following numbers?<br><br>[<br>2,2,3,3,2<br>]</code> |
311
- | <code>Construct: Solve problems that combine addition or subtraction with multiplication.<br><br>Question: Sara has £ 2.79 .<br>Rhys has 3 times as much as Sara.<br>How much do they have altogether?<br>Which calculation would not find the answer?<br><br>Options:<br>A. £ 2.79 x 4<br>B. (£ 2.79 x 3)+£ 2.79<br>C. £ 2.80 x 4-£ 0.04<br>D. £ 2.79 x 3<br><br>Answer: £ 2.80 x 4-£ 0.04</code> | <code>Does not think that the compensation strategy for the multiplication of 2 numbers is valid.</code> | <code>Solve problems that combine addition or subtraction with multiplication</code> | <code>Sara has £ 2.79 .<br>Rhys has 3 times as much as Sara.<br>How much do they have altogether?<br>Which calculation would not find the answer?</code> |
312
- | <code>Construct: Recognise and name an identity.<br><br>Question: Which of the following is the most appropriate name for this:<br>p-3 equiv-(3-p).<br><br>Options:<br>A. Equation<br>B. Expression<br>C. Formula<br>D. Identity<br><br>Answer: Equation</code> | <code>Confuses an equation with an identity</code> | <code>Recognise and name an identity</code> | <code>Which of the following is the most appropriate name for this:<br>p-3 equiv-(3-p)</code> |
313
  * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
314
  ```json
315
  {
@@ -450,24 +429,22 @@ You can finetune this model on your own dataset.
450
  </details>
451
 
452
  ### Training Logs
453
- | Epoch | Step | Training Loss | loss |
454
- |:-------:|:-------:|:-------------:|:----------:|
455
- | 0.25 | 46 | 4.1456 | - |
456
- | 0.5 | 92 | 1.3677 | 0.8336 |
457
- | 0.75 | 138 | 0.7569 | - |
458
- | 1.0 | 184 | 0.651 | 0.4735 |
459
- | 1.25 | 230 | 0.587 | - |
460
- | 1.5 | 276 | 0.4073 | 0.4675 |
461
- | 1.75 | 322 | 0.3533 | - |
462
- | 2.0 | 368 | 0.348 | 0.4002 |
463
- | 2.25 | 414 | 0.308 | - |
464
- | 2.5 | 460 | 0.2247 | 0.3935 |
465
- | 2.75 | 506 | 0.194 | - |
466
- | **3.0** | **552** | **0.1819** | **0.3638** |
467
- | 3.25 | 598 | 0.1758 | - |
468
- | 3.5 | 644 | 0.1247 | 0.4158 |
469
- | 3.75 | 690 | 0.1035 | - |
470
- | 4.0 | 736 | 0.1155 | 0.4073 |
471
 
472
  * The bold row denotes the saved checkpoint.
473
 
 
7
  - sentence-similarity
8
  - feature-extraction
9
  - generated_from_trainer
10
+ - dataset_size:2632
11
  - loss:MultipleNegativesRankingLoss
12
  widget:
13
+ - source_sentence: 'Construct: Identify the line of symmetry of curves in the form
14
+ (x + a)(x + b).
 
 
 
 
 
 
 
 
 
 
 
 
 
15
 
 
16
 
17
+ Question: This is the graph of y=(x+4)(x-2) W What is the equation of the line
18
+ of symmetry?
19
 
20
 
21
  Options:
22
 
23
+ A. y=-8
24
 
25
+ B. x=--4
26
 
27
+ C. y=-1
28
 
29
+ D. x = -1
30
 
31
 
32
+ Answer: y=-1'
33
  sentences:
34
+ - This is the graph of y=(x+4)(x-2) W What is the equation of the line of symmetry?
35
+ - Identify the line of symmetry of curves in the form (x + a)(x + b)
36
+ - Confuses the equations of vertical and horizontal lines
37
+ - source_sentence: 'Construct: Calculate percentage of an amount mentally where the
38
+ percent is a decimal.
39
 
40
 
41
+ Question: What is 5.5 % of 400 ?
 
 
 
 
 
 
 
 
 
 
 
 
42
 
43
 
44
  Options:
45
 
46
+ A. 22
47
 
48
+ B. 11
49
 
50
+ C. 5.5
51
 
52
+ D. 220
53
 
54
 
55
+ Answer: 220'
56
  sentences:
57
+ - Believes that dividing by 10 gives 1%
58
+ - What is 5.5 % of 400 ?
59
+ - Calculate percentage of an amount mentally where the percent is a decimal
60
+ - source_sentence: 'Construct: Convert from weeks to hours.
61
 
 
 
 
 
 
62
 
63
+ Question: Which one of the following calculations would work out the number of
64
+ hours in 5 weeks?
65
 
 
66
 
67
+ Options:
68
 
69
+ A. 5 x 24
70
 
71
+ B. 5 x 7 x 12
72
 
73
+ C. 5 x 5 x 24
74
 
75
+ D. 5 x 7 x 24
76
 
 
77
 
78
+ Answer: 5 x 5 x 24'
79
+ sentences:
80
+ - Which one of the following calculations would work out the number of hours in 5 weeks?
81
+ - Convert from weeks to hours
82
+ - Thinks there are 5 days in a week
83
+ - source_sentence: 'Construct: Expand two brackets with linear terms in the form
84
+ (ax + b)².
85
 
 
86
 
87
+ Question: Expand and simplify:
88
 
89
+ (
90
 
91
+ (3 y-4)^2
 
 
 
 
92
 
93
+ ).
94
 
 
95
 
96
+ Options:
 
 
 
97
 
98
+ A. 9 y^2-16
99
 
100
+ B. 9 y^2+16
101
 
102
+ C. 9 y^2-24 y+16
103
 
104
+ D. (3 y-4)(3 y-4)
 
105
 
 
106
 
107
+ Answer: 9 y^2+16'
108
+ sentences:
109
+ - Believes they only need to multiply the first and last pairs of terms when expanding
110
+ double brackets
111
+ - Expand two brackets with linear terms in the form (ax + b)²
112
+ - 'Expand and simplify:
113
 
114
+ (
115
 
116
+ (3 y-4)^2
117
 
118
+ )'
119
+ - source_sentence: 'Construct: Write a given translation in vector notation.
120
 
 
121
 
122
+ Question: Three left and five down in vector notation is... triangle bigcirc What
123
+ should replace the triangle ?
124
 
 
125
 
126
+ Options:
127
 
128
+ A. 3
 
 
129
 
130
+ B. -3
131
 
132
+ C. 5
 
133
 
134
+ D. -5
135
 
136
+
137
+ Answer: -5'
138
+ sentences:
139
+ - Write a given translation in vector notation
140
+ - Thinks the movement of the top number in a column vector is vertical, and the
141
+ bottom number is horizontal rather than the other way round
142
+ - Three left and five down in vector notation is... triangle bigcirc What should
143
+ replace the triangle ?
144
  ---
145
 
146
  # SentenceTransformer based on Alibaba-NLP/gte-base-en-v1.5
 
193
  model = SentenceTransformer("Gurveer05/gte-base-eedi-2024")
194
  # Run inference
195
  sentences = [
196
+ 'Construct: Write a given translation in vector notation.\n\nQuestion: Three left and five down in vector notation is... triangle bigcirc What should replace the triangle ?\n\nOptions:\nA. 3\nB. -3\nC. 5\nD. -5\n\nAnswer: -5',
197
+ 'Thinks the movement of the top number in a column vector is vertical, and the bottom number is horizontal rather than the other way round',
198
+ 'Write a given translation in vector notation',
199
  ]
200
  embeddings = model.encode(sentences)
201
  print(embeddings.shape)
 
250
  #### csv
251
 
252
  * Dataset: csv
253
+ * Size: 2,632 training samples
254
  * Columns: <code>qa_pair_text</code>, <code>MisconceptionName</code>, <code>ConstructName</code>, and <code>QuestionText</code>
255
  * Approximate statistics based on the first 1000 samples:
256
  | | qa_pair_text | MisconceptionName | ConstructName | QuestionText |
257
  |:--------|:------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
258
  | type | string | string | string | string |
259
+ | details | <ul><li>min: 35 tokens</li><li>mean: 92.39 tokens</li><li>max: 507 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 15.37 tokens</li><li>max: 39 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 14.01 tokens</li><li>max: 42 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 38.59 tokens</li><li>max: 274 tokens</li></ul> |
260
  * Samples:
261
+ | qa_pair_text | MisconceptionName | ConstructName | QuestionText |
262
+ |:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------|
263
+ | <code>Construct: Given the name of a polygon, draw or identify a corresponding diagram.<br><br>Question: Which of the following shapes is a hexagon?<br><br>Options:<br>A. An irregular, five sided shape<br>B. A regular, five sided shape<br>C. A regular, eight sided shape<br>D. None of these<br><br>Answer: A regular, eight sided shape</code> | <code>Confuses octagon and hexagon</code> | <code>Given the name of a polygon, draw or identify a corresponding diagram</code> | <code>Which of the following shapes is a hexagon?</code> |
264
+ | <code>Construct: Rearrange formulae to change the subject where the subject appears once and one step is needed.<br><br>Question: Step 2 <br><br>Which is the correct next step rearrangement of the following formula, making u the subject?<br>[<br>7 t-13=u(5+2 t)<br>].<br><br>Options:<br>A. u=5 t-8<br>B. u=(7 t-13 / 5+2 t)<br>C. u=(5+2 t / 7 t-13)<br>D. u=9 t-8<br><br>Answer: u=9 t-8</code> | <code>Misunderstands the process of making a variable the subject of an equation</code> | <code>Rearrange formulae to change the subject where the subject appears once and one step is needed</code> | <code>Step 2 <br><br>Which is the correct next step rearrangement of the following formula, making u the subject?<br>[<br>7 t-13=u(5+2 t)<br>]</code> |
265
+ | <code>Construct: Divide proper fractions in the form: Fraction ÷ Integer.<br><br>Question: Calculate<br>(<br>(2 / 5) / 3<br>).<br><br>Options:<br>A. (6 / 5)<br>B. (15 / 2)<br>C. (2 / 15)<br>D. (6 / 15)<br><br>Answer: (6 / 5)</code> | <code>when dividing a fraction by an integer, multiplies the numerator by the integer</code> | <code>Divide proper fractions in the form: Fraction ÷ Integer</code> | <code>Calculate<br>(<br>(2 / 5) / 3<br>)</code> |
266
  * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
267
  ```json
268
  {
 
276
  #### csv
277
 
278
  * Dataset: csv
279
+ * Size: 1,738 evaluation samples
280
  * Columns: <code>qa_pair_text</code>, <code>MisconceptionName</code>, <code>ConstructName</code>, and <code>QuestionText</code>
281
  * Approximate statistics based on the first 1000 samples:
282
  | | qa_pair_text | MisconceptionName | ConstructName | QuestionText |
283
  |:--------|:------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
284
  | type | string | string | string | string |
285
+ | details | <ul><li>min: 35 tokens</li><li>mean: 95.35 tokens</li><li>max: 903 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 14.36 tokens</li><li>max: 40 tokens</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 14.44 tokens</li><li>max: 40 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 37.54 tokens</li><li>max: 212 tokens</li></ul> |
286
  * Samples:
287
+ | qa_pair_text | MisconceptionName | ConstructName | QuestionText |
288
+ |:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
289
+ | <code>Construct: Add proper fractions with different denominators which do not share a common factor.<br><br>Question: (<br>(1 / 3)+(1 / 10)=(* / 30)<br>)<br>What should replace the star?<br><br>Options:<br>A. 13<br>B. 2<br>C. 4<br>D. 11<br><br>Answer: 2</code> | <code>Does not find a common denominator when adding/subtracting fractions</code> | <code>Add proper fractions with different denominators which do not share a common factor</code> | <code>(<br>(1 / 3)+(1 / 10)=(* / 30)<br>)<br>What should replace the star?</code> |
290
+ | <code>Construct: Round decimals to the nearest whole number.<br><br>Question: What is 9 0 . 6 8 2 rounded to the nearest integer?<br><br>Options:<br>A. 90<br>B. 90.7<br>C. 91<br>D. 92<br><br>Answer: 90.7</code> | <code>Rounds to the wrong degree of accuracy (rounds too much)</code> | <code>Round decimals to the nearest whole number</code> | <code>What is 9 0 . 6 8 2 rounded to the nearest integer?</code> |
291
+ | <code>Construct: Subtract fractions involving mixed number fractions with different denominators.<br><br>Question: (<br>2 (2 / 5)-(1 / 15)=2 (1 / *)<br>)<br>What should replace the star?<br><br>Options:<br>A. 10<br>B. 15<br>C. 5<br>D. 3<br><br>Answer: 10</code> | <code>When subtracting fractions, subtracts the numerators and denominators</code> | <code>Subtract fractions involving mixed number fractions with different denominators</code> | <code>(<br>2 (2 / 5)-(1 / 15)=2 (1 / *)<br>)<br>What should replace the star?</code> |
292
  * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
293
  ```json
294
  {
 
429
  </details>
430
 
431
  ### Training Logs
432
+ | Epoch | Step | Training Loss | loss |
433
+ |:----------:|:-------:|:-------------:|:----------:|
434
+ | 0.2545 | 42 | 4.1578 | - |
435
+ | 0.5030 | 83 | - | 1.0737 |
436
+ | 0.5091 | 84 | 1.4123 | - |
437
+ | 0.7636 | 126 | 0.7791 | - |
438
+ | 1.0061 | 166 | - | 0.7048 |
439
+ | 1.0182 | 168 | 0.5538 | - |
440
+ | 1.2727 | 210 | 0.5208 | - |
441
+ | **1.5091** | **249** | **-** | **0.5806** |
442
+ | 1.5273 | 252 | 0.428 | - |
443
+ | 1.7818 | 294 | 0.3175 | - |
444
+ | 2.0121 | 332 | - | 0.6534 |
445
+ | 2.0364 | 336 | 0.2301 | - |
446
+ | 2.2909 | 378 | 0.2247 | - |
447
+ | 2.5152 | 415 | - | 0.6442 |
 
 
448
 
449
  * The bold row denotes the saved checkpoint.
450
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0e86404d491b64c16bd9f3112a8b6abea6c8ae1efb0aefaaf5cb170bf4b5dff0
3
  size 547119128
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bfd2d15951004574fa79cd6f336c5bafaea48639523392e277aaa60efd5f8ca8
3
  size 547119128