Add limitations and a caption for the gif.
Browse files
README.md
CHANGED
@@ -67,6 +67,12 @@ In this fashion we obtain the following metrics:
|
|
67 |
![A gif illustrating correspondance between true and predicted prosodic
|
68 |
units](output.gif)
|
69 |
|
|
|
|
|
|
|
|
|
|
|
|
|
70 |
## Uses
|
71 |
|
72 |
### Simple use (short files)
|
|
|
67 |
![A gif illustrating correspondance between true and predicted prosodic
|
68 |
units](output.gif)
|
69 |
|
70 |
+
As seen in the gif image above, we observe generally good correspondence between true (blue) and predicted (orange) prosodic units, but there are cases where the grouping is incorrect: the model will annotate only a single prosodic unit where a human annotator would annotate two or more.
|
71 |
+
|
72 |
+
### Known limitations
|
73 |
+
|
74 |
+
* Edge cases: if the input audio starts or ends within a prosodic unit, there is a high chance of not detecting the ending or starting prosodic unit.
|
75 |
+
* Unknown behaviour on non-speech audio: as of the time of writing, no tests were performed to check what happens in cases of music, noise, pure sine, ...
|
76 |
## Uses
|
77 |
|
78 |
### Simple use (short files)
|