5roop committed on
Commit fefb572 · 1 Parent(s): 37c8654

Add limitations and a caption for the gif.

Files changed (1)
  1. README.md +6 -0
README.md CHANGED
@@ -67,6 +67,12 @@ In this fashion we obtain the following metrics:
  ![A gif illustrating correspondence between true and predicted prosodic
  units](output.gif)
 
+ As the gif above shows, true (blue) and predicted (orange) prosodic units generally correspond well, but the grouping is sometimes incorrect: the model annotates a single prosodic unit where a human annotator would annotate two or more.
+
+ ### Known limitations
+
+ * Edge cases: if the input audio starts or ends in the middle of a prosodic unit, that truncated first or last unit is likely to go undetected.
+ * Unknown behaviour on non-speech audio: at the time of writing, the model has not been tested on music, noise, pure sine tones, or other non-speech input.
  ## Uses
 
  ### Simple use (short files)
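
The grouping error described in the added paragraph (one predicted unit where a human annotator marks two or more) can be checked for mechanically. The following is a minimal sketch, not part of the repository: it assumes that true and predicted prosodic units are available as `(start, end)` pairs in seconds, and the function name `find_merged_units` and the `min_overlap_s` threshold are hypothetical.

```python
from typing import List, Tuple

# Hypothetical representation: one prosodic unit = (start_s, end_s) in seconds.
Interval = Tuple[float, float]


def find_merged_units(
    true_units: List[Interval],
    pred_units: List[Interval],
    min_overlap_s: float = 0.05,  # assumed tolerance, not taken from the README
) -> List[Tuple[Interval, List[Interval]]]:
    """Return each predicted unit that overlaps two or more true units,
    together with the true units it covers."""
    merged = []
    for p_start, p_end in pred_units:
        covered = [
            (t_start, t_end)
            for t_start, t_end in true_units
            # Length of the intersection of the two intervals (negative if disjoint).
            if min(p_end, t_end) - max(p_start, t_start) >= min_overlap_s
        ]
        if len(covered) >= 2:
            merged.append(((p_start, p_end), covered))
    return merged


# Made-up intervals for illustration only.
true_units = [(0.0, 1.2), (1.3, 2.5), (3.0, 4.1)]
pred_units = [(0.0, 2.4), (3.0, 4.0)]
merged = find_merged_units(true_units, pred_units)
for pred, covered in merged:
    print(f"predicted unit {pred} spans {len(covered)} true units: {covered}")
```

With these illustrative intervals, only the first predicted unit is reported, because it spans the first two true units. The overlap threshold keeps units that merely touch at a boundary from being counted as merged.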