dslee2601 commited on
Commit
3aa23f4
·
verified ·
1 Parent(s): 87da80c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -23,6 +23,8 @@ z = model('path/audio_fname.wav') # (768, 8, 64) = (latent_dim_size, latent_fre
23
  Depending on a task, a different pooling strategy should be facilitated.
24
  For instance, a global average pooling can be used for a classification task. [2] uses an adaptive pooling.
25
 
 
 
26
 
27
  # Sanity Check Result
28
  In the following, a spectrogram of an input audio and corresponding $z$ are visualized.
 
23
  Depending on a task, a different pooling strategy should be facilitated.
24
  For instance, a global average pooling can be used for a classification task. [2] uses an adaptive pooling.
25
 
26
+ ⚠️ AudioMAE accepts audio with maximum length of 10s (as described in [1]). Any audio longer than 10s will be clipped to 10s, meaning the excess beyond 10s will be discarded.
27
+
28
 
29
  # Sanity Check Result
30
  In the following, a spectrogram of an input audio and corresponding $z$ are visualized.