IPA-Transcription-EN

Running

arunasrivastava commited on 25 days ago

Commit

9926870

1 Parent(s): c8d97f4

update panphon

Files changed (2) hide show

app.py CHANGED Viewed

@@ -153,6 +153,13 @@ with gr.Blocks(css="""
     - **PER (Phoneme Error Rate)**: The Levenshtein distance calculated between phoneme sequences of the predicted and actual transcriptions.
     - **PWED (Phoneme Weighted Edit Distance)**: A measure of the edit distance between the predicted and actual phoneme sequences, weighted by the phonemic feature distance. Feature vectors provided by panphon library
     """)
     with gr.Tabs():
         with gr.TabItem("🏆 Leaderboard"):
             leaderboard_html = gr.HTML(create_html_table(format_leaderboard_df(load_leaderboard_data())))

     - **PER (Phoneme Error Rate)**: The Levenshtein distance calculated between phoneme sequences of the predicted and actual transcriptions.
     - **PWED (Phoneme Weighted Edit Distance)**: A measure of the edit distance between the predicted and actual phoneme sequences, weighted by the phonemic feature distance. Feature vectors provided by panphon library
     """)
+    gr.Markdown("""
+    ## Test Set Information
+    The test set used for evaluation is from the [TIMIT speech corpus](https://www.kaggle.com/datasets/mfekadu/darpa-timit-acousticphonetic-continuous-speech). The TIMIT corpus is a widely used dataset for speech recognition research.
+    ## Processing Time
+    Please note that processing will take around 2 minutes.
+    """)
     with gr.Tabs():
         with gr.TabItem("🏆 Leaderboard"):
             leaderboard_html = gr.HTML(create_html_table(format_leaderboard_df(load_leaderboard_data())))

requirements.txt CHANGED Viewed

@@ -5,7 +5,7 @@ torchvision==0.15.2
 transformers==4.46.3
 tokenizers>=0.20,<0.21
 safetensors>=0.4.1
-evaluate==0.4.0
 gradio==5.7.1
 huggingface-hub==0.25.1
 panphon==0.21

 transformers==4.46.3
 tokenizers>=0.20,<0.21
 safetensors>=0.4.1
+evaluate==0.4.3
 gradio==5.7.1
 huggingface-hub==0.25.1
 panphon==0.21