arunasrivastava commited on
Commit
f06db26
·
1 Parent(s): 381f539

test set specificaiton

Browse files
Files changed (5) hide show
  1. README-github.md +124 -0
  2. main.py +1 -1
  3. queue/leaderboard.json +19 -19
  4. queue/results.json +22 -22
  5. queue/tasks.json +26 -46
README-github.md ADDED
@@ -0,0 +1,124 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # 🎯 Phonemic Transcription Leaderboard
2
+
3
+ Welcome to the Phonemic Transcription Leaderboard! This simple leaderboard helps you track and compare the performance of different speech-to-phoneme model. Feel free to use it for your own hugging face leaderboards!
4
+
5
+ ![leaderboard](img/leaderboard.png)
6
+
7
+ ## ✨ Features
8
+
9
+ * 📊 Interactive leaderboard with real-time sorting
10
+ * 🔄 Easy model submission system
11
+ * 📈 Automatic evaluation of submitted models
12
+ * 📱 Responsive design that works on all devices
13
+
14
+ ## 🎯 What This Project Does
15
+
16
+ This leaderboard tracks two key metrics for phonemic transcription models:
17
+
18
+ * **PER (Phoneme Error Rate)**: How accurately your model converts speech to phonemes
19
+ * **PWED (Phoneme Weighted Edit Distance)**: A more nuanced metric that considers phonemic features
20
+
21
+ Models are evaluated on the TIMIT speech corpus, a gold standard in speech recognition research.
22
+
23
+ ## 🚀 Getting Started
24
+
25
+ ### Prerequisites
26
+
27
+ * Python 3.10
28
+ * Git
29
+ * A love for speech recognition! 🎤
30
+
31
+ ### Quick Installation
32
+
33
+ 1. Clone this repository:
34
+
35
+ ```bash
36
+ git clone [your-repo-url]
37
+ cd phonemic-leaderboard
38
+ ```
39
+
40
+ 2. Set up your environment:
41
+
42
+ ```bash
43
+ # Create a virtual environment with Python 3.10
44
+ python3.10 -m venv venv
45
+
46
+ # Activate the virtual environment
47
+ source venv/bin/activate
48
+
49
+ # Install the required dependencies
50
+ pip install -r requirements.txt
51
+ ```
52
+
53
+ 3. Launch the leaderboard:
54
+
55
+ ```bash
56
+ # Run the application
57
+ uvicorn app:app --host 0.0.0.0 --port 7860
58
+ ```
59
+
60
+ 4. Visit `http://localhost:7860` in your browser and see the magic! ✨
61
+
62
+ ## 🎮 Using the Leaderboard
63
+
64
+ ### Submitting a Model
65
+
66
+ 1. Go to the "Submit Model" tab
67
+ 2. Enter your model details:
68
+ * Model name (e.g., "wav2vec2-phoneme-wizard")
69
+ * Submission name (e.g., "MyAwesomeModel v1.0")
70
+ * GitHub/Kaggle/HuggingFace URL (optional)
71
+ 3. Click Submit and watch your model climb the ranks! 🚀
72
+
73
+ ### Checking Model Status
74
+
75
+ 1. Navigate to the "Model Status" tab
76
+ 2. Enter your model name or task ID
77
+ 3. Get real-time updates on your model's evaluation progress
78
+
79
+ ## 📊 Understanding the Results
80
+
81
+ The leaderboard shows:
82
+
83
+ * Model names and submission details
84
+ * PER and PWED scores (lower is better!)
85
+ * Links to model repositories
86
+ * Submission dates
87
+
88
+ Sort by either metric to see who's leading the pack!
89
+
90
+ ## 🛠️ Technical Details
91
+
92
+ * Built with Gradio for a smooth UI experience
93
+ * Runs on a basic compute plan (16GB RAM, 2vCPUs) for easy reproducibility
94
+ * Evaluation can take several hours - perfect time to grab a coffee ☕
95
+
96
+ ## 🤝 Contributing
97
+
98
+ Want to make this leaderboard even better? We'd love your help! Here are some ways you can contribute:
99
+
100
+ * Add new evaluation metrics
101
+ * Improve the UI design
102
+ * Enhance documentation
103
+ * Submit bug fixes
104
+ * Add new features
105
+
106
+ ## 📝 License
107
+
108
+ This project is licensed under the MIT License - see the LICENSE file for details.
109
+
110
+ ## 🌟 Acknowledgments
111
+
112
+ * Thanks to the TIMIT speech corpus for providing evaluation data
113
+ * Shoutout to the [panphon library](https://github.com/dmort27/panphon) for PWED calculations
114
+ * Built with love by Koel Labs 💙
115
+
116
+ ## 🆘 Need Help?
117
+
118
+ Got questions? Found a bug? Want to contribute? Open an issue or reach out to us! We're here to help make speech recognition evaluation fun and accessible for everyone!
119
+
120
+ Remember: Every great model deserves its moment to shine! 🌟
121
+
122
+ ---
123
+
124
+ Happy Transcribing! 🎤✨
main.py CHANGED
@@ -298,7 +298,7 @@ class StorageManager:
298
  class EvaluationRequest(BaseModel):
299
  """Request model for TIMIT evaluation"""
300
  transcription_model: str
301
- subset: str = "test"
302
  max_samples: Optional[int] = None
303
  submission_name: str
304
  github_url: Optional[str] = None
 
298
  class EvaluationRequest(BaseModel):
299
  """Request model for TIMIT evaluation"""
300
  transcription_model: str
301
+ subset: str = "timit-test"
302
  max_samples: Optional[int] = None
303
  submission_name: str
304
  github_url: Optional[str] = None
queue/leaderboard.json CHANGED
@@ -5,7 +5,7 @@
5
  "model": "facebook/wav2vec2-lv-60-espeak-cv-ft",
6
  "average_per": 0.33667301260691423,
7
  "average_pwed": 0.1276725657099669,
8
- "subset": "test",
9
  "github_url": "https://github.com/facebookresearch/fairseq/blob/main/examples/wav2vec/README.md",
10
  "submission_date": "2024-12-05T07:32:06.850230"
11
  },
@@ -15,7 +15,7 @@
15
  "model": "KoelLabs/xlsr-timit-b0",
16
  "average_per": 0.12572285528714347,
17
  "average_pwed": 0.06476636812791145,
18
- "subset": "test",
19
  "github_url": "https://github.com/KoelLabs/",
20
  "submission_date": "2024-12-05T08:25:24.982477"
21
  },
@@ -25,7 +25,7 @@
25
  "model": "speech31/wav2vec2-large-TIMIT-IPA",
26
  "average_per": 0.4415425496841929,
27
  "average_pwed": 0.18625930002594002,
28
- "subset": "test",
29
  "github_url": "https://huggingface.co/speech31/wav2vec2-large-TIMIT-IPA2",
30
  "submission_date": "2024-12-05T09:36:14.570315"
31
  },
@@ -35,7 +35,7 @@
35
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5",
36
  "average_per": 0.6318471187460027,
37
  "average_pwed": 0.222932144739126,
38
- "subset": "test",
39
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5WithoutSpaces/tree/d5312009d8e620b183c334dfdd9ffc6b4f06f8c1",
40
  "submission_date": "2024-12-05T10:17:21.334530"
41
  },
@@ -45,7 +45,7 @@
45
  "model": "facebook/wav2vec2-xlsr-53-espeak-cv-ft",
46
  "average_per": 0.348845592557092,
47
  "average_pwed": 0.1386742019529415,
48
- "subset": "test",
49
  "github_url": "https://github.com/facebookresearch/fairseq/blob/main/examples/wav2vec/README.md",
50
  "submission_date": "2024-12-05T10:34:26.157054"
51
  },
@@ -55,7 +55,7 @@
55
  "model": "ginic/hyperparam_tuning_1_wav2vec2-large-xlsr-buckeye-ipa",
56
  "average_per": 0.2766466385175833,
57
  "average_pwed": 0.10410683992600853,
58
- "subset": "test",
59
  "github_url": "https://huggingface.co/ginic/vary_individuals_old_only_1_wav2vec2-large-xlsr-buckeye-ipa",
60
  "submission_date": "2024-12-05T11:06:07.984825"
61
  },
@@ -65,7 +65,7 @@
65
  "model": "KoelLabs/xlsr-timit-a0",
66
  "average_per": 0.24242141955346685,
67
  "average_pwed": 0.17395311976938,
68
- "subset": "test",
69
  "github_url": "https://github.com/KoelLabs/ML/",
70
  "submission_date": "2024-12-12T16:07:25.391145"
71
  },
@@ -75,7 +75,7 @@
75
  "model": "snu-nia-12/wav2vec2-large_nia12_phone-ipa_english",
76
  "average_per": 0.4847029843149011,
77
  "average_pwed": 0.2072006544586948,
78
- "subset": "test",
79
  "github_url": null,
80
  "submission_date": "2024-12-18T22:01:20.855881"
81
  },
@@ -85,7 +85,7 @@
85
  "model": "vitouphy/wav2vec2-xls-r-300m-timit-phoneme",
86
  "average_per": 0.2561961414705681,
87
  "average_pwed": 0.1378394393452702,
88
- "subset": "test",
89
  "github_url": "https://www.kaggle.com/code/vitouphy/phoneme-recognition-with-wav2vec2",
90
  "submission_date": "2024-12-18T22:50:59.627338"
91
  },
@@ -95,7 +95,7 @@
95
  "model": "ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa-plus-2000",
96
  "average_per": 0.6479484324708775,
97
  "average_pwed": 0.18710002665151734,
98
- "subset": "test",
99
  "github_url": "https://huggingface.co/ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
100
  "submission_date": "2024-12-18T23:29:27.322286"
101
  },
@@ -105,7 +105,7 @@
105
  "model": "ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
106
  "average_per": 0.6417205190285036,
107
  "average_pwed": 0.19048963968896404,
108
- "subset": "test",
109
  "github_url": "https://huggingface.co/ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
110
  "submission_date": "2024-12-19T07:41:18.135985"
111
  },
@@ -115,7 +115,7 @@
115
  "model": "ginic/gender_split_70_female_4_wav2vec2-large-xlsr-buckeye-ipa",
116
  "average_per": 0.2810165988557621,
117
  "average_pwed": 0.10703377161801164,
118
- "subset": "test",
119
  "github_url": "https://github.com/ginic/multipa/tree/buckeye_experiments",
120
  "submission_date": "2024-12-20T13:45:52.010575"
121
  },
@@ -125,7 +125,7 @@
125
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.9.2WithoutSpaces",
126
  "average_per": 0.9537775908999574,
127
  "average_pwed": 0.9351204819224959,
128
- "subset": "test",
129
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5WithoutSpaces",
130
  "submission_date": "2024-12-20T14:21:32.293694"
131
  },
@@ -135,7 +135,7 @@
135
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-nl",
136
  "average_per": 0.9887075544197294,
137
  "average_pwed": 0.9692486915717254,
138
- "subset": "test",
139
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-nl1.1",
140
  "submission_date": "2024-12-20T15:40:51.632895"
141
  },
@@ -145,7 +145,7 @@
145
  "model": "speech31/wavlm-large-english-ipa",
146
  "average_per": 0.3694017596969614,
147
  "average_pwed": 0.1356824900612308,
148
- "subset": "test",
149
  "github_url": "https://huggingface.co/speech31/wavlm-large-english-ipa",
150
  "submission_date": "2024-12-20T16:26:47.982209"
151
  },
@@ -155,7 +155,7 @@
155
  "model": "speech31/XLS-R-300m-english-ipa",
156
  "average_per": 0.36382554692045954,
157
  "average_pwed": 0.1299702312124616,
158
- "subset": "test",
159
  "github_url": "https://huggingface.co/speech31/XLS-R-300m-english-ipa",
160
  "submission_date": "2024-12-20T16:47:54.826509"
161
  },
@@ -165,7 +165,7 @@
165
  "model": "speech31/wav2vec2-large-english-TIMIT-phoneme_v3",
166
  "average_per": 0.44563344149564776,
167
  "average_pwed": 0.18844914029048124,
168
- "subset": "test",
169
  "github_url": "https://huggingface.co/speech31/wav2vec2-large-english-TIMIT-phoneme_v3",
170
  "submission_date": "2024-12-20T17:05:35.213738"
171
  },
@@ -175,7 +175,7 @@
175
  "model": "speech31/wav2vec2-large-TIMIT-IPA2",
176
  "average_per": 0.4847029843149011,
177
  "average_pwed": 0.2072006544586948,
178
- "subset": "test",
179
  "github_url": "https://huggingface.co/speech31/wav2vec2-large-TIMIT-IPA2",
180
  "submission_date": "2024-12-20T22:50:50.645178"
181
  },
@@ -185,7 +185,7 @@
185
  "model": "ginic/vary_individuals_young_only_3_wav2vec2-large-xlsr-buckeye-ipa",
186
  "average_per": 0.2807914104790719,
187
  "average_pwed": 0.10494355278037441,
188
- "subset": "test",
189
  "github_url": "https://huggingface.co/ginic/vary_individuals_young_only_3_wav2vec2-large-xlsr-buckeye-ipa",
190
  "submission_date": "2024-12-21T01:31:04.862397"
191
  }
 
5
  "model": "facebook/wav2vec2-lv-60-espeak-cv-ft",
6
  "average_per": 0.33667301260691423,
7
  "average_pwed": 0.1276725657099669,
8
+ "subset": "timit-test",
9
  "github_url": "https://github.com/facebookresearch/fairseq/blob/main/examples/wav2vec/README.md",
10
  "submission_date": "2024-12-05T07:32:06.850230"
11
  },
 
15
  "model": "KoelLabs/xlsr-timit-b0",
16
  "average_per": 0.12572285528714347,
17
  "average_pwed": 0.06476636812791145,
18
+ "subset": "timit-test",
19
  "github_url": "https://github.com/KoelLabs/",
20
  "submission_date": "2024-12-05T08:25:24.982477"
21
  },
 
25
  "model": "speech31/wav2vec2-large-TIMIT-IPA",
26
  "average_per": 0.4415425496841929,
27
  "average_pwed": 0.18625930002594002,
28
+ "subset": "timit-test",
29
  "github_url": "https://huggingface.co/speech31/wav2vec2-large-TIMIT-IPA2",
30
  "submission_date": "2024-12-05T09:36:14.570315"
31
  },
 
35
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5",
36
  "average_per": 0.6318471187460027,
37
  "average_pwed": 0.222932144739126,
38
+ "subset": "timit-test",
39
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5WithoutSpaces/tree/d5312009d8e620b183c334dfdd9ffc6b4f06f8c1",
40
  "submission_date": "2024-12-05T10:17:21.334530"
41
  },
 
45
  "model": "facebook/wav2vec2-xlsr-53-espeak-cv-ft",
46
  "average_per": 0.348845592557092,
47
  "average_pwed": 0.1386742019529415,
48
+ "subset": "timit-test",
49
  "github_url": "https://github.com/facebookresearch/fairseq/blob/main/examples/wav2vec/README.md",
50
  "submission_date": "2024-12-05T10:34:26.157054"
51
  },
 
55
  "model": "ginic/hyperparam_tuning_1_wav2vec2-large-xlsr-buckeye-ipa",
56
  "average_per": 0.2766466385175833,
57
  "average_pwed": 0.10410683992600853,
58
+ "subset": "timit-test",
59
  "github_url": "https://huggingface.co/ginic/vary_individuals_old_only_1_wav2vec2-large-xlsr-buckeye-ipa",
60
  "submission_date": "2024-12-05T11:06:07.984825"
61
  },
 
65
  "model": "KoelLabs/xlsr-timit-a0",
66
  "average_per": 0.24242141955346685,
67
  "average_pwed": 0.17395311976938,
68
+ "subset": "timit-test",
69
  "github_url": "https://github.com/KoelLabs/ML/",
70
  "submission_date": "2024-12-12T16:07:25.391145"
71
  },
 
75
  "model": "snu-nia-12/wav2vec2-large_nia12_phone-ipa_english",
76
  "average_per": 0.4847029843149011,
77
  "average_pwed": 0.2072006544586948,
78
+ "subset": "timit-test",
79
  "github_url": null,
80
  "submission_date": "2024-12-18T22:01:20.855881"
81
  },
 
85
  "model": "vitouphy/wav2vec2-xls-r-300m-timit-phoneme",
86
  "average_per": 0.2561961414705681,
87
  "average_pwed": 0.1378394393452702,
88
+ "subset": "timit-test",
89
  "github_url": "https://www.kaggle.com/code/vitouphy/phoneme-recognition-with-wav2vec2",
90
  "submission_date": "2024-12-18T22:50:59.627338"
91
  },
 
95
  "model": "ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa-plus-2000",
96
  "average_per": 0.6479484324708775,
97
  "average_pwed": 0.18710002665151734,
98
+ "subset": "timit-test",
99
  "github_url": "https://huggingface.co/ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
100
  "submission_date": "2024-12-18T23:29:27.322286"
101
  },
 
105
  "model": "ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
106
  "average_per": 0.6417205190285036,
107
  "average_pwed": 0.19048963968896404,
108
+ "subset": "timit-test",
109
  "github_url": "https://huggingface.co/ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
110
  "submission_date": "2024-12-19T07:41:18.135985"
111
  },
 
115
  "model": "ginic/gender_split_70_female_4_wav2vec2-large-xlsr-buckeye-ipa",
116
  "average_per": 0.2810165988557621,
117
  "average_pwed": 0.10703377161801164,
118
+ "subset": "timit-test",
119
  "github_url": "https://github.com/ginic/multipa/tree/buckeye_experiments",
120
  "submission_date": "2024-12-20T13:45:52.010575"
121
  },
 
125
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.9.2WithoutSpaces",
126
  "average_per": 0.9537775908999574,
127
  "average_pwed": 0.9351204819224959,
128
+ "subset": "timit-test",
129
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5WithoutSpaces",
130
  "submission_date": "2024-12-20T14:21:32.293694"
131
  },
 
135
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-nl",
136
  "average_per": 0.9887075544197294,
137
  "average_pwed": 0.9692486915717254,
138
+ "subset": "timit-test",
139
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-nl1.1",
140
  "submission_date": "2024-12-20T15:40:51.632895"
141
  },
 
145
  "model": "speech31/wavlm-large-english-ipa",
146
  "average_per": 0.3694017596969614,
147
  "average_pwed": 0.1356824900612308,
148
+ "subset": "timit-test",
149
  "github_url": "https://huggingface.co/speech31/wavlm-large-english-ipa",
150
  "submission_date": "2024-12-20T16:26:47.982209"
151
  },
 
155
  "model": "speech31/XLS-R-300m-english-ipa",
156
  "average_per": 0.36382554692045954,
157
  "average_pwed": 0.1299702312124616,
158
+ "subset": "timit-test",
159
  "github_url": "https://huggingface.co/speech31/XLS-R-300m-english-ipa",
160
  "submission_date": "2024-12-20T16:47:54.826509"
161
  },
 
165
  "model": "speech31/wav2vec2-large-english-TIMIT-phoneme_v3",
166
  "average_per": 0.44563344149564776,
167
  "average_pwed": 0.18844914029048124,
168
+ "subset": "timit-test",
169
  "github_url": "https://huggingface.co/speech31/wav2vec2-large-english-TIMIT-phoneme_v3",
170
  "submission_date": "2024-12-20T17:05:35.213738"
171
  },
 
175
  "model": "speech31/wav2vec2-large-TIMIT-IPA2",
176
  "average_per": 0.4847029843149011,
177
  "average_pwed": 0.2072006544586948,
178
+ "subset": "timit-test",
179
  "github_url": "https://huggingface.co/speech31/wav2vec2-large-TIMIT-IPA2",
180
  "submission_date": "2024-12-20T22:50:50.645178"
181
  },
 
185
  "model": "ginic/vary_individuals_young_only_3_wav2vec2-large-xlsr-buckeye-ipa",
186
  "average_per": 0.2807914104790719,
187
  "average_pwed": 0.10494355278037441,
188
+ "subset": "timit-test",
189
  "github_url": "https://huggingface.co/ginic/vary_individuals_young_only_3_wav2vec2-large-xlsr-buckeye-ipa",
190
  "submission_date": "2024-12-21T01:31:04.862397"
191
  }
queue/results.json CHANGED
@@ -2,7 +2,7 @@
2
  {
3
  "task_id": "721b4c64-a825-42d3-bb0a-bdff9ee1ed0f",
4
  "model": "facebook/wav2vec2-lv-60-espeak-cv-ft",
5
- "subset": "test",
6
  "num_files": 1680,
7
  "average_per": 0.33667301260691423,
8
  "average_pwed": 0.1276725657099669,
@@ -48,7 +48,7 @@
48
  {
49
  "task_id": "d6fe0956-b5b4-4105-835e-8dee1872ee4d",
50
  "model": "KoelLabs/xlsr-timit-b0",
51
- "subset": "test",
52
  "num_files": 1680,
53
  "average_per": 0.12572285528714347,
54
  "average_pwed": 0.06476636812791145,
@@ -94,7 +94,7 @@
94
  {
95
  "task_id": "dbf4642a-fb13-402c-8a74-cc41fc4be599",
96
  "model": "speech31/wav2vec2-large-TIMIT-IPA",
97
- "subset": "test",
98
  "num_files": 1680,
99
  "average_per": 0.4415425496841929,
100
  "average_pwed": 0.18625930002594002,
@@ -140,7 +140,7 @@
140
  {
141
  "task_id": "912449a4-d7ed-4af4-b5be-5c2c57ec09ff",
142
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5",
143
- "subset": "test",
144
  "num_files": 1680,
145
  "average_per": 0.6318471187460027,
146
  "average_pwed": 0.222932144739126,
@@ -186,7 +186,7 @@
186
  {
187
  "task_id": "c79df17e-2bb2-4253-ae26-f7cc6ab21265",
188
  "model": "facebook/wav2vec2-xlsr-53-espeak-cv-ft",
189
- "subset": "test",
190
  "num_files": 1680,
191
  "average_per": 0.348845592557092,
192
  "average_pwed": 0.1386742019529415,
@@ -232,7 +232,7 @@
232
  {
233
  "task_id": "f36060e6-a746-44dc-a527-54995b270053",
234
  "model": "ginic/hyperparam_tuning_1_wav2vec2-large-xlsr-buckeye-ipa",
235
- "subset": "test",
236
  "num_files": 1680,
237
  "average_per": 0.2766466385175833,
238
  "average_pwed": 0.10410683992600853,
@@ -278,7 +278,7 @@
278
  {
279
  "task_id": "47d56349-8111-4bda-a47f-e007dbedd36d",
280
  "model": "KoelLabs/xlsr-timit-a0",
281
- "subset": "test",
282
  "num_files": 1680,
283
  "average_per": 0.24242141955346685,
284
  "average_pwed": 0.17395311976938,
@@ -324,7 +324,7 @@
324
  {
325
  "task_id": "51dd5735-63bd-4fe5-a588-c0fc079076e0",
326
  "model": "KoelLabs/xlsr-timit-a0",
327
- "subset": "test",
328
  "num_files": 1680,
329
  "average_per": 0.24242141955346685,
330
  "average_pwed": 0.17395311976938,
@@ -370,7 +370,7 @@
370
  {
371
  "task_id": "2e592612-ca38-4afb-a6a0-3c870b288960",
372
  "model": "snu-nia-12/wav2vec2-large_nia12_phone-ipa_english",
373
- "subset": "test",
374
  "num_files": 1680,
375
  "average_per": 0.4847029843149011,
376
  "average_pwed": 0.2072006544586948,
@@ -416,7 +416,7 @@
416
  {
417
  "task_id": "d38e65ce-75b5-4dbf-8ade-bff6a5803790",
418
  "model": "vitouphy/wav2vec2-xls-r-300m-timit-phoneme",
419
- "subset": "test",
420
  "num_files": 1680,
421
  "average_per": 0.2561961414705681,
422
  "average_pwed": 0.1378394393452702,
@@ -462,7 +462,7 @@
462
  {
463
  "task_id": "2839c0c6-8f3b-426e-9eb7-04b6e133dc47",
464
  "model": "ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa-plus-2000",
465
- "subset": "test",
466
  "num_files": 1680,
467
  "average_per": 0.6479484324708775,
468
  "average_pwed": 0.18710002665151734,
@@ -508,7 +508,7 @@
508
  {
509
  "task_id": "59afc37a-0072-44dd-a02a-0cf47d89c120",
510
  "model": "ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
511
- "subset": "test",
512
  "num_files": 1680,
513
  "average_per": 0.6417205190285036,
514
  "average_pwed": 0.19048963968896404,
@@ -554,7 +554,7 @@
554
  {
555
  "task_id": "5517f6b2-6a76-4a2d-a6ce-33446f390c3b",
556
  "model": "ginic/gender_split_70_female_4_wav2vec2-large-xlsr-buckeye-ipa",
557
- "subset": "test",
558
  "num_files": 1680,
559
  "average_per": 0.2810165988557621,
560
  "average_pwed": 0.10703377161801164,
@@ -600,7 +600,7 @@
600
  {
601
  "task_id": "c2139f96-e79e-4f25-a525-aa039f65555f",
602
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.9.2WithoutSpaces",
603
- "subset": "test",
604
  "num_files": 1680,
605
  "average_per": 0.9537775908999574,
606
  "average_pwed": 0.9351204819224959,
@@ -646,7 +646,7 @@
646
  {
647
  "task_id": "d146f1f1-6e6e-4b28-9420-c652ae9a1002",
648
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-nl",
649
- "subset": "test",
650
  "num_files": 1680,
651
  "average_per": 0.9887075544197294,
652
  "average_pwed": 0.9692486915717254,
@@ -692,7 +692,7 @@
692
  {
693
  "task_id": "265c5859-e7ba-492d-a6c9-45733dc17c99",
694
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-nl",
695
- "subset": "test",
696
  "num_files": 1680,
697
  "average_per": 0.9887075544197294,
698
  "average_pwed": 0.9692486915717254,
@@ -738,7 +738,7 @@
738
  {
739
  "task_id": "e297dfde-95e5-462b-a6e5-8fa43bc30bc0",
740
  "model": "speech31/wavlm-large-english-ipa",
741
- "subset": "test",
742
  "num_files": 1680,
743
  "average_per": 0.3694017596969614,
744
  "average_pwed": 0.1356824900612308,
@@ -784,7 +784,7 @@
784
  {
785
  "task_id": "efe95f71-05e3-485d-8e0c-1823a3037cf4",
786
  "model": "speech31/wavlm-large-english-ipa",
787
- "subset": "test",
788
  "num_files": 1680,
789
  "average_per": 0.3694017596969614,
790
  "average_pwed": 0.1356824900612308,
@@ -830,7 +830,7 @@
830
  {
831
  "task_id": "4b2ae2fc-fe2f-4f8b-9e8f-25c0bae13c0d",
832
  "model": "speech31/XLS-R-300m-english-ipa",
833
- "subset": "test",
834
  "num_files": 1680,
835
  "average_per": 0.36382554692045954,
836
  "average_pwed": 0.1299702312124616,
@@ -876,7 +876,7 @@
876
  {
877
  "task_id": "33d387c0-703c-415d-b8e2-81cea87a2146",
878
  "model": "speech31/wav2vec2-large-english-TIMIT-phoneme_v3",
879
- "subset": "test",
880
  "num_files": 1680,
881
  "average_per": 0.44563344149564776,
882
  "average_pwed": 0.18844914029048124,
@@ -922,7 +922,7 @@
922
  {
923
  "task_id": "c89bcefc-3884-435a-a54c-24297fe6f041",
924
  "model": "speech31/wav2vec2-large-TIMIT-IPA2",
925
- "subset": "test",
926
  "num_files": 1680,
927
  "average_per": 0.4847029843149011,
928
  "average_pwed": 0.2072006544586948,
@@ -968,7 +968,7 @@
968
  {
969
  "task_id": "81fa94f8-94ae-4601-952c-24abaddaf691",
970
  "model": "ginic/vary_individuals_young_only_3_wav2vec2-large-xlsr-buckeye-ipa",
971
- "subset": "test",
972
  "num_files": 1680,
973
  "average_per": 0.2807914104790719,
974
  "average_pwed": 0.10494355278037441,
 
2
  {
3
  "task_id": "721b4c64-a825-42d3-bb0a-bdff9ee1ed0f",
4
  "model": "facebook/wav2vec2-lv-60-espeak-cv-ft",
5
+ "subset": "timit-test",
6
  "num_files": 1680,
7
  "average_per": 0.33667301260691423,
8
  "average_pwed": 0.1276725657099669,
 
48
  {
49
  "task_id": "d6fe0956-b5b4-4105-835e-8dee1872ee4d",
50
  "model": "KoelLabs/xlsr-timit-b0",
51
+ "subset": "timit-test",
52
  "num_files": 1680,
53
  "average_per": 0.12572285528714347,
54
  "average_pwed": 0.06476636812791145,
 
94
  {
95
  "task_id": "dbf4642a-fb13-402c-8a74-cc41fc4be599",
96
  "model": "speech31/wav2vec2-large-TIMIT-IPA",
97
+ "subset": "timit-test",
98
  "num_files": 1680,
99
  "average_per": 0.4415425496841929,
100
  "average_pwed": 0.18625930002594002,
 
140
  {
141
  "task_id": "912449a4-d7ed-4af4-b5be-5c2c57ec09ff",
142
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5",
143
+ "subset": "timit-test",
144
  "num_files": 1680,
145
  "average_per": 0.6318471187460027,
146
  "average_pwed": 0.222932144739126,
 
186
  {
187
  "task_id": "c79df17e-2bb2-4253-ae26-f7cc6ab21265",
188
  "model": "facebook/wav2vec2-xlsr-53-espeak-cv-ft",
189
+ "subset": "timit-test",
190
  "num_files": 1680,
191
  "average_per": 0.348845592557092,
192
  "average_pwed": 0.1386742019529415,
 
232
  {
233
  "task_id": "f36060e6-a746-44dc-a527-54995b270053",
234
  "model": "ginic/hyperparam_tuning_1_wav2vec2-large-xlsr-buckeye-ipa",
235
+ "subset": "timit-test",
236
  "num_files": 1680,
237
  "average_per": 0.2766466385175833,
238
  "average_pwed": 0.10410683992600853,
 
278
  {
279
  "task_id": "47d56349-8111-4bda-a47f-e007dbedd36d",
280
  "model": "KoelLabs/xlsr-timit-a0",
281
+ "subset": "timit-test",
282
  "num_files": 1680,
283
  "average_per": 0.24242141955346685,
284
  "average_pwed": 0.17395311976938,
 
324
  {
325
  "task_id": "51dd5735-63bd-4fe5-a588-c0fc079076e0",
326
  "model": "KoelLabs/xlsr-timit-a0",
327
+ "subset": "timit-test",
328
  "num_files": 1680,
329
  "average_per": 0.24242141955346685,
330
  "average_pwed": 0.17395311976938,
 
370
  {
371
  "task_id": "2e592612-ca38-4afb-a6a0-3c870b288960",
372
  "model": "snu-nia-12/wav2vec2-large_nia12_phone-ipa_english",
373
+ "subset": "timit-test",
374
  "num_files": 1680,
375
  "average_per": 0.4847029843149011,
376
  "average_pwed": 0.2072006544586948,
 
416
  {
417
  "task_id": "d38e65ce-75b5-4dbf-8ade-bff6a5803790",
418
  "model": "vitouphy/wav2vec2-xls-r-300m-timit-phoneme",
419
+ "subset": "timit-test",
420
  "num_files": 1680,
421
  "average_per": 0.2561961414705681,
422
  "average_pwed": 0.1378394393452702,
 
462
  {
463
  "task_id": "2839c0c6-8f3b-426e-9eb7-04b6e133dc47",
464
  "model": "ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa-plus-2000",
465
+ "subset": "timit-test",
466
  "num_files": 1680,
467
  "average_per": 0.6479484324708775,
468
  "average_pwed": 0.18710002665151734,
 
508
  {
509
  "task_id": "59afc37a-0072-44dd-a02a-0cf47d89c120",
510
  "model": "ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
511
+ "subset": "timit-test",
512
  "num_files": 1680,
513
  "average_per": 0.6417205190285036,
514
  "average_pwed": 0.19048963968896404,
 
554
  {
555
  "task_id": "5517f6b2-6a76-4a2d-a6ce-33446f390c3b",
556
  "model": "ginic/gender_split_70_female_4_wav2vec2-large-xlsr-buckeye-ipa",
557
+ "subset": "timit-test",
558
  "num_files": 1680,
559
  "average_per": 0.2810165988557621,
560
  "average_pwed": 0.10703377161801164,
 
600
  {
601
  "task_id": "c2139f96-e79e-4f25-a525-aa039f65555f",
602
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.9.2WithoutSpaces",
603
+ "subset": "timit-test",
604
  "num_files": 1680,
605
  "average_per": 0.9537775908999574,
606
  "average_pwed": 0.9351204819224959,
 
646
  {
647
  "task_id": "d146f1f1-6e6e-4b28-9420-c652ae9a1002",
648
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-nl",
649
+ "subset": "timit-test",
650
  "num_files": 1680,
651
  "average_per": 0.9887075544197294,
652
  "average_pwed": 0.9692486915717254,
 
692
  {
693
  "task_id": "265c5859-e7ba-492d-a6c9-45733dc17c99",
694
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-nl",
695
+ "subset": "timit-test",
696
  "num_files": 1680,
697
  "average_per": 0.9887075544197294,
698
  "average_pwed": 0.9692486915717254,
 
738
  {
739
  "task_id": "e297dfde-95e5-462b-a6e5-8fa43bc30bc0",
740
  "model": "speech31/wavlm-large-english-ipa",
741
+ "subset": "timit-test",
742
  "num_files": 1680,
743
  "average_per": 0.3694017596969614,
744
  "average_pwed": 0.1356824900612308,
 
784
  {
785
  "task_id": "efe95f71-05e3-485d-8e0c-1823a3037cf4",
786
  "model": "speech31/wavlm-large-english-ipa",
787
+ "subset": "timit-test",
788
  "num_files": 1680,
789
  "average_per": 0.3694017596969614,
790
  "average_pwed": 0.1356824900612308,
 
830
  {
831
  "task_id": "4b2ae2fc-fe2f-4f8b-9e8f-25c0bae13c0d",
832
  "model": "speech31/XLS-R-300m-english-ipa",
833
+ "subset": "timit-test",
834
  "num_files": 1680,
835
  "average_per": 0.36382554692045954,
836
  "average_pwed": 0.1299702312124616,
 
876
  {
877
  "task_id": "33d387c0-703c-415d-b8e2-81cea87a2146",
878
  "model": "speech31/wav2vec2-large-english-TIMIT-phoneme_v3",
879
+ "subset": "timit-test",
880
  "num_files": 1680,
881
  "average_per": 0.44563344149564776,
882
  "average_pwed": 0.18844914029048124,
 
922
  {
923
  "task_id": "c89bcefc-3884-435a-a54c-24297fe6f041",
924
  "model": "speech31/wav2vec2-large-TIMIT-IPA2",
925
+ "subset": "timit-test",
926
  "num_files": 1680,
927
  "average_per": 0.4847029843149011,
928
  "average_pwed": 0.2072006544586948,
 
968
  {
969
  "task_id": "81fa94f8-94ae-4601-952c-24abaddaf691",
970
  "model": "ginic/vary_individuals_young_only_3_wav2vec2-large-xlsr-buckeye-ipa",
971
+ "subset": "timit-test",
972
  "num_files": 1680,
973
  "average_per": 0.2807914104790719,
974
  "average_pwed": 0.10494355278037441,
queue/tasks.json CHANGED
@@ -2,7 +2,7 @@
2
  {
3
  "id": "721b4c64-a825-42d3-bb0a-bdff9ee1ed0f",
4
  "model": "facebook/wav2vec2-lv-60-espeak-cv-ft",
5
- "subset": "test",
6
  "submission_name": "facebook espeak",
7
  "github_url": "https://github.com/facebookresearch/fairseq/blob/main/examples/wav2vec/README.md",
8
  "status": "completed",
@@ -11,7 +11,7 @@
11
  {
12
  "id": "d6fe0956-b5b4-4105-835e-8dee1872ee4d",
13
  "model": "KoelLabs/xlsr-timit-b0",
14
- "subset": "test",
15
  "submission_name": "english phoneme model",
16
  "github_url": "https://github.com/KoelLabs/",
17
  "status": "completed",
@@ -20,7 +20,7 @@
20
  {
21
  "id": "dbf4642a-fb13-402c-8a74-cc41fc4be599",
22
  "model": "speech31/wav2vec2-large-TIMIT-IPA",
23
- "subset": "test",
24
  "submission_name": "speech 31 model",
25
  "github_url": "https://huggingface.co/speech31/wav2vec2-large-TIMIT-IPA2",
26
  "status": "completed",
@@ -29,7 +29,7 @@
29
  {
30
  "id": "4e3b80be-b255-47f2-b4ae-18a12e232e8a",
31
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5",
32
- "subset": "test",
33
  "submission_name": "Jubliano model",
34
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5WithoutSpaces/tree/d5312009d8e620b183c334dfdd9ffc6b4f06f8c1",
35
  "status": "processing",
@@ -38,7 +38,7 @@
38
  {
39
  "id": "912449a4-d7ed-4af4-b5be-5c2c57ec09ff",
40
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5",
41
- "subset": "test",
42
  "submission_name": "jubiliano model wav2vec2",
43
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5WithoutSpaces/tree/d5312009d8e620b183c334dfdd9ffc6b4f06f8c1",
44
  "status": "completed",
@@ -47,7 +47,7 @@
47
  {
48
  "id": "c79df17e-2bb2-4253-ae26-f7cc6ab21265",
49
  "model": "facebook/wav2vec2-xlsr-53-espeak-cv-ft",
50
- "subset": "test",
51
  "submission_name": "xlsr 53 model",
52
  "github_url": "https://github.com/facebookresearch/fairseq/blob/main/examples/wav2vec/README.md",
53
  "status": "completed",
@@ -56,7 +56,7 @@
56
  {
57
  "id": "f36060e6-a746-44dc-a527-54995b270053",
58
  "model": "ginic/hyperparam_tuning_1_wav2vec2-large-xlsr-buckeye-ipa",
59
- "subset": "test",
60
  "submission_name": "ginic model wav2vec2 finetuned on buckeye",
61
  "github_url": "https://huggingface.co/ginic/vary_individuals_old_only_1_wav2vec2-large-xlsr-buckeye-ipa",
62
  "status": "completed",
@@ -65,7 +65,7 @@
65
  {
66
  "id": "abf6c247-9faf-46ef-b0fa-25f2669da922",
67
  "model": "KoelLabs/xlsr-timit-a0",
68
- "subset": "test",
69
  "submission_name": "Koel Labs early version of finetuned model ",
70
  "github_url": "https://github.com/KoelLabs/ML",
71
  "status": "processing",
@@ -74,7 +74,7 @@
74
  {
75
  "id": "47d56349-8111-4bda-a47f-e007dbedd36d",
76
  "model": "KoelLabs/xlsr-timit-a0",
77
- "subset": "test",
78
  "submission_name": "koel labs initial ",
79
  "github_url": "https://github.com/KoelLabs/ML/",
80
  "status": "completed",
@@ -83,7 +83,7 @@
83
  {
84
  "id": "51dd5735-63bd-4fe5-a588-c0fc079076e0",
85
  "model": "KoelLabs/xlsr-timit-a0",
86
- "subset": "test",
87
  "submission_name": "koel labs initial ",
88
  "github_url": "https://github.com/KoelLabs/ML/",
89
  "status": "completed",
@@ -92,7 +92,7 @@
92
  {
93
  "id": "2e592612-ca38-4afb-a6a0-3c870b288960",
94
  "model": "snu-nia-12/wav2vec2-large_nia12_phone-ipa_english",
95
- "subset": "test",
96
  "submission_name": "wav2vec2 ipa eng ",
97
  "github_url": "",
98
  "status": "completed",
@@ -101,7 +101,7 @@
101
  {
102
  "id": "ac4cbe86-4dbe-4929-8f76-4d2052e0acf1",
103
  "model": "vitouphy/wav2vec2-xls-r-300m-timit-phoneme",
104
- "subset": "test",
105
  "submission_name": "fine-tuned version of facebook/wav2vec2-xls-r-300m on the Timit dataset",
106
  "github_url": "https://www.kaggle.com/code/vitouphy/phoneme-recognition-with-wav2vec2",
107
  "status": "processing",
@@ -110,7 +110,7 @@
110
  {
111
  "id": "d38e65ce-75b5-4dbf-8ade-bff6a5803790",
112
  "model": "vitouphy/wav2vec2-xls-r-300m-timit-phoneme",
113
- "subset": "test",
114
  "submission_name": "fine-tuned version of facebook/wav2vec2-xls-r-300m on the Timit dataset",
115
  "github_url": "https://www.kaggle.com/code/vitouphy/phoneme-recognition-with-wav2vec2",
116
  "status": "completed",
@@ -119,7 +119,7 @@
119
  {
120
  "id": "2839c0c6-8f3b-426e-9eb7-04b6e133dc47",
121
  "model": "ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa-plus-2000",
122
- "subset": "test",
123
  "submission_name": "wav2vec2 model",
124
  "github_url": "https://huggingface.co/ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
125
  "status": "completed",
@@ -128,7 +128,7 @@
128
  {
129
  "id": "59afc37a-0072-44dd-a02a-0cf47d89c120",
130
  "model": "ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
131
- "subset": "test",
132
  "submission_name": "wav2vec2 non-english transcription",
133
  "github_url": "https://huggingface.co/ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
134
  "status": "completed",
@@ -137,7 +137,7 @@
137
  {
138
  "id": "e57eda9d-7a1d-4b41-9d47-a3d3839cac8b",
139
  "model": "ginic/gender_split_70_female_4_wav2vec2-large-xlsr-buckeye-ipa",
140
- "subset": "test",
141
  "submission_name": "phonetic transcription with the Buckeye corpus, from xlsr-53 model ",
142
  "github_url": "https://github.com/ginic/multipa/tree/buckeye_experiments",
143
  "status": "failed",
@@ -147,7 +147,7 @@
147
  {
148
  "id": "5517f6b2-6a76-4a2d-a6ce-33446f390c3b",
149
  "model": "ginic/gender_split_70_female_4_wav2vec2-large-xlsr-buckeye-ipa",
150
- "subset": "test",
151
  "submission_name": "phonetic transcription with the Buckeye corpus, from xlsr-53 model",
152
  "github_url": "https://github.com/ginic/multipa/tree/buckeye_experiments",
153
  "status": "completed",
@@ -156,7 +156,7 @@
156
  {
157
  "id": "c2139f96-e79e-4f25-a525-aa039f65555f",
158
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.9.2WithoutSpaces",
159
- "subset": "test",
160
  "submission_name": "phonetic transcription",
161
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5WithoutSpaces",
162
  "status": "completed",
@@ -165,7 +165,7 @@
165
  {
166
  "id": "d146f1f1-6e6e-4b28-9420-c652ae9a1002",
167
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-nl",
168
- "subset": "test",
169
  "submission_name": "Jubliano xlsr model",
170
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-nl1.1",
171
  "status": "completed",
@@ -174,7 +174,7 @@
174
  {
175
  "id": "265c5859-e7ba-492d-a6c9-45733dc17c99",
176
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-nl",
177
- "subset": "test",
178
  "submission_name": "Jubliano xlsr model",
179
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-nl1.1",
180
  "status": "completed",
@@ -183,7 +183,7 @@
183
  {
184
  "id": "e297dfde-95e5-462b-a6e5-8fa43bc30bc0",
185
  "model": "speech31/wavlm-large-english-ipa",
186
- "subset": "test",
187
  "submission_name": "speech31 phoneme transcription english",
188
  "github_url": "https://huggingface.co/speech31/wavlm-large-english-ipa",
189
  "status": "completed",
@@ -192,7 +192,7 @@
192
  {
193
  "id": "efe95f71-05e3-485d-8e0c-1823a3037cf4",
194
  "model": "speech31/wavlm-large-english-ipa",
195
- "subset": "test",
196
  "submission_name": "speech31 phoneme transcription english",
197
  "github_url": "https://huggingface.co/speech31/wavlm-large-english-ipa",
198
  "status": "completed",
@@ -201,7 +201,7 @@
201
  {
202
  "id": "4b2ae2fc-fe2f-4f8b-9e8f-25c0bae13c0d",
203
  "model": "speech31/XLS-R-300m-english-ipa",
204
- "subset": "test",
205
  "submission_name": "speech31 xlsr model",
206
  "github_url": "https://huggingface.co/speech31/XLS-R-300m-english-ipa",
207
  "status": "completed",
@@ -210,36 +210,16 @@
210
  {
211
  "id": "33d387c0-703c-415d-b8e2-81cea87a2146",
212
  "model": "speech31/wav2vec2-large-english-TIMIT-phoneme_v3",
213
- "subset": "test",
214
  "submission_name": "model is a fine-tuned version of facebook/wav2vec2-large on the TIMIT dataset",
215
  "github_url": "https://huggingface.co/speech31/wav2vec2-large-english-TIMIT-phoneme_v3",
216
  "status": "completed",
217
  "submitted_at": "2024-12-20T16:52:07.883839"
218
  },
219
- {
220
- "id": "03e4e265-dc1c-4052-88bd-4ef481938d9d",
221
- "model": "speech31/wav2vec2-large-TIMIT-IPA2\"",
222
- "subset": "test",
223
- "submission_name": "fine-tuned version of facebook/wav2vec2-large on the None dataset",
224
- "github_url": "https://huggingface.co/speech31/wav2vec2-large-TIMIT-IPA2",
225
- "status": "failed",
226
- "submitted_at": "2024-12-20T21:54:21.539246",
227
- "error": "Evaluation failed: Incorrect path_or_model_id: 'speech31/wav2vec2-large-TIMIT-IPA2\"'. Please provide either the path to a local folder or the repo_id of a model on the Hub."
228
- },
229
- {
230
- "id": "4575245b-ae3c-4969-adef-fd07a58560b9",
231
- "model": "speech31/wav2vec2-large-TIMIT-IPA2\"",
232
- "subset": "test",
233
- "submission_name": "fine-tuned version of facebook/wav2vec2-large on the None dataset",
234
- "github_url": "https://huggingface.co/speech31/wav2vec2-large-TIMIT-IPA2",
235
- "status": "failed",
236
- "submitted_at": "2024-12-20T21:54:25.670276",
237
- "error": "Evaluation failed: Incorrect path_or_model_id: 'speech31/wav2vec2-large-TIMIT-IPA2\"'. Please provide either the path to a local folder or the repo_id of a model on the Hub."
238
- },
239
  {
240
  "id": "c89bcefc-3884-435a-a54c-24297fe6f041",
241
  "model": "speech31/wav2vec2-large-TIMIT-IPA2",
242
- "subset": "test",
243
  "submission_name": "fine-tuned version of facebook/wav2vec2-large on the None dataset",
244
  "github_url": "https://huggingface.co/speech31/wav2vec2-large-TIMIT-IPA2",
245
  "status": "completed",
@@ -248,7 +228,7 @@
248
  {
249
  "id": "81fa94f8-94ae-4601-952c-24abaddaf691",
250
  "model": "ginic/vary_individuals_young_only_3_wav2vec2-large-xlsr-buckeye-ipa",
251
- "subset": "test",
252
  "submission_name": "ginic model, facebook/wav2vec2-large-xlsr-53 fine tuned",
253
  "github_url": "https://huggingface.co/ginic/vary_individuals_young_only_3_wav2vec2-large-xlsr-buckeye-ipa",
254
  "status": "completed",
 
2
  {
3
  "id": "721b4c64-a825-42d3-bb0a-bdff9ee1ed0f",
4
  "model": "facebook/wav2vec2-lv-60-espeak-cv-ft",
5
+ "subset": "timit-test",
6
  "submission_name": "facebook espeak",
7
  "github_url": "https://github.com/facebookresearch/fairseq/blob/main/examples/wav2vec/README.md",
8
  "status": "completed",
 
11
  {
12
  "id": "d6fe0956-b5b4-4105-835e-8dee1872ee4d",
13
  "model": "KoelLabs/xlsr-timit-b0",
14
+ "subset": "timit-test",
15
  "submission_name": "english phoneme model",
16
  "github_url": "https://github.com/KoelLabs/",
17
  "status": "completed",
 
20
  {
21
  "id": "dbf4642a-fb13-402c-8a74-cc41fc4be599",
22
  "model": "speech31/wav2vec2-large-TIMIT-IPA",
23
+ "subset": "timit-test",
24
  "submission_name": "speech 31 model",
25
  "github_url": "https://huggingface.co/speech31/wav2vec2-large-TIMIT-IPA2",
26
  "status": "completed",
 
29
  {
30
  "id": "4e3b80be-b255-47f2-b4ae-18a12e232e8a",
31
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5",
32
+ "subset": "timit-test",
33
  "submission_name": "Jubliano model",
34
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5WithoutSpaces/tree/d5312009d8e620b183c334dfdd9ffc6b4f06f8c1",
35
  "status": "processing",
 
38
  {
39
  "id": "912449a4-d7ed-4af4-b5be-5c2c57ec09ff",
40
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5",
41
+ "subset": "timit-test",
42
  "submission_name": "jubiliano model wav2vec2",
43
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5WithoutSpaces/tree/d5312009d8e620b183c334dfdd9ffc6b4f06f8c1",
44
  "status": "completed",
 
47
  {
48
  "id": "c79df17e-2bb2-4253-ae26-f7cc6ab21265",
49
  "model": "facebook/wav2vec2-xlsr-53-espeak-cv-ft",
50
+ "subset": "timit-test",
51
  "submission_name": "xlsr 53 model",
52
  "github_url": "https://github.com/facebookresearch/fairseq/blob/main/examples/wav2vec/README.md",
53
  "status": "completed",
 
56
  {
57
  "id": "f36060e6-a746-44dc-a527-54995b270053",
58
  "model": "ginic/hyperparam_tuning_1_wav2vec2-large-xlsr-buckeye-ipa",
59
+ "subset": "timit-test",
60
  "submission_name": "ginic model wav2vec2 finetuned on buckeye",
61
  "github_url": "https://huggingface.co/ginic/vary_individuals_old_only_1_wav2vec2-large-xlsr-buckeye-ipa",
62
  "status": "completed",
 
65
  {
66
  "id": "abf6c247-9faf-46ef-b0fa-25f2669da922",
67
  "model": "KoelLabs/xlsr-timit-a0",
68
+ "subset": "timit-test",
69
  "submission_name": "Koel Labs early version of finetuned model ",
70
  "github_url": "https://github.com/KoelLabs/ML",
71
  "status": "processing",
 
74
  {
75
  "id": "47d56349-8111-4bda-a47f-e007dbedd36d",
76
  "model": "KoelLabs/xlsr-timit-a0",
77
+ "subset": "timit-test",
78
  "submission_name": "koel labs initial ",
79
  "github_url": "https://github.com/KoelLabs/ML/",
80
  "status": "completed",
 
83
  {
84
  "id": "51dd5735-63bd-4fe5-a588-c0fc079076e0",
85
  "model": "KoelLabs/xlsr-timit-a0",
86
+ "subset": "timit-test",
87
  "submission_name": "koel labs initial ",
88
  "github_url": "https://github.com/KoelLabs/ML/",
89
  "status": "completed",
 
92
  {
93
  "id": "2e592612-ca38-4afb-a6a0-3c870b288960",
94
  "model": "snu-nia-12/wav2vec2-large_nia12_phone-ipa_english",
95
+ "subset": "timit-test",
96
  "submission_name": "wav2vec2 ipa eng ",
97
  "github_url": "",
98
  "status": "completed",
 
101
  {
102
  "id": "ac4cbe86-4dbe-4929-8f76-4d2052e0acf1",
103
  "model": "vitouphy/wav2vec2-xls-r-300m-timit-phoneme",
104
+ "subset": "timit-test",
105
  "submission_name": "fine-tuned version of facebook/wav2vec2-xls-r-300m on the Timit dataset",
106
  "github_url": "https://www.kaggle.com/code/vitouphy/phoneme-recognition-with-wav2vec2",
107
  "status": "processing",
 
110
  {
111
  "id": "d38e65ce-75b5-4dbf-8ade-bff6a5803790",
112
  "model": "vitouphy/wav2vec2-xls-r-300m-timit-phoneme",
113
+ "subset": "timit-test",
114
  "submission_name": "fine-tuned version of facebook/wav2vec2-xls-r-300m on the Timit dataset",
115
  "github_url": "https://www.kaggle.com/code/vitouphy/phoneme-recognition-with-wav2vec2",
116
  "status": "completed",
 
119
  {
120
  "id": "2839c0c6-8f3b-426e-9eb7-04b6e133dc47",
121
  "model": "ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa-plus-2000",
122
+ "subset": "timit-test",
123
  "submission_name": "wav2vec2 model",
124
  "github_url": "https://huggingface.co/ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
125
  "status": "completed",
 
128
  {
129
  "id": "59afc37a-0072-44dd-a02a-0cf47d89c120",
130
  "model": "ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
131
+ "subset": "timit-test",
132
  "submission_name": "wav2vec2 non-english transcription",
133
  "github_url": "https://huggingface.co/ctaguchi/wav2vec2-large-xlsr-japlmthufielta-ipa1000-ns",
134
  "status": "completed",
 
137
  {
138
  "id": "e57eda9d-7a1d-4b41-9d47-a3d3839cac8b",
139
  "model": "ginic/gender_split_70_female_4_wav2vec2-large-xlsr-buckeye-ipa",
140
+ "subset": "timit-test",
141
  "submission_name": "phonetic transcription with the Buckeye corpus, from xlsr-53 model ",
142
  "github_url": "https://github.com/ginic/multipa/tree/buckeye_experiments",
143
  "status": "failed",
 
147
  {
148
  "id": "5517f6b2-6a76-4a2d-a6ce-33446f390c3b",
149
  "model": "ginic/gender_split_70_female_4_wav2vec2-large-xlsr-buckeye-ipa",
150
+ "subset": "timit-test",
151
  "submission_name": "phonetic transcription with the Buckeye corpus, from xlsr-53 model",
152
  "github_url": "https://github.com/ginic/multipa/tree/buckeye_experiments",
153
  "status": "completed",
 
156
  {
157
  "id": "c2139f96-e79e-4f25-a525-aa039f65555f",
158
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.9.2WithoutSpaces",
159
+ "subset": "timit-test",
160
  "submission_name": "phonetic transcription",
161
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-INTERNATIONAL1.5WithoutSpaces",
162
  "status": "completed",
 
165
  {
166
  "id": "d146f1f1-6e6e-4b28-9420-c652ae9a1002",
167
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-nl",
168
+ "subset": "timit-test",
169
  "submission_name": "Jubliano xlsr model",
170
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-nl1.1",
171
  "status": "completed",
 
174
  {
175
  "id": "265c5859-e7ba-492d-a6c9-45733dc17c99",
176
  "model": "Jubliano/wav2vec2-large-xls-r-300m-ipa-nl",
177
+ "subset": "timit-test",
178
  "submission_name": "Jubliano xlsr model",
179
  "github_url": "https://huggingface.co/Jubliano/wav2vec2-large-xls-r-300m-ipa-nl1.1",
180
  "status": "completed",
 
183
  {
184
  "id": "e297dfde-95e5-462b-a6e5-8fa43bc30bc0",
185
  "model": "speech31/wavlm-large-english-ipa",
186
+ "subset": "timit-test",
187
  "submission_name": "speech31 phoneme transcription english",
188
  "github_url": "https://huggingface.co/speech31/wavlm-large-english-ipa",
189
  "status": "completed",
 
192
  {
193
  "id": "efe95f71-05e3-485d-8e0c-1823a3037cf4",
194
  "model": "speech31/wavlm-large-english-ipa",
195
+ "subset": "timit-test",
196
  "submission_name": "speech31 phoneme transcription english",
197
  "github_url": "https://huggingface.co/speech31/wavlm-large-english-ipa",
198
  "status": "completed",
 
201
  {
202
  "id": "4b2ae2fc-fe2f-4f8b-9e8f-25c0bae13c0d",
203
  "model": "speech31/XLS-R-300m-english-ipa",
204
+ "subset": "timit-test",
205
  "submission_name": "speech31 xlsr model",
206
  "github_url": "https://huggingface.co/speech31/XLS-R-300m-english-ipa",
207
  "status": "completed",
 
210
  {
211
  "id": "33d387c0-703c-415d-b8e2-81cea87a2146",
212
  "model": "speech31/wav2vec2-large-english-TIMIT-phoneme_v3",
213
+ "subset": "timit-test",
214
  "submission_name": "model is a fine-tuned version of facebook/wav2vec2-large on the TIMIT dataset",
215
  "github_url": "https://huggingface.co/speech31/wav2vec2-large-english-TIMIT-phoneme_v3",
216
  "status": "completed",
217
  "submitted_at": "2024-12-20T16:52:07.883839"
218
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
219
  {
220
  "id": "c89bcefc-3884-435a-a54c-24297fe6f041",
221
  "model": "speech31/wav2vec2-large-TIMIT-IPA2",
222
+ "subset": "timit-test",
223
  "submission_name": "fine-tuned version of facebook/wav2vec2-large on the None dataset",
224
  "github_url": "https://huggingface.co/speech31/wav2vec2-large-TIMIT-IPA2",
225
  "status": "completed",
 
228
  {
229
  "id": "81fa94f8-94ae-4601-952c-24abaddaf691",
230
  "model": "ginic/vary_individuals_young_only_3_wav2vec2-large-xlsr-buckeye-ipa",
231
+ "subset": "timit-test",
232
  "submission_name": "ginic model, facebook/wav2vec2-large-xlsr-53 fine tuned",
233
  "github_url": "https://huggingface.co/ginic/vary_individuals_young_only_3_wav2vec2-large-xlsr-buckeye-ipa",
234
  "status": "completed",