SungBeom commited on
Commit
24a9843
ยท
1 Parent(s): 197080a

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +47 -0
README.md ADDED
@@ -0,0 +1,47 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - ko
5
+ library_name: nemo
6
+ pipeline_tag: automatic-speech-recognition
7
+ tags:
8
+ - conformer-ctc
9
+ metrics:
10
+ - wer
11
+ ---
12
+ # Conformer-ctc-medium-ko
13
+ ํ•ด๋‹น ๋ชจ๋ธ์€ [RIVA Conformer ASR Korean](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/tao/models/speechtotext_ko_kr_conformer)์„ AI hub dataset์— ๋Œ€ํ•ด ํŒŒ์ธํŠœ๋‹์„ ์ง„ํ–‰ํ–ˆ์Šต๋‹ˆ๋‹ค. <br>
14
+ Conformer ๊ธฐ๋ฐ˜์˜ ๋ชจ๋ธ์€ whisper์™€ ๊ฐ™์€ attention ๊ธฐ๋ฐ˜ ๋ชจ๋ธ๊ณผ ๋‹ฌ๋ฆฌ streaming์„ ์ง„ํ–‰ํ•˜์—ฌ๋„ ์„ฑ๋Šฅ์ด ํฌ๊ฒŒ ๋–จ์–ด์ง€์ง€ ์•Š๊ณ , ์†๋„๊ฐ€ ๋น ๋ฅด๋‹ค๋Š” ์žฅ์ ์ด ์žˆ์Šต๋‹ˆ๋‹ค.
15
+
16
+
17
+ ### dataset
18
+
19
+ | ๋ฐ์ดํ„ฐ์…‹ ์ด๋ฆ„ | ๋ฐ์ดํ„ฐ ์ƒ˜ํ”Œ ์ˆ˜(train/test) |
20
+ | --- | --- |
21
+ | ๊ณ ๊ฐ์‘๋Œ€์Œ์„ฑ | 2067668/21092 |
22
+ | ํ•œ๊ตญ์–ด ์Œ์„ฑ | 620000/3000 |
23
+ | ํ•œ๊ตญ์ธ ๋Œ€ํ™” ์Œ์„ฑ | 2483570/142399 |
24
+ | ์ž์œ ๋Œ€ํ™”์Œ์„ฑ(์ผ๋ฐ˜๋‚จ๋…€) | 1886882/263371 |
25
+ | ๋ณต์ง€ ๋ถ„์•ผ ์ฝœ์„ผํ„ฐ ์ƒ๋‹ด๋ฐ์ดํ„ฐ | 1096704/206470 |
26
+ | ์ฐจ๋Ÿ‰๋‚ด ๋Œ€ํ™” ๋ฐ์ดํ„ฐ | 2624132/332787 |
27
+ | ๋ช…๋ น์–ด ์Œ์„ฑ(๋…ธ์ธ๋‚จ์—ฌ) | 137467/237469 |
28
+ | ์ „์ฒด | 10916423(13946์‹œ๊ฐ„)/1206588(1474์‹œ๊ฐ„) |
29
+
30
+
31
+ ## Training procedure
32
+
33
+ ### Training hyperparameters
34
+
35
+ The following hyperparameters were used during training:
36
+ - learning_rate: 1e-05
37
+ - train_batch_size: 16
38
+ - eval_batch_size: 16
39
+ - num_train_epoch: 2
40
+ - sample_rate: 16000
41
+ - max_duration: 20.0
42
+
43
+ ### Training results
44
+
45
+ | Training Loss | Epoch | Wer |
46
+ |:-------------:|:-----:|:-------:|
47
+ | 9.09 | 1.0 | 11.51 |