Support datasets:
- MDCC
- AISHELL-1
- THCHS-30
- MAGICDATA Mandarin Chinese Read Speech Corpus
sh mdcc.sh
Cantonse-ASR: Yu, Tiezheng, Frieske, Rita, Xu, Peng, Cahyawijaya, Samuel, Yiu, Cheuk Tung, Lovenia, Holy, Dai, Wenliang, Barezi, Elham, Chen, Qifeng, Ma, Xiaojuan, Shi, Bertram, Fung, Pascale (2022) "Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset", 2022. Link: https://arxiv.org/pdf/2201.02419.pdf
sh aishell_1.sh
sh thchs_30.sh
sh magicdata_mcrsc.sh