DECM: Evaluating Bilingual ASR Performance on a Code-switching/mixing Benchmark

Enes Yavuz Ugan; Ngoc-Quan Pham; Alex Waibel

DECM: Evaluating Bilingual ASR Performance on a Code-switching/mixing Benchmark

Enes Yavuz Ugan, Ngoc-Quan Pham, Alexander Waibel

Abstract

Automatic Speech Recognition has made significant progress, but challenges persist. Code-switched (CSW) Speech presents one such challenge, involving the mixing of multiple languages by a speaker. Even when multilingual ASR models are trained, each utterance on its own usually remains monolingual. We introduce an evaluation dataset for German-English CSW, with German as the matrix language and English as the embedded language. The dataset comprises spontaneous speech from diverse domains, enabling realistic CSW evaluation in German-English. It includes splits with varying degrees of CSW to facilitate specialized model analysis. As it is difficult to collect CSW data for all language pairs, the provision of such evaluation data, is crucial for developing and analyzing ASR models capable of generalizing across unseen pairs. Detailed data statistics are presented, and state-of-the-art (SOTA) multilingual models are evaluated showing challanges of CSW speech.

Anthology ID:: 2024.lrec-main.400
Volume:: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Month:: May
Year:: 2024
Address:: Torino, Italia
Editors:: Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
Venues:: LREC | COLING
SIG:
Publisher:: ELRA and ICCL
Note:
Pages:: 4468–4475
Language:
URL:: https://aclanthology.org/2024.lrec-main.400
DOI:
Bibkey:
Cite (ACL):: Enes Yavuz Ugan, Ngoc-Quan Pham, and Alexander Waibel. 2024. DECM: Evaluating Bilingual ASR Performance on a Code-switching/mixing Benchmark. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 4468–4475, Torino, Italia. ELRA and ICCL.
Cite (Informal):: DECM: Evaluating Bilingual ASR Performance on a Code-switching/mixing Benchmark (Ugan et al., LREC-COLING 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.lrec-main.400.pdf

PDF Cite Search