Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling

Daniel Korzekwa; Jaime Lorenzo-trueba; Szymon Zaporowski; Shira Calamaro; Thomas Drugman; Bożena Kostek

doi:10.1109/icassp39728.2021.9413953

A common approach to the automatic detection of mispronunciation in language learning is to recognize the phonemes produced by a student and compare it to the expected pronunciation of a native speaker. This approach makes two simplifying assumptions: a) phonemes can be recognized from speech with high accuracy, b) there is a single correct way for a sentence to be pronounced. These assumptions do not always hold, which can result in a significant amount of false mispronunciation alarms. We propose a novel approach to overcome this problem based on two principles: a) taking into account uncertainty in the automatic phoneme recognition step, b) accounting for the fact that there may be multiple valid pronunciations. We evaluate the model on non-native (L2) English speech of German, Italian and Polish speakers, where it is shown to increase the precision of detecting mispronunciations by up to 18% (relative) compared to the common approach.

Autorzy

mgr inż. Daniel Korzekwa,
Jaime Lorenzo-trueba,
mgr inż. Szymon Zaporowski link otwiera się w nowej karcie ,
Shira Calamaro,
dr Thomas Drugman,
prof. dr hab. inż. Bożena Kostek link otwiera się w nowej karcie

Informacje dodatkowe

DOI: Cyfrowy identyfikator dokumentu elektronicznego link otwiera się w nowej karcie 10.1109/icassp39728.2021.9413953
Kategoria: Aktywność konferencyjna
Typ: publikacja w wydawnictwie zbiorowym recenzowanym (także w materiałach konferencyjnych)
Język: angielski
Rok wydania: 2021

Źródło danych: MOSTWiedzy.pl - publikacja "Mispronunciation Detection in Non-Native (L2) English with Uncertainty Modeling" link otwiera się w nowej karcie

link otwiera się w nowej karcie

Repozytorium publikacji - Politechnika Gdańska

Treść strony