INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH

Grazina Korvel; Povilas Treigys; Krzysztof Kąkol; Bożena Kostek

doi:10.34768/amcs-2023-0035

The Lombard effect is an involuntary increase in the speaker’s pitch, intensity, and duration in the presence of noise. It makes it possible to communicate in noisy environments more effectively. This study aims to investigate an efficient method for detecting the Lombard effect in uttered speech. The influence of interfering noise, room type, and the gender of the person on the detection process is examined. First, acoustic parameters related to speech changes produced by the Lombard effect are extracted. Mid-term statistics are built upon the parameters and used for the self-similarity matrix construction. They constitute input data for a convolutional neural network (CNN). The self-similarity-based approach is then compared with two other methods, i.e., spectrograms used as input to the CNN and speech acoustic parameters combined with the k-nearest neighbors algorithm. The experimental investigations show the superiority of the self-similarity approach applied to Lombard effect detection over the other two methods utilized. Moreover, small standard deviation values for the self-similarity approach prove the resulting high accuracies.

Autorzy

Grazina Korvel,
dr Povilas Treigys,
mgr inż. Krzysztof Kąkol,
prof. dr hab. inż. Bożena Kostek link otwiera się w nowej karcie

Pobierz publikację

Informacje dodatkowe

DOI: Cyfrowy identyfikator dokumentu elektronicznego link otwiera się w nowej karcie 10.34768/amcs-2023-0035
Kategoria: Publikacja w czasopiśmie
Typ: artykuły w czasopismach
Język: angielski
Rok wydania: 2023

Źródło danych: MOSTWiedzy.pl - publikacja "INVESTIGATION OF THE LOMBARD EFFECT BASED ON A MACHINE LEARNING APPROACH" link otwiera się w nowej karcie

link otwiera się w nowej karcie

Repozytorium publikacji - Politechnika Gdańska

Treść strony