An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics

Grazina Korvel; Olga Kurasova; Bożena Kostek

doi:10.5220/0007854302800289

Speech with the Lombard effect has been studied extensively in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create mathematical models that retain the Lombard effect. These models could serve as the basis of a formant speech synthesizer. The proposed models are based on dividing the speech signal into harmonics and modeling them as the output of a SISO system whose transfer function poles are multiple and whose inputs vary in time. The Lombard effect in the synthesized signal is analyzed on the noise residual. The synthesized signal residual is described by vectors of acoustic parameters related to the Lombard effect. To test the performance of the created models in various noise conditions, two classifiers are employed, namely kNN and Naive Bayes. For comparison, we also created models of sinusoids based on frequency tracks. The results show that a model based on the residual sinewave sum demonstrates the possibility of retaining the Lombard effect. Finally, directions for future work are outlined in the conclusions.
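To make the idea of splitting a signal into harmonics and a residual concrete, the following is a minimal sketch only. It is not the SISO-system model described in the paper: it swaps in a plain least-squares fit of a harmonic sinewave sum, and it assumes the fundamental frequency is already known. The function name fit_harmonics, the synthetic test frame, and all parameter values are hypothetical and chosen for illustration.

```python
import numpy as np


def fit_harmonics(signal, fs, f0, n_harmonics):
    """Least-squares fit of a harmonic sinewave sum (illustrative, not the paper's model).

    Returns the fitted harmonic part and the residual (signal minus fit).
    """
    t = np.arange(len(signal)) / fs
    # Design matrix: one cosine and one sine column per harmonic k * f0,
    # so each harmonic's amplitude and phase are fitted linearly.
    columns = []
    for k in range(1, n_harmonics + 1):
        columns.append(np.cos(2 * np.pi * k * f0 * t))
        columns.append(np.sin(2 * np.pi * k * f0 * t))
    basis = np.column_stack(columns)
    coeffs, *_ = np.linalg.lstsq(basis, signal, rcond=None)
    model = basis @ coeffs
    return model, signal - model


# Synthetic 40 ms frame (assumed values): three harmonics of 200 Hz plus weak noise.
fs = 16000
f0 = 200.0
t = np.arange(640) / fs
rng = np.random.default_rng(0)
clean = (
    1.0 * np.sin(2 * np.pi * f0 * t)
    + 0.5 * np.sin(2 * np.pi * 2 * f0 * t + 0.3)
    + 0.25 * np.cos(2 * np.pi * 3 * f0 * t)
)
frame = clean + 0.01 * rng.standard_normal(t.size)

model, residual = fit_harmonics(frame, fs, f0, n_harmonics=3)
# Fraction of the frame energy left in the residual after removing the harmonics.
residual_energy_ratio = float(np.sum(residual**2) / np.sum(frame**2))
```

In this sketch the residual is simply what remains after the harmonic part is subtracted. Acoustic parameters could then be computed on it and passed to a classifier such as kNN or Naive Bayes, as the abstract describes, but the specific parameters the authors used are not stated here.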

Authors

Grazina Korvel,
dr Olga Kurasova,
prof. dr hab. inż. Bożena Kostek

Additional information

DOI: 10.5220/0007854302800289
Category: Conference activity
Type: publication in a peer-reviewed collective work (including conference proceedings)
Language: English
Publication year: 2019

Source: MOSTWiedzy.pl - publication "An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics"

Publications Repository - Gdańsk University of Technology