Investigating Feature Spaces for Isolated Word Recognition

Grazina Korvel; Gintautas Tamulevicus; Povilas Treigys; Jolita Bernataviciene; Bożena Kostek

doi:10.15388/damss.2018.1

Much attention is given by researchers to the speech processing task in automatic speech recognition (ASR) over the past decades. The study addresses the issue related to the investigation of the appropriateness of a two-dimensional representation of speech feature spaces for speech recognition tasks based on deep learning techniques. The approach combines Convolutional Neural Networks (CNNs) and timefrequency signal representation converted to the investigative feature spaces. In particular, fractal dimension features of the signal were chosen for the time domain, and two feature spaces were investigated for the frequency domain, namely: frequency tracks obtained from the frequencies and amplitudes of the detected spectral peaks and the modified chromagrams. Both are constructed from a series of short-time Fourier transforms, which were computed along the window speech signal in the time domain. Due to the fact that deep learning requires a sufficiently large training set as the size of the corpus may significantly influence the outcome, thus for the data augmentation purpose, the created dataset was extended by adding various noise levels and mixed with the speech signal. In order to evaluate the applicability of implemented feature spaces for isolated word recognition task, three experiments were conducted: a 10-word, a 70-word, and a 111-word cases were analyzed.

Autorzy

dr Grazina Korvel link otwiera się w nowej karcie ,
Gintautas Tamulevicus,
Povilas Treigys,
Jolita Bernataviciene,
prof. dr hab. inż. Bożena Kostek link otwiera się w nowej karcie

Informacje dodatkowe

DOI: Cyfrowy identyfikator dokumentu elektronicznego link otwiera się w nowej karcie 10.15388/damss.2018.1
Kategoria: Aktywność konferencyjna
Typ: publikacja w wydawnictwie zbiorowym recenzowanym (także w materiałach konferencyjnych)
Język: angielski
Rok wydania: 2018

Źródło danych: MOSTWiedzy.pl - publikacja "Investigating Feature Spaces for Isolated Word Recognition" link otwiera się w nowej karcie

link otwiera się w nowej karcie

Repozytorium publikacji - Politechnika Gdańska

Treść strony