Repozytorium publikacji - Politechnika Gdańska

Ustawienia strony

english
Repozytorium publikacji
Politechniki Gdańskiej

Treść strony

Automatic labeling of traffic sound recordings using autoencoder-derived features

An approach to detection of events occurring in road traffic using autoencoders is presented. Extensions of existing algorithms of acoustic road events detection employing Mel Frequency Cepstral Coefficients combined with classifiers based on k nearest neighbors, Support Vector Machines, and random forests are used. In our research, the acoustic signal gathered from the microphone placed near the road is split into frames and converted into a 2-dimensional form of Mel-cepstrogram. Next, the sequence of mel-cepstrograms is processed by the autoencoder neural network, which assigns a unique embedding to each of processed mel-cepstrograms. The embeddings may be treated as features which can be fed on the input of other machine learning-based classifiers. In our research, we prepared such an autoencoder and compared it with a standard solution of parameterization consisting of averaging MFCC throughout all the analyzed frames. Both types of features were then treated as an input for selected types classifiers. It was found, that parameters derived by the autoencoder neural network may be useful for improving the performance of classifiers in case of problematic classes such as detection of single and multiple vehicles passes.

Autorzy

Informacje dodatkowe

Kategoria
Aktywność konferencyjna
Typ
publikacja w wydawnictwie zbiorowym recenzowanym (także w materiałach konferencyjnych)
Język
angielski
Rok wydania
2019

Źródło danych: MOSTWiedzy.pl - publikacja "Automatic labeling of traffic sound recordings using autoencoder-derived features" link otwiera się w nowej karcie

Portal MOST Wiedzy link otwiera się w nowej karcie