Publication Repository - Politechnika Gdańska


Vehicle Type Recognition Based on Audio Data

Identifying different vehicle types can help manage traffic more efficiently, reduce congestion, and improve public safety. This study aims to create a classification model that recognizes vehicle types from the sound of passing vehicles. To achieve this, a database of raw audio files containing 1763 samples from two sources was assembled. The time-domain signals were converted to a time-frequency representation using the short-time Fourier transform to generate Mel spectrograms. Mel-frequency cepstral coefficients (MFCCs) were also generated using the discrete cosine transform, and the two representations were compared in our experiments. Because the data was imbalanced, we applied online augmentation. Based on the literature review, we chose a Convolutional Neural Network (CNN) classifier because its automatic feature extraction, parameter sharing, and sparsity make it particularly well suited for analyzing large datasets. The results showed that Mel spectrograms were more effective than MFCCs for audio data preprocessing in this particular use case, achieving the highest accuracy (0.875) and the highest F1-score (0.877).
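
The two feature representations named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' pipeline: the frame length, hop size, number of mel bands, number of coefficients, the Hann window, and the synthetic 1 kHz tone standing in for a recording are all assumptions chosen for illustration. It shows a short-time Fourier transform followed by a triangular mel filterbank (log-Mel spectrogram), and a type-II discrete cosine transform over the mel axis (MFCCs).

```python
import numpy as np


def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_filterbank(sr, n_fft, n_mels):
    """Triangular filters evenly spaced on the mel scale, shape (n_mels, n_fft // 2 + 1)."""
    freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    edges = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    fb = np.zeros((n_mels, freqs.size))
    for i in range(n_mels):
        lo, mid, hi = edges[i], edges[i + 1], edges[i + 2]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        fb[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb


def log_mel_spectrogram(y, sr, n_fft=1024, hop=256, n_mels=40):
    """STFT power spectrum projected onto mel bands, in dB, shape (n_mels, n_frames)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(y) - n_fft) // hop
    frames = np.stack([y[t * hop:t * hop + n_fft] * window for t in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2      # (n_frames, n_fft // 2 + 1)
    mel = power @ mel_filterbank(sr, n_fft, n_mels).T      # (n_frames, n_mels)
    return 10.0 * np.log10(np.maximum(mel, 1e-10)).T


def mfcc(log_mel, n_mfcc=13):
    """Unnormalized DCT-II along the mel axis, shape (n_mfcc, n_frames)."""
    n_mels = log_mel.shape[0]
    k = np.arange(n_mfcc)[:, None]
    n = np.arange(n_mels)[None, :]
    basis = np.cos(np.pi * k * (2 * n + 1) / (2 * n_mels))
    return basis @ log_mel


# Hypothetical input: one second of a 1 kHz tone instead of a real vehicle recording.
sr = 16000
t = np.arange(sr) / sr
y = np.sin(2.0 * np.pi * 1000.0 * t)

log_mel = log_mel_spectrogram(y, sr)
coeffs = mfcc(log_mel)
print(log_mel.shape, coeffs.shape)
```

Either array could then be fed to a CNN as a single-channel image; the sketch stops at feature extraction because the abstract does not specify the network architecture or the augmentation used.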

Authors

Additional information

DOI
Digital Object Identifier: 10.24251/hicss.2025.144
Category
Conference activity
Type
publication in a peer-reviewed collective work (including conference proceedings)
Language
English
Year of publication
2025

Data source: MOSTWiedzy.pl - publication "Vehicle Type Recognition Based on Audio Data"
