Vehicle Type Recognition Based on Audio Data

Dariusz Kobiela; Michał Hajdasz; Mateusz Erezman; Karolina Nurzyńska; Szymon Zaporowski; Adam Kurowski; Paweł Weichbroth

doi:10.24251/hicss.2025.144

Identifying different vehicle types can help manage traffic more efficiently, reduce congestion, and improve public safety. This study aims to create a classification model that can recognize vehicle types based on the sound of passing vehicles. To achieve this, a database of raw audio files containing 1763 samples from two sources was assembled. The time-domain signals were converted to a time-frequency representation using the short-time Fourier transform to generate Mel Spectrograms. Mel-frequency Cepstral Coefficients (MFCCs) were also generated using the discrete cosine transform. In our experiments we compared these approaches. Since the data was imbalanced we applied online augmentation. Based on the literature review, we chose a Convolutional Neural Network (CNN) classifier because it is particularly well suited for analyzing large datasets due to its automatic feature extraction, parameter sharing and sparsity. The results showed that Mel Spectrograms were more effective for audio data preprocessing in this particular use case, achieving the highest accuracy of 0.875 and the highest f1-score of 0.877 compared to MFCCs.

Autorzy

Pobierz publikację

Informacje dodatkowe

DOI: Cyfrowy identyfikator dokumentu elektronicznego link otwiera się w nowej karcie 10.24251/hicss.2025.144
Kategoria: Aktywność konferencyjna
Typ: publikacja w wydawnictwie zbiorowym recenzowanym (także w materiałach konferencyjnych)
Język: angielski
Rok wydania: 2025

Źródło danych: MOSTWiedzy.pl - publikacja "Vehicle Type Recognition Based on Audio Data" link otwiera się w nowej karcie

link otwiera się w nowej karcie

Repozytorium publikacji - Politechnika Gdańska

Treść strony