Vehicle Type Recognition Based on Audio Data

Dariusz Kobiela; Michał Hajdasz; Mateusz Erezman; Karolina Nurzyńska; Szymon Zaporowski; Adam Kurowski; Paweł Weichbroth

doi:10.24251/hicss.2025.144

Identifying different vehicle types can help manage traffic more efficiently, reduce congestion, and improve public safety. This study aims to create a classification model that can recognize vehicle types based on the sound of passing vehicles. To achieve this, a database of raw audio files containing 1763 samples from two sources was assembled. The time-domain signals were converted to a time-frequency representation using the short-time Fourier transform to generate Mel Spectrograms. Mel-frequency Cepstral Coefficients (MFCCs) were also generated using the discrete cosine transform. In our experiments we compared these approaches. Since the data was imbalanced we applied online augmentation. Based on the literature review, we chose a Convolutional Neural Network (CNN) classifier because it is particularly well suited for analyzing large datasets due to its automatic feature extraction, parameter sharing and sparsity. The results showed that Mel Spectrograms were more effective for audio data preprocessing in this particular use case, achieving the highest accuracy of 0.875 and the highest f1-score of 0.877 compared to MFCCs.

Authors

Download

Additional information

DOI: Digital Object Identifier link open in new tab 10.24251/hicss.2025.144
Category: Aktywność konferencyjna
Type: publikacja w wydawnictwie zbiorowym recenzowanym (także w materiałach konferencyjnych)
Language: angielski
Publication year: 2025

Source: MOSTWiedzy.pl - publication "Vehicle Type Recognition Based on Audio Data" link open in new tab

link open in new tab

Publications Repository - Gdańsk University of Technology

Treść strony