Vision-based semantic segmentation of complex urban street scenes is a key function in autonomous driving (AD), a technology expected to become widespread in industrialized countries in the near future. Today, advanced driver assistance systems (ADAS) improve traffic safety through solutions that detect objects, recognise road signs, segment the road surface, and so on. These functionalities rely on various classifiers. This publication presents solutions based on convolutional neural networks, such as MobileNet and ResNet50, used as encoders in the U-Net model to semantically segment images of complex urban scenes from the publicly available Cityscapes dataset. Modifications to the encoder/decoder architecture of the U-Net model were also proposed; the resulting model is named MU-Net. In tests on 500 images, the MU-Net model produced slightly better segmentation results than the off-the-shelf MobileNet- and ResNet-based networks, achieving a Jaccard index of 88.85%. The experiments showed that the MobileNet network had the best ratio of accuracy to number of parameters and was at the same time the least sensitive to unusual phenomena occurring in the images.
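The Jaccard index (intersection over union) used above to score the segmentation models can be sketched as follows. This is an illustrative implementation, not the authors' code; the function name, the per-class averaging, and the toy label maps are assumptions for demonstration.

```python
import numpy as np

def jaccard_index(pred, target, num_classes):
    """Mean Jaccard index (IoU) over classes for integer label maps.

    For each class c: |pred==c AND target==c| / |pred==c OR target==c|.
    Classes absent from both masks are skipped to avoid 0/0.
    """
    ious = []
    for c in range(num_classes):
        p = pred == c
        t = target == c
        union = np.logical_or(p, t).sum()
        if union == 0:
            continue  # class absent in both masks; skip it
        inter = np.logical_and(p, t).sum()
        ious.append(inter / union)
    return float(np.mean(ious))

# Toy 2x2 label maps with two classes (hypothetical example data):
pred = np.array([[0, 1], [1, 1]])
target = np.array([[0, 1], [0, 1]])
score = jaccard_index(pred, target, num_classes=2)
# class 0: inter=1, union=2 -> 0.5; class 1: inter=2, union=3 -> 2/3
```

In practice, the metric would be computed per image over all Cityscapes classes and averaged across the 500-image test set.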
Authors
Additional information
- DOI
- 10.15439/2023f3686
- Category
- Conference activity
- Type
- publication in a peer-reviewed collective work (including conference proceedings)
- Language
- English
- Publication year
- 2023