CNN Architectures for Human Pose Estimation from a Very Low Resolution Depth Image

Piotr Szczuko

doi:10.1109/hsi.2018.8431338

The paper is dedicated to proposing and evaluating a number of convolutional neural network architectures for calculating a multiple regression on 3D coordinates of human body joints tracked in a single low resolution depth image. The main challenge was to obtain a high precision in case of a noisy and coarse scan of the body, as observed by a depth sensor from a large distance. The regression network was expected to reason about relations of body parts based on depth image, and to extract locations of joints. The method involved creation of a dataset with 200,000 realistic depth images of a 3D body model, then training and testing numerous CNN architectures. The results are included and discussed. The achieved accuracy was similar to a reference Kinect algorithm results, with a great advantage of fast processing speed and significantly lower requirements on sensor resolution, as it used 100 times less pixels than Kinect depth sensor.

Authors

dr hab. inż. Piotr Szczuko link open in new tab

Additional information

DOI: Digital Object Identifier link open in new tab 10.1109/hsi.2018.8431338
Category: Aktywność konferencyjna
Type: publikacja w wydawnictwie zbiorowym recenzowanym (także w materiałach konferencyjnych)
Language: angielski
Publication year: 2018

Source: MOSTWiedzy.pl - publication "CNN Architectures for Human Pose Estimation from a Very Low Resolution Depth Image" link open in new tab

link open in new tab

Publications Repository - Gdańsk University of Technology

Treść strony