Implementation of the background subtraction algorithm on parallel GPUs is presented. The algorithm processes video streams and extracts foreground pixels. The work focuses on optimizing parallel algorithm implementation by taking into account specific features of the GPU architecture, such as memory access, data transfers and work group organization. The algorithm is implemented in both OpenCL and CUDA. Various optimizations of the algorithm are presented and tested using devices with varying processing power, including desktop PC graphic cards, ultrabooks and Tegra mobile processor. The main aim of the work is to determine if the optimized algorithm run on currently available GPUs is able to perform on-line processing of high resolution video streams.
Authors
Additional information
- DOI
- Digital Object Identifier link open in new tab 10.15199/13.2015.4.4
- Category
- Publikacja w czasopiśmie
- Type
- artykuły w czasopismach recenzowanych i innych wydawnictwach ciągłych
- Language
- angielski
- Publication year
- 2015