PERCEPTION

A fast feature extraction in object recognition using parallel processing on CPU and GPU

Junchul Kim, Eunsoo Park, Xuenan Cui, Hakil Kim, W.A. Gruver

Year
2009
Citations
32

Abstract

With the advent of multi-core CPUs and GPUs, parallel processing techniques have been widely applied in many fields, including computer vision. This paper presents a parallel processing technique for real-time feature extraction in object recognition by autonomous mobile robots. The technique uses both the CPU and the GPU by combining OpenMP, SSE (Streaming SIMD Extensions), and CUDA programming. First, the algorithms and code for feature extraction are optimized and implemented for parallel processing. Once the parallel algorithms are verified to maintain the same level of performance, the process of extracting key points and obtaining the dominant orientation of each key point is parallelized. The extraction is followed by the construction of a parallel descriptor using SSE instructions. Finally, a GPU version of SIFT is implemented using CUDA. The experiments show that the CPU version of SIFT is almost five times faster than the original SIFT while maintaining robust performance. Further, the GPU-Parallel descriptor achieves up to five times the acceleration of the CPU-Parallel descriptor, at the cost of slightly lower performance.
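As a rough illustration of how OpenMP and SSE can be combined on the CPU side, the sketch below normalizes a batch of 128-dimensional SIFT descriptors to unit length. It is not the authors' code: the abstract does not say which descriptor steps are vectorized, so the choice of L2 normalization, the flat memory layout, and the names `kDescriptorDim`, `sum_of_squares`, `scale`, and `normalize_descriptors` are all assumptions made for this example. OpenMP spreads the descriptors across cores, and SSE processes four floats per instruction within each descriptor, with a scalar fallback where SSE is unavailable.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>
#if defined(__SSE__)
#include <xmmintrin.h>  // SSE intrinsics (x86 only)
#endif

// Hypothetical constant: length of one SIFT descriptor.
constexpr std::size_t kDescriptorDim = 128;

// Sum of squared elements of one descriptor, four floats at a time with SSE.
inline float sum_of_squares(const float* d) {
#if defined(__SSE__)
    __m128 acc = _mm_setzero_ps();
    for (std::size_t i = 0; i < kDescriptorDim; i += 4) {
        const __m128 v = _mm_loadu_ps(d + i);      // unaligned load of 4 floats
        acc = _mm_add_ps(acc, _mm_mul_ps(v, v));   // acc += v * v (per lane)
    }
    float lanes[4];
    _mm_storeu_ps(lanes, acc);
    return lanes[0] + lanes[1] + lanes[2] + lanes[3];
#else
    float s = 0.0f;  // scalar fallback for non-SSE targets
    for (std::size_t i = 0; i < kDescriptorDim; ++i) s += d[i] * d[i];
    return s;
#endif
}

// Multiply every element of one descriptor by `factor`.
inline void scale(float* d, float factor) {
#if defined(__SSE__)
    const __m128 f = _mm_set1_ps(factor);
    for (std::size_t i = 0; i < kDescriptorDim; i += 4) {
        _mm_storeu_ps(d + i, _mm_mul_ps(_mm_loadu_ps(d + i), f));
    }
#else
    for (std::size_t i = 0; i < kDescriptorDim; ++i) d[i] *= factor;
#endif
}

// Normalize each descriptor in a flat buffer (assumed layout: descriptor k
// occupies elements [k * 128, (k + 1) * 128)). Descriptors are independent,
// so the outer loop is distributed across cores by OpenMP when enabled.
void normalize_descriptors(std::vector<float>& descriptors) {
    assert(descriptors.size() % kDescriptorDim == 0);
    const long count = static_cast<long>(descriptors.size() / kDescriptorDim);
    #pragma omp parallel for
    for (long k = 0; k < count; ++k) {
        float* d = descriptors.data() + static_cast<std::size_t>(k) * kDescriptorDim;
        const float norm = std::sqrt(sum_of_squares(d));
        if (norm > 0.0f) scale(d, 1.0f / norm);  // leave all-zero descriptors untouched
    }
}
```

Compiling with `-fopenmp` (GCC/Clang) enables the multi-core loop; without it the pragma is ignored and the code runs serially with the same result. The two levels of parallelism are independent here, which is what makes this kind of combination straightforward: threads never share a descriptor, and the SIMD lanes never cross descriptor boundaries.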

Keywords

Computer science, CUDA, SIMD, Parallel computing, Scale-invariant feature transform, Feature extraction, Parallel processing, Coprocessor, Central processing unit, General-purpose computing on graphics processing units
