Hand posture recognition robust to rotation using temporal correlation between adjacent frames

인접 프레임의 시간적 상관 관계를 이용한 회전에 강인한 손 모양 인식

  • Received : 2010.02.23
  • Accepted : 2010.10.06
  • Published : 2010.11.30

Abstract

Recently, there is an increasing need for developing the technique of Hand Gesture Recognition (HGR), for vision based interface. Since hand gesture is defined as consecutive change of hand posture, developing the algorithm of Hand Posture Recognition (HPR) is required. Among the factors that decrease the performance of HPR, we focus on rotation factor. To achieve rotation invariant HPR, we propose a method that uses the property of video that adjacent frames in video have high correlation, considering the environment of HGR. The proposed method introduces template update of object tracking using the above mentioned property, which is different from previous works based on still images. To compare our proposed method with previous methods such as template matching, PCA and LBP, we performed experiments with video that has hand rotation. The accuracy rate of the proposed method is 22.7%, 14.5%, 10.7% and 4.3% higher than ordinary template matching, template matching using KL-Transform, PCA and LBP, respectively.

최근 시각 기반 인터페이스의 실현을 위해 손 동작 인식 기술 개발의 필요성이 증가하고 있다. 이러한 시각 기반 인터페이스의 입력으로 사용되는 손 동작은 손 모양의 연속적인 변화로 정의되므로, 효율적인 손 모양 인식 알고리즘의 개발은 필수적이다. 본 논문에서는 손 모양 인식 과정 중 빈번히 발생할 수 있는 손의 회전에 의한 인식 성능 저하를 다룬다. 제안하는 방법은 회전에 강인한 손 모양 인식 알고리즘 개발을 위해 손 동작 인식 환경을 고려하여 비디오 내 인접 프레임간의 높은 상관관계를 이용한다. 특히, 정지 영상에 기반한 기존 연구와의 차별 점은 객체 추적에서 사용되는 템플릿 갱신을 손 모양 인식에 도입하였다는 것이다. 제안한 방법의 유효함을 보이기 위해, 손이 좌우로 회전하는 비디오를 입력으로 템플릿 정합 기반의 방법, PCA와 LBP을 제안하는 방법과 비교 실험하였다. 제안한 방법은 일반적인 템플릿 정합 기반의 손 모양 인식보다 22.7%, KL-Transform을 도입한 템플릿 정합보다 14.5%, PCA 보다 10.7%, LBP 보다 4.3%의 성능 개선을 보였다.

Keywords

References

  1. Richard Watson, "A Survey of Gesture Recognition Techniques," Technical Report, TCD-CS-1993-11, pp.1-31, 1993
  2. Pragati Garg, Naveen Aggarwal and Sanjeev Sofat, "Vision Based Hand Gesture Recognition," Proceedings of World Academy of Science, Engineering and Technology, vol. 49, pp 972-977, Jan., 2009
  3. Nuwan Gamage, Kuang Ye Chow and Rini Akmeliawati, "Static Hand Sign Recognition using Linear Projection Methods," Proceedings of the 4th International Conference on Autonomous Robots and Agents, pp.403-407, Feb., 2009
  4. Xiujuan Chai, Yikai Fang and Kongqiao Wang, "ROBUST HAND GESTURE ANALYSIS AND APPLICATION IN GALLERY BROWSING," IEEE International Coference on Multimedia & Expo, pp.938-941, Jun., 2009
  5. Qing Chen, Nicolas D. Georganas, Fellow, IEEE, and Emil M. Petriu, Fellow, IEEE, "Hand Gesture Recognition Using Haar-Like Features and a Stochastic Context-Free Grammar," IEEE Transactions On Instrumentation and Measurement, Vol. 57, No. 8, pp. 1562-1571, Aug., 2008 https://doi.org/10.1109/TIM.2008.922070
  6. Elena Sanchez-Nielsen, Luis Anton-Canalis and Mario Hernandez- Tejera, "Hand Getsure recognition for Human Machine Interaction," Journal of WSCG, Vol. 12, No. 1-3, pp. 395-402, Feb., 2004
  7. Erdem Yoruk, Ender Konukoglu, Bulent Sankur, Senior Member, IEEE, and Jerome Darbon, "Shape-Based Hand Recognition," IEEE Transactions On Image Processing, Vol. 15, No. 7, pp. 1803-1815, Jul., 2006 https://doi.org/10.1109/TIP.2006.873439
  8. Qutaishat Munib, Moussa Habeeb, Bayan Takruri and Hiba Abed Al-Malik, "American sign language (ASL) recognition based on Hough transform and neural networks," Expert Systems with Applications, Vol. 32, No.1, pp.24-37, Jan. 2007 https://doi.org/10.1016/j.eswa.2005.11.018
  9. Lilly Spirkovska and Max B. Reid, "Robust Position, Scale, and Rotation Invariant Object Recognition Using Higher-Order Neural Networks," Pattern Recognition, Vol. 25, No. 9, pp. 975-985, Sept., 1992 https://doi.org/10.1016/0031-3203(92)90062-N
  10. Iain Matthews, Takahiro Ishikawa, and Simon Baker, "The Template Update Problem," IEEE Transactions On Pattern Analysis and Machine Intelligence, Vol. 26, No. 6, pp. 810-815, Jun., 2004 https://doi.org/10.1109/TPAMI.2004.16
  11. Tao Wu, Xiaoqing Ding, and Shengjin Wang, "Video Tracking using Improved Chamfer Matching and Particle Filter," IEEE International Conference Computational Intelligence and Multimedia Applications, pp.l69-173, Dec., 2007
  12. F. Ullah and S. Kaneko, "Using orientation codes for rotation-invariant template matching," Pattern Recognition, Vol. 37, No. 2, pp.201-209, Feb., 2004 https://doi.org/10.1016/S0031-3203(03)00184-5
  13. Gunilla Borgefors, "Hierachical Chamfer Matching: A Parametric Edge Matching Algorithm," IEEE Transactions On Pattern Analysis and Machine Intelligence, Vol. 10, No. 6, pp. 849-865, Nov., 1988 https://doi.org/10.1109/34.9107
  14. Gianluigi Ciocca, Clauclio Cusano, Francesca Gasparini, and Raimondo Schettini, "Self-Adaptive Image Cropping for Small Displays," IEEE Transactions On Consumer Electronics, Vol. 53, No. 4, pp. 1622-1627, Nov., 2007 https://doi.org/10.1109/TCE.2007.4429261
  15. Luigi Di Stefano and Andrea Bulgarelli, "A Simple and Efficient Connected Components Labeling Algorithm," IEEE Image Analysis and Processing, pp. 322-327, Sept. 1999
  16. B. D. Lucas and T. Kanade, "An iterative image registration technique with an application to stereo vision," Proc. DARPA Image Understanding Workshop, pp.121-130, 1981
  17. Chao Hu, Max Qinghu Meng, Peter Xiaopig Liu and X. Wang, "Visual Gesture Recognition for Human-Machine Interface of Robot Teleoperation," IEEE International Conference on Intelligent Robots and Systems, pp. 1560-1565, Oct., 2003