DOI QR코드

DOI QR Code

An Improved Method for Detecting Caption in image using DCT-coefficient and Transition-map Analysis

DCT계수와 천이지도 분석을 이용한 개선된 영상 내 자막영역 검출방법

  • Received : 2011.01.01
  • Accepted : 2011.01.20
  • Published : 2011.04.30

Abstract

In this paper, we proposed the method for detecting text region on image using DCT-coefficient and transition-map analysis. The detecting rate of traditional method for detecting text region using DCT-coefficient analysis is high, but false positive detecting rate also is high and the method using transition-map often reject true text region in step of verification because of sticky threshold. To overcome these problems, we generated PTRmap(Promising Text Region map) through DCT-coefficient analysis and applied PTRmap to method for detecting text region using transition map. As the result, the false positive detecting rate decreased as compared with the method using DCT-coefficient analysis, and the detecting rate increased as compared with the method using transition map.

본 논문은 DCT계수와 천이지도 분석을 이용하여 영상 내 자막영역을 검출하는 방법에 대해 제안한다. 기존 DCT계수 분석방법을 이용한 문자영역탐지 방법은 검출률은 높으나 오검출률이 매우 높은 단점이 있고, 천이지도를 이용한문자영역 탐지 방법은 임계값이 정적이기때문에 문자영역 검증단계에서 실제문자영역이 기각되는 일이 빈번히 발생한다. 이러한 문제점을 해결하기 위해 DCT계수 분석방법을 이용하여 유망문자영역맵을 작성하고 이를 천이지도를 이용한 문자영역탐지 방법에 적용하여 임계값을 단계별로 정한다. 그 결과로서 DCT계수 분석을 이용한 문자영역검출방법에 비해 오검출률이 크게 감소하였으며, 기존 천이지도를 이용한 문자영역검출 방법보다 검출률이 크게 향상되었다.

Keywords

References

  1. Palaiahnakote Shivakumara. Trung Quy Phan. Chew Lim Tan.,"New Wavelet and Color Features for Text Detection in Video."., 20th International Conference on Pattern Recognition., 2010., vol.20 no.6 pp.3996-3999.
  2. Xueming Qian. Guizhong Liu. Huan Wang. Rui Su.,"Text detection, localization, and tracking in compressed video."., Signal processing, Image communication., 2007., vol.22 no.9 pp.752-768 https://doi.org/10.1016/j.image.2007.06.005
  3. Yu Zhong. HongJiang Zhang. Anil.Jain.,"Automatic Caption Localization in Compressed Video."., IEEE transactions on pattern analysis and machine intelligence, 2000., vol.2 no.11 pp.385-392
  4. Rainer Lienhart. Axel Wernicke.,"Localizing and segmenting text in images, videos and web pages,"., IEEE transactions on circuits and systems for video technology., 2002., vol.12, no.4, pp.256-268 https://doi.org/10.1109/76.999203
  5. Michael R.Lyu. Jiqiang Song. Min Cai., "A comprehensive method for multilingual video text detection, localization, and extraction", IEEE transactions on circuits and systems for video technology, a publication of the Circuits and Systems Society 2005., vol.15 no.2 pp.243-255 https://doi.org/10.1109/TCSVT.2004.841653
  6. Wonjun Kim. Changick Kim.,"A new approach for overlay text detection from complex video scene."., Journal of Broadcast Engineering., 2008., vol.13 no.43 pp.544-553 https://doi.org/10.5909/JBE.2008.13.4.544
  7. Wonjun Kim. Changick Kim.,"A New Approach for Overlay Text Detection and Extraction From Complex Video Scene."., IEEE transactions on image processing, a publication of the IEEE Signal Processing 2009., vol.18 no.2 pp.401-411 https://doi.org/10.1109/TIP.2008.2008225
  8. Anil K. Jain. and Bin Yu., "Automatic text location in images and video frames."., Pattern Recognition, vol.31 no,12 pp.2055-2076, 1998 https://doi.org/10.1016/S0031-3203(98)00067-3
  9. Rainer Lienhart, Frank Stuber., "Automatic text recognition in digital videos."., SPIE Internat. Soc. Opt. Eng., 1996, pp.180-188
  10. Rafael C. Gonzalez. Richard E. Woods., "Digital Image Processing, 3rd Edition", PEARSON, 2008
  11. Ah-hyun Cho. Hye-hyun Lee. Jae-uk Ryu. Kwang-Baek Kim., "The Extraction of Character from an English Name Card by Using Smearing Method and Contour Tracking Algorithm."., Proceedings of the Korea Inteligent Information System Society Conference 2002 May., 2002. pp.410-413

Cited by

  1. 질감과 깊이 특징 기반의 문자영역 추출 vol.14, pp.2, 2011, https://doi.org/10.5762/kais.2013.14.2.885