Trends on Object Detection Techniques Based on Deep Learning

Lee, J.S.; Lee, S.K.; Kim, D.W.; Hong, S.J.; Yang, S.I.

doi:10.22648/ETRI.2018.J.330403

Electronics and Telecommunications Trends (전자통신동향분석)

Volume 33, Issue 4 / Pages 23-32 / 2018 / 1225-6455 (pISSN)

Electronics and Telecommunications Research Institute (한국전자통신연구원)

딥러닝 기반 객체 인식 기술 동향

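Lee, J.S. (Knowledge e-Learning Research Group/UST);
Lee, S.K. (Knowledge e-Learning Research Group);
Kim, D.W. (Knowledge e-Learning Research Group);
Hong, S.J. (School of Games, Hongik University);
Yang, S.I. (Knowledge e-Learning Research Group)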

Published : 2018.08.01

https://doi.org/10.22648/ETRI.2018.J.330403

Abstract

Object detection, the task of identifying the objects in a visual scene and determining their locations, is a challenging problem in visual understanding research. It has recently been applied in fields such as autonomous driving, video surveillance, and face recognition. Traditional object detection methods rely on handcrafted features designed to cope with varied visual conditions, but they face a trade-off between accuracy and computational efficiency. Deep learning is a revolutionary paradigm in machine learning. Because deep-learning-based methods, particularly convolutional neural networks (CNNs), have outperformed conventional methods in object detection, they have been studied actively in recent years. In this article, we briefly summarize several recent deep-learning methods and network architectures for object detection. We also compare the performance of these methods and suggest research directions for the object detection field.

Acknowledgement

Grant: Digital Content In-House R&D

Supported by: Institute for Information & Communications Technology Promotion (IITP)

