DOI QR코드

DOI QR Code

Trends on Object Detection Techniques Based on Deep Learning

딥러닝 기반 객체 인식 기술 동향

  • Published : 2018.08.01

Abstract

Object detection is a challenging field in the visual understanding research area, detecting objects in visual scenes, and the location of such objects. It has recently been applied in various fields such as autonomous driving, image surveillance, and face recognition. In traditional methods of object detection, handcrafted features have been designed for overcoming various visual environments; however, they have a trade-off issue between accuracy and computational efficiency. Deep learning is a revolutionary paradigm in the machine-learning field. In addition, because deep-learning-based methods, particularly convolutional neural networks (CNNs), have outperformed conventional methods in terms of object detection, they have been studied in recent years. In this article, we provide a brief descriptive summary of several recent deep-learning methods for object detection and deep learning architectures. We also compare the performance of these methods and present a research guide of the object detection field.

Keywords

Acknowledgement

Grant : 디지털콘텐츠 In-House R&D

Supported by : 정보통신기술진흥센터

References

  1. M. Everingham et al., "The Pascal Visual Object Classes (VOC) Challenge," Int. J. Comput. Vision, vol. 88, no. 2, June 2010, pp. 303-338. https://doi.org/10.1007/s11263-009-0275-4
  2. O. Russakovsky et al., "ImageNet Large Scale Visual Recognition Challenge," Int. J. Comput. Vision, vol. 115, no. 3, Dec. 2015, pp. 211-252. https://doi.org/10.1007/s11263-015-0816-y
  3. T. Lin et al., "Microsoft COCO: Common Objects in Context," Eur. Conf. Comput. Vision(ECCV), Amsterdam, Netherlands, Oct. 8-16, 2014, pp. 740-755.
  4. D.G. Lowe, "Distinctive Image Features from Scale-Invariant Keypoints," Int. J. Comput. Vision, vol. 60, no. 2, 2004, pp. 91-110. https://doi.org/10.1023/B:VISI.0000029664.99615.94
  5. H. Bay et al., "Speeded-Up Robust Features (SURF)," Comput. Vision Image Understanding, vol. 110, no. 3, 2008, pp. 346-359. https://doi.org/10.1016/j.cviu.2007.09.014
  6. P. Viola and M. Jones, "Rapid Object Detection using a Boosted Cascade of Simple Features," Proc. IEEE Conput. Soc. Conf. Comput. Vision Pattern Recogn., Kauai, HI, USA, Dec. 8-14, 2001, pp. I:511-I:518
  7. N. Dalal and B. Triggs, "Histograms of Oriented Gradients for Human Detection," IEEE Comput. Soc. Conf. Comput. Vision Pattern Recogn., San Diego, CA, USA, June 20-25, 2015, pp. 886-893.
  8. P.F. Felzenszwalb et al., "Object Detection with Discriminatively Trained Part-Based Models," IEEE Trans. Pattern Anal. Mach. Intell., vol. 32, no. 9, 2010, pp. 1627-1645.
  9. A. Krizhevsky et al, "ImageNet Classification with Deep Convolutional Neural Networks," Conf. Neural Inform. Process. Syst., Lake Tahoe, NV, USA, Dec. 3-6, 2012, pp. 1097-1105.
  10. Y. Lecun et al., "Gradient-Based Learning Applied to Document Recognition," Proc. IEEE, vol. 86, no. 11, Nov. 1998, pp. 2278-2324. https://doi.org/10.1109/5.726791
  11. M.D. Zeiler and R. Fergus, "Visualizing and Understanding Convolutional Networks," Eur. Conf. Comput. Vision(ECCV), Amsterdam, Netherlands, Oct. 8-16, 2014, pp. 818-833.
  12. K. Simonyan et al., "Very Deep Convolutional Networks for Large-Scale Image Recognition," Int. Conf. Learning Representations, San Diego, USA, May 7-9, 2015.
  13. K. He et al., "Deep Residual Learning for Image Recognition," IEEE Conf. Comput. Vision Pattern Recogn., Las Vegas, NV, USA, June 27-30, 2016, pp. 770-778.
  14. C. Szegedy et al., "Going Deeper with Convolutions," IEEE Conf. Comput. Vision Pattern Recogn., Boston, MA, USA, June 7-12, 2015, pp. 1-9.
  15. G. Huang et al., "Densely Connected Convolutional Networks," IEEE Conf. Comput. Vision Pattern Recogn., Honolulu, HI, USA , July 21-26, 2017, pp. 2261-2269.
  16. R. Girshick et al., "Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation," IEEE Conf. Comput. Vision Pattern Recogn., Columbus, OH, USA, June 23-28, 2014, pp. 580-587.
  17. R. Girshick, "Fast R-CNN," IEEE Int. Conf. Comput. Vision, Santiago, Chile, Dec. 7-13, 2015, pp. 1440-1448.
  18. S. Ren et al., "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks," IEEE Trans. Pattern Anal. Mach. Intell., vol. 39, no. 6, 2017, pp. 1137-1149. https://doi.org/10.1109/TPAMI.2016.2577031
  19. J. Dai et al., "R-FCN: Object Detection via Region-based Fully Convolutional Networks," Conf. Neural Inform. Process. Syst., Barcelona, Spain, Dec. 4-6, 2016, pp. 379-387.
  20. J. Redmon et al., "You Only Look Once: Unified, Real-Time Object Detection," IEEE, Conf. Comput. Vision Pattern Recogn., Las Vegas, NV, USA, June 27-30, pp.779-788.
  21. W. Liu et al., "SSD: Single Shot MultiBox Detector," Eur. Conf. Comp. Vision, Amsterdam, Netherlands, Oct. 8-16, 2016, pp. 21-37.
  22. J.R.R. Uijlings et al., "Selective Search for Object Recognition," Int. J. Comput. Vision, vol. 104, no. 2, 2013, pp. 154-171. https://doi.org/10.1007/s11263-013-0620-5
  23. N. Srivastava et al., "Dropout: A Simple Way to Prevent Neural Networks from Overfitting," J. Mach. Learning Res., vol. 15, 2014, pp. 1929-1958.
  24. C. Szegedy et al., "Rethinking the Inception Architecture for Computer Vision," Comput. Vision Pattern Recogn., Las Vegas, NV, USA, June 27-30, 2016, pp. 2818-2826.
  25. C. Szegedy et al., "Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning," AAAI Conf. Artif. Intell., San Francisco, CA, USA, Feb. 4-9, 2017, pp. 4278-4284.
  26. J. Huang et al., "Speed/Accuracy Trade-offs for Modern Convolutional Object Detectors," Comput. Vision Pattern Recogn., Honolulu, HI, USA, July 22-24, 2017, pp. 7310-7319.
  27. K. He et al., "Mask R-CNN," IEEE Int. Conf. Comput. Vision, Venice, Italy, Oct. 22-29, 2017, pp. 2980-2988.
  28. J. Long et al., "Fully Convolutional Networks for Semantic Segmentation," IEEE Conf. Comput. Vision Pattern Recogn., Boston, MA, USA, june 7-12, pp. 3431-3440.