DOI QR코드

DOI QR Code

Efficient Swimmer Detection Algorithm using CNN-based SVM

  • Hong, Dasol (Dept. of Computer and Communications Engineering, Kangwon National University) ;
  • Kim, Yoon (Dept. of Computer and Communications Engineering, Kangwon National University)
  • Received : 2017.11.22
  • Accepted : 2017.12.09
  • Published : 2017.12.29

Abstract

In this paper, we propose a CNN-based swimmer detection algorithm. Every year, water safety accidents have been occurred frequently, and accordingly, intelligent video surveillance systems are being developed to prevent accidents. Intelligent video surveillance system is a real-time system that detects objects which users want to do. It classifies or detects objects in real-time using algorithms such as GMM (Gaussian Mixture Model), HOG (Histogram of Oriented Gradients), and SVM (Support Vector Machine). However, HOG has a problem that it cannot accurately detect the swimmer in a complex and dynamic environment such as a beach. In other words, there are many false positives that detect swimmers as waves and false negatives that detect waves as swimmers. To solve this problem, in this paper, we propose a swimmer detection algorithm using CNN (Convolutional Neural Network), specialized for small object sizes, in order to detect dynamic objects and swimmers more accurately and efficiently in complex environment. The proposed CNN sets the size of the input image and the size of the filter used in the convolution operation according to the size of objects. In addition, the aspect ratio of the input is adjusted according to the ratio of detected objects. As a result, experimental results show that the proposed CNN-based swimmer detection method performs better than conventional techniques.

Keywords

References

  1. Ministary of Public Safety and Security, "2016 Ministary of Public Safety and Security Statistics Annual Report," pp. 49-53, 2016.
  2. YuJung Choi, Yoon Kim, "Object Detecting and Tracking Algorithm In The Environment of Complex Beach," Korean Institute of Information Scientists and Engineers Academic Publications, Vol. 2016, No. 12, pp. 1123-1125, Dec. 2016.
  3. Z. Zivkovic, "Improved Adaptive Gaussian Mixture Model for Background Subtraction," Pattern Recognition, Vol. 2, pp. 28-31, Aug. 2004.
  4. N. Dalal and B. Triggs, Histogram of Oriented Gradients for Human Detection, In Proc. of the Int'l Conf. on Computer Vision and Pattern Recognition, pp. 886-893, San Diego, CA, June 2005.
  5. C. Burges, "A Tutorial on Support Vector Machines for Pattern Recognition," Data Mining and Knowledge Discovery, Vol. 2, Issue 2, pp. 121-167, June 1998. https://doi.org/10.1023/A:1009715923555
  6. Y. Bengio, "Learning Deep Architectures for AI," Foundations and Trends in Machine Learning, Vol. 2, No. 1, pp. 1-127, Nov. 2009. https://doi.org/10.1561/2200000006
  7. Y. Tang, "Deep Learning using Linear Support Vector Machines," ArXiv preprint arXiv:1306.0239, June 2013.
  8. S. Paisitkriangkrai, C. Shen, and A. Hengel, "Pedestrian Detection with Spatially Pooled Features and Structured Ensemble Learning," IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 38, Issue 6, pp. 1243-1257, Aug. 2016. https://doi.org/10.1109/TPAMI.2015.2474388
  9. H. Jiang and E. Learned-Miller, Face Detection with the Faster R-CNN, In Proc. of the Int'l Conf. on Automatic Face and Gesture Recognition, pp. 650-657, Washington, DC, May-June 2017.
  10. JiWoong Jang, GooMan Park, "License Plate Recognition System based on Normal CCTV," Journal of the Institute of Electronics and Information Engineers, Vol. 54, No. 8, pp. 89-96, Aug. 2017. https://doi.org/10.5573/ieie.2017.54.9.89
  11. S. Krig, "Interest Point Detector And Feature Descriptor Survey," Computer Vision Metrics, pp. 187-246, 2016.
  12. P. Viola and M. Jones, Rapid Object Detection Using a Boosted Cascade of Simple Features, In Proc. of the Int'l Conf. on Computer Vision and Pattern Recognition, pp. 1-9, Kauai, HI, Dec. 2001.
  13. P. Felzenszwalb, D. McAllester, and D. Ramanan, A Discriminatively Trained, Multiscale, Deformable Part Model, In Proc. of the Int'l Conf. on Computer Vision and Pattern Recognition, pp. 1-8, Anchorage, AK, June 2008.
  14. J. Redmon and A. Farhadi, "YOLO9000: Better, Faster, Stronger," Computer Vision and Pattern Recognition, arXiv:1612.08242, Dec. 2016.
  15. R. Girshick, Fast R-CNN, In Proc. of the IEEE Int'l Conf. on Computer Vision, pp. 1440-1448, Santiago, Chile, Dec. 2015.
  16. S. Ren, K. He, R. Girshick, and J. Sun, Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, In Proc. of the Int'l Conf. on Advances in Neural Information Processing Systems, pp. 91-99, Quebec, Canada, Dec. 2015.
  17. F. J. Huang and Y. LeCun, Large-Scale Learning with SVM and Convolutional for Generic Object Categorization, In Proc. of the Int'l Conf. on Computer Vision and Pattern Recognition, pp. 284-291, New York, NY, June 2006.
  18. J. F. Henriques and R. Caserio, "High-Speed Tracking with Kernelized Correlation Filters," IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 37, Issue 3, pp. 583-596, Aug. 2015. https://doi.org/10.1109/TPAMI.2014.2345390
  19. V. Nair and G. E. Hinton, Rectified Linear Units Improve Restricted Boltzmann Machines, In Proc. of the 27th Int'l Conf. on Machine Learning, pp. 807-814, Haifa, Israel, June 2010.
  20. K. Diederik and J. Ba, "Adam: A Method for Stochastic Optimization," ArXiv preprint arXiv:1412.6980, Dec. 2014.
  21. J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei., ImageNet: A Large-Scale Hierarchical Image Database, In Proc. of the Int'l Conf. on Computer Vision and Pattern Recognition, pp. 248-255, Miami, FL, June 2009.