A Study on Person Re-Identification System using Enhanced RNN

  • Choi, Seok-Gyu (Dept. Center for Advanced Image and Information Technology, School of Electronics & Information Engineering, ChonBuk National University)
  • Xu, Wenjie
  • Received : 2017.03.14
  • Accepted : 2017.04.07
  • Published : 2017.04.30

Abstract

Person re-identification is one of the most challenging problems in computer vision because of large variations in human pose, background clutter, and occlusion. Images captured by non-overlapping cameras make it even harder to distinguish one person from another. To improve matching performance, most methods handle feature selection and distance metric learning separately, obtaining discriminative representations and a suitable distance to describe the similarity between persons, but in doing so they tend to ignore some significant features. This motivated us to consider a new method for the problem. In this paper, we propose an enhanced recurrent neural network with a three-tier hierarchical network for person re-identification. Specifically, the proposed recurrent neural network (RNN) model combines an iterative expectation-maximization (EM) algorithm with a three-tier hierarchical network to jointly learn discriminative features and the distance metric. The iterative EM algorithm makes full use of the feature extraction ability of the convolutional neural network (CNN) placed in series before the RNN. Through unsupervised learning, the EM framework can update the labels of the patches and train on larger datasets. In the three-tier hierarchical network, the CNN, the recurrent network, and the pooling layer jointly act as a feature extractor, which helps train the network better. The experimental results show that the method achieves accuracy competitive with other approaches in this field. The influence of the individual components of the method will be analyzed and evaluated in future research.
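The three-tier idea described in the abstract (a CNN in series before a recurrent layer, followed by pooling, with a distance computed on the pooled descriptor) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration only: the abstract does not give layer sizes, losses, or the EM relabeling step, so the function names (`conv_features`, `rnn_states`, `mean_pool`, `embed`), the 1-D toy "patches", the random untrained weights, and the Euclidean distance are hypothetical stand-ins, not the authors' implementation.

```python
import math
import random


def conv_features(patch, kernels):
    """Tier 1 (toy stand-in for the CNN): 1-D valid convolution per kernel,
    then ReLU on the strongest response (global max pooling)."""
    feats = []
    for k in kernels:
        responses = [
            sum(patch[i + j] * k[j] for j in range(len(k)))
            for i in range(len(patch) - len(k) + 1)
        ]
        feats.append(max(0.0, max(responses)))
    return feats


def rnn_states(inputs, w_in, w_rec):
    """Tier 2: plain tanh recurrent layer run over the sequence of patch features."""
    hidden = [0.0] * len(w_rec)
    states = []
    for x in inputs:
        # The comprehension reads the previous hidden state before it is replaced.
        hidden = [
            math.tanh(
                sum(w_in[h][i] * x[i] for i in range(len(x)))
                + sum(w_rec[h][j] * hidden[j] for j in range(len(hidden)))
            )
            for h in range(len(w_rec))
        ]
        states.append(hidden)
    return states


def mean_pool(states):
    """Tier 3: average all hidden states into one fixed-length descriptor."""
    n = len(states)
    return [sum(s[d] for s in states) / n for d in range(len(states[0]))]


def embed(patches, params):
    """Chain the three tiers: CNN stand-in -> recurrent layer -> pooling."""
    kernels, w_in, w_rec = params
    feats = [conv_features(p, kernels) for p in patches]
    return mean_pool(rnn_states(feats, w_in, w_rec))


def euclidean(a, b):
    """Distance between two descriptors (assumed metric, not from the paper)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def random_params(n_kernels=3, kernel_size=3, hidden=4, seed=0):
    """Untrained random weights; a real system would learn these jointly."""
    rng = random.Random(seed)
    kernels = [[rng.uniform(-1, 1) for _ in range(kernel_size)] for _ in range(n_kernels)]
    w_in = [[rng.uniform(-1, 1) for _ in range(n_kernels)] for _ in range(hidden)]
    w_rec = [[rng.uniform(-1, 1) for _ in range(hidden)] for _ in range(hidden)]
    return kernels, w_in, w_rec


params = random_params()
person_a = [[0.1, 0.5, 0.9, 0.3], [0.2, 0.4, 0.8, 0.1]]  # two toy patches
person_b = [[0.9, 0.1, 0.0, 0.7], [0.6, 0.2, 0.1, 0.9]]
emb_a = embed(person_a, params)
emb_b = embed(person_b, params)
d_same = euclidean(emb_a, embed(person_a, params))
d_diff = euclidean(emb_a, emb_b)
```

Because the weights here are random, `d_diff` carries no meaning beyond showing the data flow; in a trained system the three tiers and the distance would be optimized together so that descriptors of the same person end up closer than descriptors of different people.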
