DOI QR코드

DOI QR Code

High-Quality Stereo Depth Map Generation Using Infrared Pattern Projection

  • Jeong, Jae-Chan (IT Convergence Technology Research Laboratory, ETRI, University of Science and Technology) ;
  • Shin, Hochul (IT Convergence Technology Research Laboratory, ETRI, University of Science and Technology) ;
  • Chang, Jiho (IT Convergence Technology Research Laboratory, ETRI) ;
  • Lim, Eul-Gyun (IT Convergence Technology Research Laboratory, ETRI) ;
  • Choi, Seung Min (IT Convergence Technology Research Laboratory, ETRI) ;
  • Yoon, Kuk-Jin (Computer Vision Laboratory, GIST) ;
  • Cho, Jae-Il (IT Convergence Technology Research Laboratory, ETRI)
  • Received : 2013.03.31
  • Accepted : 2013.10.26
  • Published : 2013.12.31

Abstract

In this paper, we present a method for obtaining a high-quality 3D depth. The advantages of active pattern projection and passive stereo matching are combined and a system is established. A diffractive optical element (DOE) is developed to project the active pattern. Cross guidance (CG) and auto guidance (AG) are proposed to perform the passive stereo matching in a stereo image in which a DOE pattern is projected. When obtaining the image, the CG emits a DOE pattern periodically and consecutively receives the original and pattern images. In addition, stereo matching is performed using these images. The AG projects the DOE pattern continuously. It conducts cost aggregation, and the image is restored through the process of removing the pattern from the pattern image. The ground truth is generated to estimate the optimal parameter among various stereo matching algorithms. Using the ground truth, the optimal parameter is estimated and the cost computation and aggregation algorithm are selected. The depth is calculated and bad-pixel errors make up 4.45% of the non-occlusion area.

Keywords

References

  1. J. Lee et al., "A Long-Range Touch Interface for Interaction with Smart TVs," ETRI J., vol. 34, no. 6, Dec. 2012. pp. 932-941. https://doi.org/10.4218/etrij.12.0111.0667
  2. S.J. Krotosky and M.M. Trivedi, "Person Surveillance Using Visual and Infrared Imagery," IEEE Trans. Circuits Syst. Video Technol., vol. 18, no. 8, Aug. 2008, pp. 1096-1105. https://doi.org/10.1109/TCSVT.2008.928217
  3. E. Koh, J. Lee, and J. Park, "Clausius Normalized Field-Based Stereo Matching for Uncalibrated Image Sequences," ETRI J., vol. 32, no. 5, Oct. 2010, pp. 750-760. https://doi.org/10.4218/etrij.10.1510.0067
  4. D. Scharstein and R. Szeliski, "A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms," Int. J. Comput. Vision, vol. 47, no. 1, May 2002, pp. 7-42. https://doi.org/10.1023/A:1014573219977
  5. T. Pribanic, N. Obradovic, and J. Salvi, "Stereo Computation Combining Structured Light and Passive Stereo Matching," Optics Commun., vol. 285, no. 6, 2012, pp. 1017-1022. https://doi.org/10.1016/j.optcom.2011.10.045
  6. J. Lim, "Optimized Projection Pattern Supplementing Stereo Systems," Proc. IEEE ICRA, 2009, pp. 2823-2829.
  7. J. Salvi, J. Pagès, and J. Batlle, "Pattern Codification Strategies in Structured Light Systems," Pattern Recognition, vol. 37, no. 4, 2004, pp. 827-849. https://doi.org/10.1016/j.patcog.2003.10.002
  8. R. Zabih and J. Woodfill, "Non-parametric Local Transforms for Computing Visual Correspondence," Proc. ECCV, 1994, pp. 151- 158.
  9. H. Hirschmuller and D. Scharstein, "Evaluation of Stereo Matching Costs on Images with Radiometric Differences," IEEE Trans. Pattern Anal. Mach. Intell., vol. 31, no. 9, 2009, pp. 1582-1599. https://doi.org/10.1109/TPAMI.2008.221
  10. A. Hosni et al., "Fast Cost-Volume Filtering for Visual Correspondence and Beyond," IEEE Trans. Pattern Anal. Mach. Intell., vol. 35, no. 2, Aug. 2012, pp. 504-511.
  11. X. Mei et al., "On Building an Accurate Stereo Matching System on Graphics Hardware," Proc. IEEE ICCV, 2011, pp. 467-474.
  12. C. Çigla and A.A. Alatan, "Efficient Edge-Preserving Stereo Matching," Proc. IEEE Int. Conf. Comput. Vision Workshops, 2011, pp. 696-699.
  13. A. Hosni, M. Gelautz, and M. Bleyer, "Accuracy-Efficiency Evaluation of Adaptive Support Weight Techniques for Local Stereo Matching," Pattern Recognition, LNCS, vol. 7476, 2002, pp. 337-346.
  14. F. Tombari et al., "Classification and Evaluation of Cost Aggregation Methods for Stereo Correspondence," Proc. IEEE Comput. Vision Pattern Recognition, 2008, pp. 1-8.
  15. C. Pham and J.W. Jeon, "Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching," IEEE Trans. Circuits Syst. Video Technol., vol. 27, no. 7, Oct. 2012, pp. 1119-1130.
  16. K.-J. Yoon and I.S. Kweon, "Adaptive Support-Weight Approach for Correspondence Search," IEEE Trans. Pattern Anal. Mach. Intell., vol. 28, no. 4, 2006, pp. 650-656. https://doi.org/10.1109/TPAMI.2006.70
  17. Middlebury Stereo Vision Page. http://vision.middlebury.edu/stereo
  18. Caltech Calibration Page. http://www.vision.caltech.edu/bouguetj/calib_doc/
  19. D. Scharstein and R. Szeliski, "High-Accuracy Stereo Depth Maps Using Structured Light," Proc. IEEE Comput. Vision Pattern Recognition, vol. 1, 2003, pp. I/195-I/202.

Cited by

  1. Multi-robot Mapping Using Omnidirectional-Vision SLAM Based on Fisheye Images vol.36, pp.6, 2013, https://doi.org/10.4218/etrij.14.0114.0584
  2. Depth map upsampling with image decomposition vol.51, pp.22, 2013, https://doi.org/10.1049/el.2015.1216
  3. Implementation of Real-Time Post-Processing for High-Quality Stereo Vision vol.37, pp.4, 2015, https://doi.org/10.4218/etrij.15.0114.1421
  4. Learning the Image Processing Pipeline vol.26, pp.10, 2017, https://doi.org/10.1109/tip.2017.2713942