DOI QR코드

DOI QR Code

Research Trends of Generative Adversarial Networks and Image Generation and Translation

GAN 적대적 생성 신경망과 이미지 생성 및 변환 기술 동향

  • Published : 2020.08.01

Abstract

Recently, generative adversarial networks (GANs) is a field of research that has rapidly emerged wherein many studies conducted shows overwhelming results. Initially, this was at the level of imitating the training dataset. However, the GAN is currently useful in many fields, such as transformation of data categories, restoration of erased parts of images, copying facial expressions of humans, and creation of artworks depicting a dead painter's style. Although many outstanding research achievements have been attracting attention recently, GANs have encountered many challenges. First, they require a large memory facility for research. Second, there are still technical limitations in processing high-resolution images over 4K. Third, many GAN learning methods have a problem of instability in the training stage. However, recent research results show images that are difficult to distinguish whether they are real or fake, even with the naked eye, and the resolution of 4K and above is being developed. With the increase in image quality and resolution, many applications in the field of design and image and video editing are now available, including those that draw a photorealistic image as a simple sketch or easily modify unnecessary parts of an image or a video. In this paper, we discuss how GANs started, including the base architecture and latest technologies of GANs used in high-resolution, high-quality image creation, image and video editing, style translation, content transfer, and technology.

Keywords

References

  1. Y. Jo et al., "SC-FEGAN: Face Editing Generative Adversarial Network with User's Sketch and Color," in CVPR, 2019.
  2. T. Park et al., "Semantic Image Synthesis with Spatially-Adaptive Normalization," in CVPR, 2019.
  3. I. Goodfellow et al., "Generative adversarial nets," in NIPS, 2014.
  4. A. Radford et al., "Unsupervised Representation Learning With Deep Convolutional Generative Adversarial Networks," in ICLR, 2016.
  5. D. Berthelot et al., "BEGAN: Boundary Equilibrium Generative Adversarial Networks," in arXiv, 2017.
  6. P. Isola et al., "Image-to-Image Translation with Conditional Adversarial Nets," in CVPR, 2017.
  7. Z. Liu et al., "Deep Learning Face Attributes in the Wild," in ICCV, 2015.
  8. T. Karras et al., "Progressive Growing of GANs for Improved Quality, Stability, and Variation," in ICLR, 2018.
  9. J. Zhao et al., "EBGAN: Energy-Based Generative Adversarial Networks," in ICLR, 2017.
  10. T. R. Shaham et al., "SinGAN: Learning a Generative Model from a Single Natural Image," in ICCV, 2019.
  11. T. Wang et al., "High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs," in CVPR, 2019.
  12. W. Sun et al., "Image Synthesis From Reconfigurable Layout and Style," in ICCV, 2019.
  13. Z. Bo et al., "Layout2image: Image Generation from Layout," in IJCV, 2020.
  14. C. Ledig et al., "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network," in CVPR, 2017.
  15. C. Dong et al., "Image Super-Resolution Using Deep Convolutional Networks," in TPAMI, 2015.
  16. J. Yu et al., "Generative Image Inpainting with Contextual Attention," in CVPR, 2018.
  17. J. Yu et al., "Free-Form Image Inpainting with Gated Convolution," in CVPR, 2019.
  18. K. Nazeri et al., "EdgeConnect: Structure Guided Image Inpainting using Edge Prediction," in ICCVW, 2019.
  19. T. Karras et al., "A Style-Based Generator Architecture for Generative Adversarial Networks," in CVPR, 2019.
  20. X. Huang et al., "Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization," in ICCV, 2017.
  21. T. Karras et al., "Analyzing and Improving the Image Quality of StyleGAN," in arXiv, 2019.
  22. Y. Choi et al., "StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation," in CVPR, 2018.
  23. Y. Choi et al., "StarGAN v2: Diverse Image Synthesis for Multiple Domains," in CVPR, 2020.