Generative Adversarial Networks: A Brief History and Overview

Authors

  • Akhil Gunasekaran University of California, Santa Cruz

DOI:

https://doi.org/10.47611/jsr.v12i1.1848

Keywords:

Deep Learning, Machine Learning, Artificial Intelligence

Abstract

Over the past decade, research in the field of Deep Learning has brought about novel improvements in image generation and feature learning; one such example being a Generative Adversarial Network. However, these improvements have been coupled with an increasing demand on mathematical literacy and previous knowledge in the field. Therefore, in this literature review, I seek to introduce Generative Adversarial Networks (GANs) to a broader audience by explaining their background and intuition at a more foundational level. I begin by discussing the mathematical background of this architecture, specifically topics in linear algebra and probability theory. I then proceed to introduce GANs in a more theoretical framework, along with some of the literature on GANs, including their architectural improvements and image-generation capabilities. Finally, I cover state-of-the-art image generation through style-based methods, as well as their implications on society.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References or Bibliography

M. Arjovsky, S. Chintala, and L. Bottou, “Wasserstein generative adversarial networks,” in International conference on machine learning, pp. 214–223, PMLR, 2017. http://proceedings.mlr.press/v70/arjovsky17a/arjovsky17a.pdf

A. Radford, L. Metz, and S. Chintala, “Unsupervised representation learning with deep convolutional generative adversarial networks,” arXiv preprint arXiv:1511.06434, 2015. http://doi.org/10.1007/978-3-319-71589-6_9

A. Brock, J. Donahue, and K. Simonyan, “Large scale gan training for high fidelity natural image synthesis,” arXiv preprint arXiv:1809.11096, 2018. https://arxiv.org/pdf/1809.11096.pdf

T. Karras, S. Laine, and T. Aila, “A style-based generator architecture for generative adversarial networks,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 4401–4410, 2019. http://doi.org/10.1109/CVPR.2019.00453

T. Brown, B. Mann, N. Ryder, M. Subbiah, J. D. Kaplan, P. Dhariwal, A. Neelakantan, P. Shyam, G. Sastry, A. Askell, et al., “Language models are few-shot learners,” Advances in neural information processing systems, vol. 33, pp. 1877–1901, 2020. https://arxiv.org/pdf/2005.14165.pdf

Y. LeCun, Y. Bengio, and G. Hinton, “Deep learning,” nature, vol. 521, no. 7553, pp. 436–444, 2015. http://doi.org/10.1038/nature14539

F. Rosenblatt, “The perceptron: a probabilistic model for information storage and organization in the brain.,” Psychological review, vol. 65, no. 6, p. 386, 1958. http://doi.org/10.1037/h0042519

M. Minsky, “Steps toward artificial intelligence,” Proceedings of the IRE, vol. 49, no. 1, pp. 8–30, 1961. http://doi.org/10.1109/JRPROC.1961.287775

A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification with deep convolutional neural networks,” Communications of the ACM, vol. 60, no. 6, pp. 84–90, 2017. https://doi.org/10.1145/3065386

I. Goodfellow, Y. Bengio, and A. Courville, Deep learning. MIT press, 2016 http://deeplearningbook.org

I. Goodfellow, “Nips 2016 tutorial: Generative adversarial networks,” arXiv preprint arXiv:1701.00160, 2016. https://arxiv.org/pdf/1701.00160.pdf

I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, “Generative adversarial networks,” Communications of the ACM, vol. 63, no. 11, pp. 139–144, 2020. https://doi.org/10.1145/3422622

S. Ioffe and C. Szegedy, “Batch normalization: Accelerating deep network training by reducing internal covariate shift,” in International conference on machine learning, pp. 448–456, PMLR, 2015. https://dl.acm.org/doi/10.5555/3045118.3045167

M. Mirza and S. Osindero, “Conditional generative adversarial nets,” arXiv preprint arXiv:1411.1784, 2014. https://arxiv.org/pdf/1411.1784.pdf

J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, “Imagenet: A large-scale hierarchical image database,” in 2009 IEEE conference on computer vision and pattern recognition, pp. 248–255, Ieee, 2009. https://doi.org/10.1109/CVPR.2009.5206848

DeepMind, https://tfhub.dev/deepmind/biggan-deep-256/1

L. A. Gatys, A. S. Ecker, and M. Bethge, “A neural algorithm of artistic style,” arXiv preprint arXiv:1508.06576, 2015. https://arxiv.org/pdf/1508.06576.pdf, https://doi.org/10.1167/16.12.326

X. Huang and S. Belongie, “Arbitrary style transfer in real-time with adaptive instance normalization,” In Proceedings of the IEEE international conference on computer vision, pp. 1501–1510, 2017. http://doi.org/10.1109/ICCV.2017.167

V. van Gogh, “The starry night.” 1889, https://www.moma.org/collection/works/79802

R. Nakano, “Arbitrary Style Transfer in the Browser.” https://reiinakano.com/arbitrary-image-stylization-tfjs/

G. E. Hinton, “A practical guide to training restricted boltzmann machines,” in Neural networks: Tricks of the trade, pp. 599–619, Springer, 2012. https://doi.org/10.1007/978-3-642-35289-8_32

D. P. Kingma and M. Welling, “Auto-encoding variational bayes,” arXiv preprint arXiv:1312.6114, 2013. https://arxiv.org/pdf/1312.6114.pdf

P. Dhariwal and A. Nichol, “Diffusion models beat gans on image synthesis,” Advances in Neural Information Processing Systems, vol. 34, pp. 8780–8794, 2021. https://papers.nips.cc/paper/2021/file/49ad23d1ec9fa4bd8d77d02681df5cfa-Paper.pdf

F. Yu, A. Seff, Y. Zhang, S. Song, T. Funkhouser, and J. Xiao, “Lsun: Construction of a large-scale image dataset using deep learning withhumans in the loop,” arXiv preprint arXiv:1506.03365, 2015. https://arxiv.org/pdf/1506.03365.pdf

E. Strubell, A. Ganesh, and A. McCallum, “Energy and policy considerations for deep learning in nlp,” arXiv preprint arXiv:1906.02243, 2019. https://arxiv.org/pdf/1906.02243.pdf

A. Lacoste, A. Luccioni, V. Schmidt, and T. Dandres, “Quantifying the carbon emissions of machine learning,” arXiv preprint arXiv:1910.09700, 2019. https://arxiv.org/pdf/1910.09700.pdf

Published

02-28-2023

How to Cite

Gunasekaran, A. (2023). Generative Adversarial Networks: A Brief History and Overview. Journal of Student Research, 12(1). https://doi.org/10.47611/jsr.v12i1.1848

Issue

Section

Review Articles