Three Common Types of Generative Adversarial Networks (GANs)

James Han
3 min readFeb 2, 2023

Generative adversarial networks (GANs) are a rapidly growing field in the world of artificial intelligence, first introduced by Ian Goodfellow et al. in 2014. They are a class of machine learning algorithms that are designed to generate new data that is similar to the input data.

GANs consist of two neural networks: a generator and a discriminator. The generator is responsible for creating new data, while the discriminator is responsible for evaluating the generated data to determine its similarity to the training data. The two networks are trained simultaneously, with the generator trying to create data that is indistinguishable from the training data and the discriminator trying to correctly identify whether the generated data is real or fake.

There are several variations of GANs, including Deep Convolutional GANs (DCGANs), Conditional GANs (CGANs), and Wasserstein GANs (WGANs), each with its own unique characteristics that make them suitable for different applications.

Deep Convolutional GANs (DCGANs) are a popular type of GANs that are used for generating images. DCGANs use deep convolutional neural networks to learn the features of the input data, which allows them to generate high-resolution images that are similar to the training data. The generator network in a DCGAN typically consists of transposed convolutional layers, while the discriminator network consists of convolutional layers. The use of convolutional layers allows DCGANs to take advantage of the spatial relationships in the input data, which results in high-quality generated images.

Conditional GANs (CGANs) are a type of GANs that allow the user to control the generated data by adding an additional input to the generator network. This input specifies the desired characteristics of the generated data, such as the color or shape of an image. CGANs can be thought of as a combination of a GAN and a conditional generative model, where the generator and discriminator are trained to take into account both the input data and the condition. This allows CGANs to generate data that is more in line with the user’s expectations.

Wasserstein GANs (WGANs) are a relatively new type of GANs that address some of the stability issues that are commonly encountered when training GANs. WGANs use the Wasserstein distance metric to evaluate the quality of the generated data, which has been shown to provide improved stability during training compared to other metrics. The Wasserstein distance metric measures the earth mover’s distance between the generated data and the training data, which provides a robust measure of the quality of the generated data. WGANs have been shown to be particularly effective for generating high-quality images.

GANs have been applied to a wide range of applications, including image generation, music synthesis, drug discovery, and even style transfer. The quality of the generated data has improved significantly in recent years due to advances in training methods and novel architectures, and it is likely that we will see continued improvement in the near future.

--

--