Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2021
Generative Modelling has been a very extensive area of research since it finds immense use cases across multiple domains. Various models have been proposed in the recent past including Fully Visible Belief Nets, NADE, MADE, Pixel RNN Variational Auto Encoders, Markov Chain, and Generative Adversarial Networks. Amongst all the models, Generative Adversarial Networks have been consistently showing huge potential and developments in the area of Art, Music, SemiSupervised learning, Handling Missing data, Drug Discovery, and unsupervised learning. This emerging technology has reshaped the research landscape in the field of generative modeling. The research in the area of Generative Adversarial Networks (GANs) was introduced by Ian J. Goodfellow et al in 2014 [1]. However, since its inception, various models have been proposed over the years and are considered state-of-the-art models in generative modeling. In this survey, we provide a comprehensive review of the original GAN model and it...
2020
Generative Adversarial Networks (GANs) are part of the deep generative model family and able to generate synthetic samples based on the underlying distribution of real-world data. With expanding interest new discoveries and recent advances are hard to follow. Recent advancements to stabilize training, will help GANs to open up new domains using adjusted architectures and loss functions. Various findings show, that GANS can be used to generate not only images, but is also useful for text and audio creation. This paper, presents an overview of different GAN architectures, giving summaries of the underlying fundamentals of each presented GAN. Furthermore, this paper presents look into four application domains and lists additional domains. Additionally, this paper summaries datasets and metrics used to evaluate GANs and present recent scientific advancements. Keywords–generative adversarial networks; machine learning; deep learning.
arXiv (Cornell University), 2017
Generative Adversarial Networks (GAN) have received wide attention in the machine learning field for their potential to learn high-dimensional, complex real data distribution. Specifically, they do not rely on any assumptions about the distribution and can generate real-like samples from latent space in a simple manner. This powerful property leads GAN to be applied to various applications such as image synthesis, image attribute editing, image translation, domain adaptation and other academic fields. In this paper, we aim to discuss the details of GAN for those readers who are familiar with, but do not comprehend GAN deeply or who wish to view GAN from various perspectives. In addition, we explain how GAN operates and the fundamental meaning of various objective functions that have been suggested recently. We then focus on how the GAN can be combined with an autoencoder framework. Finally, we enumerate the GAN variants that are applied to various tasks and other fields for those who are interested in exploiting GAN for their research.
arXiv (Cornell University), 2017
Generative Adversarial Networks(GAN) gets wide attention in machine learning field because of its massive potential to learn high dimensional, complex real data. Specifically, it does not need to do further distribution assumption and can simply infer real-like samples from latent space. This powerful property leads GAN to be applied various applications such as image synthesis, image attribute editing, image translation, domain adaptation and even to other academic fields. In this review paper, we aim to look into details of GAN. We discuss how GAN operates and fundamental meaning of various objective functions suggested. Then, we focus on how GAN can be combined with auto encoder framework which makes possible latent space to be managed. As an extension, we also point to GAN variants which are applied to various amount of tasks and other fields.
International Journal of Multimedia Information Retrieval, 2020
Deep neural networks have attained great success in handling high dimensional data, especially images. However, generating naturalistic images containing ginormous subjects for different tasks like image classification, segmentation, object detection, reconstruction, etc., is continued to be a difficult task. Generative modelling has the potential to learn any kind of data distribution in an unsupervised manner. Variational autoencoder (VAE), autoregressive models, and generative adversarial network (GAN) are the popular generative modelling approaches that generate data distributions. Among these, GANs have gained much attention from the research community in recent years in terms of generating quality images and data augmentation. In this context, we collected research articles that employed GANs for solving various tasks from popular databases and summarized them based on their application. The main objective of this article is to present the nuts and bolts of GANs, state-of-the-art related work and its applications, evaluation metrics, challenges involved in training GANs, and benchmark datasets that would benefit naive and enthusiastic researchers who are interested in working on GANs.
arXiv (Cornell University), 2022
Generative networks are fundamentally different in their aim and methods compared to CNNs for classification, segmentation, or object detection. They have initially not been meant to be an image analysis tool, but to produce naturally looking images. The adversarial training paradigm has been proposed to stabilize generative methods, and has proven to be highly successful-though by no means from the first attempt. This chapter gives a basic introduction into the motivation for Generative Adversarial Networks (GANs) and traces the path of their success by abstracting the basic task and working mechanism, and deriving the difficulty of early practical approaches. Methods for a more stable training will be shown, and also typical signs for poor convergence and their reasons. Though this chapter focuses on GANs that are meant for image generation and image analysis, the adversarial training paradigm itself is not specific to images, and also generalizes to tasks in image analysis. Examples of architectures for image semantic segmentation and abnormality detection will be acclaimed, before contrasting GANs with further generative modeling approaches lately entering the scene. This will allow a contextualized view on the limits but also benefits of GANs.
International journal for research in applied science and engineering technology, 2024
GANs to advanced variants like Progressive GANs [7] and StyleGANs [8] In conclusion, Generative Adversarial Networks (GANs) stand as a testament to the remarkable strides made in the field of artificial intelligence. From their inception to their current state, GANs have revolutionized the landscape of generative modeling, offering unparalleled capabilities in synthesizing realistic data across diverse domains. Through a thorough examination of GAN architecture, training methodologies, and applications, this paper has provided insights into the multifaceted nature of GANs and their profound impact on various industries. Despite challenges such as training instability and ethical concerns surrounding deepfake generation, GANs continue to push the boundaries of creativity and innovation. As researchers continue to refine GAN architectures and address inherent challenges, the potential for GANs to drive advancements in artificial intelligence and shape the future of creative synthesis remains boundless. With ongoing developments and the collective efforts of the research community, GANs are poised to continue their transformative journey, unlocking new frontiers in generative modeling and reshaping our understanding of artificial intelligence.
IEEE Access, 2018
The appearance of generative adversarial networks (GAN) provides a new approach and framework for computer vision. Compared with traditional machine learning algorithms, GAN works via adversarial training concept and is more powerful in both feature learning and representation. GAN also exhibits some problems, such as non-convergence, model collapse, and uncontrollability due to high degree of freedom. How to improve the theory of GAN and apply it to computer-vision-related tasks have now attracted much research efforts. In this paper, recently proposed GAN models and their applications in computer vision are systematically reviewed. In particular, we firstly survey the history and development of generative algorithms, the mechanism of GAN, its fundamental network structures, and theoretical analysis of the original GAN. Classical GAN algorithms are then compared comprehensively in terms of the mechanism, visual results of generated samples, and Frechet Inception Distance. These networks are further evaluated from network construction, performance, and applicability aspects by extensive experiments conducted over public datasets. After that, several typical applications of GAN in computer vision, including high-quality samples generation, style transfer, and image translation, are examined. Finally, some existing problems of GAN are summarized and discussed and potential future research topics are forecasted. INDEX TERMS Deep learning, generative adversarial networks (GAN), computer vision (CV), image generation, style transfer, image inpainting.
IRJET, 2023
Generative Adversarial Networks (GANs), a class of deep learning models that creates new data samples that resemble the original data, are in-depth examined in this research study. The article covers many GAN subtypes, including vanilla GANs, MedGANs, StyleGANs, and CycleGANs, and analyses their designs and training approaches. The study examines the many GAN applications, including text-to-image synthesis, data augmentation, and picture and video creation. There is also discussion of the difficulties each type of GAN method faces, including mode collapse, instability, and vanishing gradients. In-depth analysis is also given to the technical features of GANs, including the generator and discriminator networks, training loss functions, and regularization techniques. The research study examines current advancements in GANs, including self-attention, adversarial autoencoders, and attention mechanisms. Additionally, the paper addresses the ethical issues related to GANs, such as the possible exploitation of data created by GANs and bias in training data. The future potential and developments of GANs are discussed in the study, including its use to unsupervised representation learning and the creation of novel GAN architectures. The study emphasizes the need for more study to overcome GANs' problems and broaden their application to other fields. GANs are a fast-developing subject of study with enormous potential in many areas.
In the last few years, a type of generative model known as Generative Adversarial Networks (GANs), has achieved tremendous success mainly in the field of computer vision, image classification, speech and language processing, etc. GANs are the models which are used to produce new samples which have similar data distribution as of the training dataset. In this review paper, we will first introduce the idea behind the GANs, followed by a brief overview of various types of GANs as well as comparing it with different generative models. Then, we will discuss the application range and finally the future work with its associated research frontiers.
arXiv (Cornell University), 2023
The launch of ChatGPT has garnered global attention, marking a significant milestone in the field of Generative Artificial Intelligence. While Generative AI has been in effect for the past decade, the introduction of ChatGPT has ignited a new wave of research and innovation in the AI domain. This surge in interest has led to the development and release of numerous cutting-edge tools, such as Bard, Stable Diffusion, DALL-E, Make-A-Video, Runway ML, and Jukebox, among others. These tools exhibit remarkable capabilities, encompassing tasks ranging from text generation and music composition, image creation, video production, code generation, and even scientific work. They are built upon various state-of-the-art models, including Stable Diffusion, transformer models like GPT-3 (recent GPT-4), variational autoencoders, and generative adversarial networks. This advancement in Generative AI presents a wealth of exciting opportunities and, simultaneously, unprecedented challenges. Throughout this paper, we have explored these state-of-the-art models, the diverse array of tasks they can accomplish, the challenges they pose, and the promising future of Generative Artificial Intelligence.
Recently, generative adversarial networks (GANs) have become a research focus of artificial intelligence. Inspired by two-player zero-sum game, GANs comprise a generator and a discriminator, both trained under the adversarial learning idea. The goal of GANs is to estimate the potential distribution of real data samples and generate new samples from that distribution. Since their initiation, GANs have been widely studied due to their enormous prospect for applications, including image and vision computing, speech and language processing, etc. In this review paper, we summarize the state of the art of GANs and look into the future. Firstly, we survey GANs' proposal background, theoretic and implementation models, and application fields. Then, we discuss GANs' advantages and disadvantages, and their development trends. In particular, we investigate the relation between GANs and parallel intelligence, with the conclusion that GANs have a great potential in parallel systems research in terms of virtual-real interaction and integration. Clearly, GANs can provide substantial algorithmic support for parallel intelligence.
IEEE, 2024
Generative adversarial networks (GANs) are a cutting-edge approach to generative modeling in deep learning. GAN’s was proposed in 2014 by Ian Goodfellow. Since then, there has been significant growth in adversarial networks. New breakthroughs and innovative approaches in generative adversarial networks can radically elevate and increase the quality of synthetically generated images by extracting patterns from the original datasets. Among the major advancements of GAN, image synthesis is the most prominent and extensively studied application. The concept of adversarial training, where two neural networks compete against each other, has introduced a novel paradigm for learning complex patterns in the images. The paper emphasizes the vital role of GANs in strengthening and fine-tuning datasets, honing further research to create GANs capable of producing high-quality synthetic samples with constrained practice of data.
2020
Generative models are widely used in many subfields of AI and Machine Learning. More recently generative models using deep learning have been employed in a creative manner to generate new media (images, text and music), but they have also been applied to areas such as drug discovery and data anonymisation. This workshop focused on the applications and research problems related to the practical use of deep generative models in the real world and where examples ranged from drug discovery to bespoke furniture generation.
COJ Robotics & Artificial Intelligence, 2020
Generative adversarial networks have become increasingly popular since they were first introduced in 2014. Many variants of GANs have been developed over the years and employed in a range of applications from computer vision to audio generation and medical imaging. As its applications in computer vision have been widely explored by the artificial intelligence community, here, we focus on more specific applications of GANs, namely audio generation and medical image synthesis. In the age of big data, these two fields still struggle with the scarcity of labelled data, hence they benefit greatly from the capabilities of GANs.
2020 International Joint Conference on Neural Networks (IJCNN), 2020
In this paper, we propose a novel technique for training Generative Adversarial Networks (GANs) using autoencoders. GANs, in recent years, have emerged as one of the most popular generative models. Despite their success, there are several challenges in maintaining the trade-off between diversity and quality of the generated distribution. Our idea stems from the fact that deeper layers of an autoencoder contain high-level feature representation of the input data distribution. Reusing these layers provides GAN with information about the representative characteristics of real data and hence can guide its adversarial training. We call our model Guided GAN since the autoencoder (guiding network) provides a direction to train the GAN (generative network). Guided GAN also minimizes both the forward and reverse Kullback-Leibler (KL) divergence in a single model, exploiting the complementary statistical properties of the two. We conduct extensive experiments and use various metrics for asses...
Preprints, 2023
Generative artificial intelligence (GenAI) has been developing with many incredible achievements like ChatGPT and Bard. Deep generative model (DGM) is a branch of GenAI, which is preeminent in generating raster data such as image and sound due to strong points of deep neural network (DNN) in inference and recognition. The built-in inference mechanism of DNN, which simulates and aims to synaptic plasticity of human neuron network, fosters generation ability of DGM which produces surprised results with support of statistical flexibility. Two popular approaches in DGM are Variational Autoencoders (VAE) and Generative Adversarial Network (GAN). Both VAE and GAN have their own strong points although they share and imply underline theory of statistics as well as incredible complex via hidden layers of DNN when DNN becomes effective encoding/decoding functions without concrete specifications. In this research, VAE and GAN is unified into a consistent and consolidated model called Adversarial Variational Autoencoders (AVA) in which VAE and GAN complement each other, for instance, VAE is good at generator by encoding data via excellent ideology of Kullback-Leibler divergence and GAN is a significantly important method to assess reliability of data which is realistic or fake. In other words, AVA aims to improve accuracy of generative models, besides AVA extends function of simple generative models. In methodology this research focuses on combination of applied mathematical concepts and skillful techniques of computer programming in order to implement and solve complicated problems as simply as possible.
ACM Computing Surveys
Generative Adversarial Networks (GANs) have received wide attention in the machine learning field for their potential to learn high-dimensional, complex real data distribution. Specifically, they do not rely on any assumptions about the distribution and can generate real-like samples from latent space in a simple manner. This powerful property allows GANs to be applied to various applications such as image synthesis, image attribute editing, image translation, domain adaptation, and other academic fields. In this article, we discuss the details of GANs for those readers who are familiar with, but do not comprehend GANs deeply or who wish to view GANs from various perspectives. In addition, we explain how GANs operates and the fundamental meaning of various objective functions that have been suggested recently. We then focus on how the GAN can be combined with an autoencoder framework. Finally, we enumerate the GAN variants that are applied to various tasks and other fields for those...
2017 IEEE International Conference on Image Processing (ICIP), 2017
This paper proposes an extension to the Generative Adversarial Networks (GANs), namely as ARTGAN to synthetically generate more challenging and complex images such as artwork that have abstract characteristics. This is in contrast to most of the current solutions that focused on generating natural images such as room interiors, birds, flowers and faces. The key innovation of our work is to allow backpropagation of the loss function w.r.t. the labels (randomly assigned to each generated images) to the generator from the discriminator. With the feedback from the label information, the generator is able to learn faster and achieve better generated image quality. Empirically, we show that the proposed ARTGAN is capable to create realistic artwork, as well as generate compelling real world images that globally look natural with clear shape on CIFAR-10.
ArXiv, 2021
-Pablo Picasso. For centuries, humans have dedicated themselves to producing arts to convey their imagination. The advancement in technology and deep learning in particular, has caught the attention of many researchers trying to investigate whether art generation is possible by computers and algorithms. Using generative adversarial networks (GANs), applications such as synthesizing photorealistic human faces and creating captions automatically from images were realized. This survey takes a comprehensive look at the recent works using GANs for generating visual arts, music, and literary text. A performance comparison and description of the various GAN architecture are also presented. Finally, some of the key challenges in art generation using GANs are highlighted along with recommendations for future work.
INTERNATIONAL JOURNAL OF ADVANCE RESEARCH, IDEAS AND INNOVATIONS IN TECHNOLOGY
GAN stands for Generative Adversarial Networks. GANs are the most interesting topics in Deep Learning. The concept of GAN is introduced by Ian Good Fellow and his colleagues at the University of Montreal. The main architecture of GAN contains two parts: one is a Generator and the other is Discriminator. The name Adversarial stands for conflict and here the conflict is present between Generator and Discriminator. And hence the name adversarial comes to this concept. In this paper, the author has investigated different ways GAN's are used in real time applications and what are the different types of GAN's present. GAN's are mainly important for generating new data from existing ones. As a machine learning model cannot work properly if the size of the dataset is small GAN's are here to help to increase the size by creating new fake things from original ones. GAN's are also used in creating images from the given words that are text-to-image conversion. GANs are also applied in image resolution, image translation and in many other scenarios. From this survey on GAN author aim to know what are the different applications of GAN that are present and their scope. The author has also aimed at knowing the different types of GAN's available at present.
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.