COJ Robotics & Artificial Intelligence

Transition Yourself - Emotion, Gender and Region

Gayathri R1*, Neelanarayanan V2 and Malav M2

1School of Computer Science and Engineering, Vellore Institute of Technology, India

2School of Computer Science and Engineering, Vellore Institute of Technology, India

*Corresponding author: Gayathri R, School of Computer Science and Engineering, Vellore Institute of Technology, India

Submission: May 24, 2022; Published: August 10, 2022

DOI: 10.31031/COJRA.2022.02.000536

ISSN: 2832-4463
Volume 2, Issue 3

Abstract

Image-to-image translation belongs to a class of vision and graphics problems in which the goal is to learn the mapping between an input image and an output image using a training set of aligned image pairs. This research focuses on using image transitioning to build a Java Spring based web app that can transition a portrait into different human emotions (i.e., happy, sad, angry and frowning), and simultaneously identify and transition its demographic region (i.e., America, Europe, Asia, South Asia); the same can be applied to biological gender (i.e., male, female). The study uses a proven deep learning methodology, Generative Adversarial Networks (GANs).

Keywords: Generative Adversarial Network (GAN); Deep learning; Pix2pix GAN; CycleGAN; Web app; Image-to-image transition

Introduction

Many problems in image processing, computer graphics, and computer vision can be framed as "translating" an input image into a corresponding output image. A scene can be represented as an RGB image, a gradient field, an area map, a semantic label map, and so on, in the same way that a concept can be communicated in both English and French. In analogy to automatic language translation, we define automatic image-to-image translation as the task of converting one possible representation of a scene into another, given sufficient training data. Traditionally, each of these tasks has been handled by separate, special-purpose machinery, despite the fact that the setting is always the same: predict pixels from pixels. The goal of this study is to provide a framework for transitioning a dataset of portraits from one region, gender, and emotion to another [1]. This is achieved with deep learning, specifically generative adversarial networks in their cycle-consistent form (CycleGANs).

It would be desirable if we could specify only a high-level goal, such as "make the output indistinguishable from reality," and have the image-to-image translation model learn a loss appropriate to that goal. Fortunately, this is exactly what the recently proposed Generative Adversarial Networks (GANs) do. GANs learn a loss that tries to classify whether the output image is real or fake, while simultaneously training a generative model to minimize this loss. Blurry images will not be tolerated, since they look obviously fake. In this work, GANs are explored in the conditional setting. Just as GANs learn a generative model of data, conditional GANs (cGANs) learn a conditional generative model [2]. This makes cGANs suitable for image translation tasks, where we condition on an input image and generate a corresponding output image. GANs have been studied intensively in recent years, and many of the techniques we discuss in this study have been previously proposed. However, earlier work focused on specific applications, and it has remained unclear how effective conditional GANs can be as a general-purpose solution for image translation. Our first contribution is to demonstrate that conditional GANs can solve a wide range of problems with reasonable results.

Related Work

Generative Adversarial Networks (GANs) have achieved impressive results in image generation, photo editing, and representation learning. Recent methods adopt the same idea for conditional image generation applications, including image-to-region, image-to-gender, and image-to-emotion translation, as well as other domains such as videos and 3D data [3]. The key to GANs' success is the idea of an adversarial loss that forces the generated images to be, in principle, indistinguishable from real images. This loss is particularly powerful for image generation tasks, as this is exactly the objective that much of computer graphics aims to optimize. We adopt an adversarial loss to learn the mapping such that the translated images cannot be distinguished from images in the target domain.

Image-to-image translation: the idea goes back at least to Hertzmann et al.'s Image Analogies, which employs a non-parametric texture model on a single input-output training image pair. More recent approaches use a dataset of input-output examples to learn a parametric translation function using CNNs. Our method builds on the "CycleGAN" framework, which uses a conditional generative adversarial network to learn a mapping from input to output images [4]. Similar ideas have been applied to various tasks, such as generating images from photos of people or from attribute and semantic layouts.

Method

GANs are generative models that learn a mapping from a random noise vector z to an output image y, G: z → y. In contrast, conditional GANs learn a mapping from an observed image x and a random noise vector z to y, G: {x, z} → y. The generator G is trained to produce outputs that cannot be distinguished from "real" images by an adversarially trained discriminator, D, which in turn is trained to do as well as possible at detecting the generator's "fakes" [5].
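Written out, the adversarial game behind this conditional formulation [1,2] is the standard conditional GAN objective, restated here for reference:

L_cGAN(G, D) = E_{x,y}[log D(x, y)] + E_{x,z}[log(1 − D(x, G(x, z)))],

where G tries to minimize this objective against an adversarial D that tries to maximize it, i.e. G* = arg min_G max_D L_cGAN(G, D).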

Understanding and implementing CycleGAN in TensorFlow

The paper we are going to implement is titled "Image-to-Image Translation to region, emotion and gender using Cycle-Consistent Adversarial Networks". The title is quite a mouthful, and it helps to look at each phrase individually before trying to understand the model all at once. However, the adversarial training strategy above has a flaw. Citing the original paper's authors: adversarial training can, in theory, learn mappings G and F that produce outputs identically distributed as the target domains Y and X respectively. However, with large enough capacity, a network can map the same set of input images to any random permutation of images in the target domain, where any of the learned mappings can induce an output distribution that matches the target distribution [6]. Thus, an adversarial loss alone cannot guarantee that the learned function will map an individual input x_i to a desired output y_i. To regularize the model, the authors introduce the constraint of cycle consistency: if we translate from the source distribution to the target and then back again to the source, we should recover samples from our source distribution.
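In the cited CycleGAN paper [4], this constraint is imposed through a cycle-consistency loss that penalizes reconstruction error in both directions:

L_cyc(G, F) = E_x[||F(G(x)) − x||_1] + E_y[||G(F(y)) − y||_1],

so that translating a sample to the other domain and back should return approximately the original sample.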

Implementation Process

The project uses several technologies, so to start the implementation we need to install some packages such as TensorFlow, since we are working with CycleGAN; the install command is shown in Figure 1. After installing the packages we need to download the dataset. In this project the dataset being collected is of male and female faces. To learn the workflow and start training, we first train on a different dataset, the horse and zebra dataset. The dataset contains four folders, TrainA, TrainB, ValA and ValB, which hold the horse and zebra pictures. We collected more than 100 pictures and split them across these folders: 90% of the images go into the TrainA folder and the remaining 10% into the ValA folder. This split is needed because, during training, we must validate whether images are real or fake, so the images are separated into different folders [7]. After setting up the dataset we have to choose which folder we are going to train on; in our case we use the TrainA folder. We then add the images and download them via the terminal.
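The command in Figure 1 is not shown in this text version; a minimal sketch of the install step, assuming the pip package manager (and noting that the test scripts used later in this walkthrough are PyTorch-based, so torch is likely needed as well), would be:

# install TensorFlow, the package named in the text
pip install tensorflow
# the CycleGAN/pix2pix test and train scripts used below also rely on PyTorch
pip install torch torchvision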

Figure 1:


Figure 2:


Figure 3:


To download the images we apply the command shown in Figure 2. When this command runs, the terminal connects to the server where the dataset is hosted and downloads it. After that we need to set which folder of images we are going to train on, and that folder is TrainA. When we run the command, the terminal downloads all the images, and the downloaded model is saved to './checkpoints/horse2zebra_pretrained/latest_net_G.pth'. Having done all this, we have a model, which we call the horse2zebra model. To test the model we run the command shown in Figure 3. After running the command, testing starts. It takes some time, because the model loads all the images and tests each of them. After testing completes we need to generate the results. Since the results cannot be displayed in the terminal, we generate a page that can be viewed in the browser at localhost and show the results there. To generate the results we run the following command: python test.py --dataroot datasets/horse2zebra/testA --name horse2zebra_pretrained --model test --no_dropout. After running the command, the terminal shows the following error: module 'torch._C' has no attribute '_cuda_setDevice'.
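The commands in Figures 2 and 3 are likewise not shown; assuming they come from the publicly available pytorch-CycleGAN-and-pix2pix repository (whose checkpoint layout matches the path above), the download and test steps would plausibly be:

# download the horse2zebra dataset into ./datasets/horse2zebra/
bash ./datasets/download_cyclegan_dataset.sh horse2zebra
# download the pretrained generator to ./checkpoints/horse2zebra_pretrained/latest_net_G.pth
bash ./scripts/download_cyclegan_model.sh horse2zebra
# translate the test images in one direction with the pretrained model
python test.py --dataroot datasets/horse2zebra/testA --name horse2zebra_pretrained --model test --no_dropout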

This error appears because the computer we are using only has a CPU. For further testing and training we need an NVIDIA GPU, because we cannot train the whole dataset on a CPU.

To overcome the error, we add the option --gpu_ids -1.

Why do we need to add this option? The model exposes many options, and one of them is gpu_ids, which is a string argument listing the GPU ids to use. We pass -1 because GPU numbering starts at 0, and the value -1 tells the script to run on the CPU instead.

So the next command we run is: python test.py --dataroot datasets/horse2zebra/testA --name horse2zebra_pretrained --model test --no_dropout --gpu_ids -1

After running the command, testing starts and the script connects to the server. The script prints all of the options that were configured in the code.

Many options are shown in that printout, and we can see the gpu_ids value that we set for this test.
A. The option --model test is used for generating results of CycleGAN only for one side. This option automatically sets --dataset_mode single, which only loads images from one set. By contrast, using --model cycle_gan requires loading and generating results in both directions, which is sometimes unnecessary. The results will be saved at ./results/ (Figure 4). Use --results_dir {directory_path_to_save_result} to specify the results directory.

Figure 4:


B. For pix2pix and your own models, you need to explicitly specify --netG, --norm, --no_dropout to match the generator architecture of the trained model.

After successfully running the test command, we can train the model. To train it we run the following command:

python train.py --dataroot ./datasets/horsetozebra --name facades_pix2pix --model pix2pix --direction BtoA
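Once training finishes, a plausible way to inspect the trained model, assuming the same repository layout and the experiment name chosen above, is the matching test command; its outputs are written as images plus an HTML index page under ./results/:

python test.py --dataroot ./datasets/horsetozebra --name facades_pix2pix --model pix2pix --direction BtoA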

Dataset

Affectiva's emotion database has now grown to nearly 6 million faces analysed in 75 countries. To be precise, it now contains 5,313,751 face videos, for a total of 38,944 hours of data, representing nearly 2 billion facial frames analysed. This global data set is the largest of its kind, representing spontaneous emotional responses of consumers while they go about a variety of activities. To date, the majority of the database consists of viewers watching media content (i.e., ads, movie trailers, television shows and online viral campaigns) [8]. In the past year, the repository has been expanded to include other contexts such as videos of people driving their cars, people in conversational interactions and animated GIFs. Link: https://blog.affectiva.com/the-worlds-largest-emotion-database-5.3-million-faces-and-counting

Literature Review

Paper 1: Generative adversarial networks

In this paper the authors propose a new framework for estimating generative models via an adversarial process, in which two models are trained simultaneously: a generative model G that captures the data distribution, and a discriminative model D that estimates the probability that a sample came from the training data rather than from G. The training procedure for G is to maximize the probability of D making a mistake. This framework corresponds to a minimax two-player game. In the space of arbitrary functions G and D, a unique solution exists, with G recovering the training data distribution and D equal to 1/2 everywhere. In the case where G and D are defined by multilayer perceptrons, the entire system can be trained with backpropagation. There is no need for any Markov chains or unrolled approximate inference networks during either training or generation of samples [9]. Experiments demonstrate the potential of the framework through qualitative and quantitative evaluation of the generated samples.
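The two-player game described above is formalized in the original paper [9] through the value function

min_G max_D V(D, G) = E_{x∼p_data(x)}[log D(x)] + E_{z∼p_z(z)}[log(1 − D(G(z)))],

whose unique equilibrium is reached when G reproduces the data distribution and D outputs 1/2 everywhere.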

Paper 2: Image-to-image translation with conditional adversarial networks

They investigate conditional adversarial networks as a general-purpose solution to image-to-image translation problems. These networks not only learn the mapping from input image to output image, but also learn a loss function to train this mapping. This makes it possible to apply the same generic approach to problems that would traditionally require very different loss formulations. The paper shows that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks. Indeed, since the release of the pix2pix software associated with the paper, a large number of internet users (many of them artists) have posted their own experiments with the system, further demonstrating its wide applicability and ease of adoption without the need for parameter tweaking. As a community, we no longer hand-engineer our mapping functions, and this work suggests we can achieve reasonable results without hand-engineering our loss functions either.

Paper 3: Unpaired image-to-image translation using cycle-consistent adversarial networks

Image-to-image translation is a class of vision and graphics problems in which the goal is to learn the mapping between an input image and an output image using a training set of aligned image pairs. However, for many tasks, paired training data is not available. This paper presents a method for learning to translate an image from a source domain X to a target domain Y in the absence of paired examples. The goal is to learn a mapping G: X → Y such that the distribution of images from G(X) is indistinguishable from the distribution Y, using an adversarial loss. Because this mapping is highly under-constrained, the authors couple it with an inverse mapping F: Y → X and introduce a cycle-consistency loss to enforce F(G(X)) ≈ X (and vice versa). Qualitative results are presented on several tasks where paired training data does not exist, including collection style transfer, object transfiguration, season transfer and photo enhancement. Quantitative comparisons against several prior methods demonstrate the superiority of the approach.
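Combining the pieces, the full objective optimized in that paper is the sum of two adversarial losses and the weighted cycle-consistency term:

L(G, F, D_X, D_Y) = L_GAN(G, D_Y, X, Y) + L_GAN(F, D_X, Y, X) + λ L_cyc(G, F),

where λ controls the relative importance of cycle consistency, and the solution is G*, F* = arg min_{G,F} max_{D_X,D_Y} L(G, F, D_X, D_Y).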

Paper 4: CycleGAN-based emotion style transfer as data augmentation for speech emotion recognition

Cycle-consistent adversarial networks (CycleGAN) have shown great success in image style transfer with unpaired datasets. Inspired by this, the authors investigate emotion style transfer to generate synthetic data, aiming to address the data scarcity problem in speech emotion recognition. Specifically, they propose a CycleGAN-based method to transfer feature vectors extracted from a large unlabeled speech corpus into synthetic features representing given target emotions. They extend the CycleGAN framework with a classification loss that improves the discriminability of the generated data. To show the effectiveness of the proposed method, they present results for speech emotion recognition using the generated feature vectors (i) as augmentation of the training data, and (ii) as a standalone training set. Their experiments show that when synthetic feature vectors are used, classification performance improves in both within-corpus and cross-corpus evaluation [10].

Paper 5: Segmentation-enhanced CycleGAN

Algorithmic reconstruction of neurons from electron microscopy data traditionally requires training machine learning models on dataset-specific ground truth annotations that are expensive and tedious to acquire. The authors enhance the training procedure of an unsupervised image-to-image translation method with additional components derived from an automated neuron segmentation approach. They show that this approach, Segmentation-Enhanced CycleGAN (SECGAN), achieves near-perfect reconstruction accuracy on a benchmark connectomics segmentation dataset despite operating in a "zero-shot" setting in which the segmentation model was trained using only volumetric labels from a different dataset and imaging method. By reducing or eliminating the need for new ground truth annotations, SECGANs alleviate one of the main practical burdens involved in automated reconstruction of volume electron microscopy data.


Conclusion

The results in this paper suggest that conditional adversarial networks are a promising approach for many image-to-image translation tasks, particularly those involving highly structured graphical outputs. These networks learn a loss adapted to the task and data at hand, which makes them applicable in a wide variety of settings. In this paper, a given input can be transitioned into any region, emotion and gender within their respective fields [12,13]. This method could be useful for providing security and recognition capabilities to corporations. The core technology of the project is deep learning through generative adversarial networks.

References

  1. Isola P, Yan Zhu J, Zhou T, Efros A (2018) Image-to-image translation with conditional adversarial networks. Berkeley AI Research (BAIR) Laboratory, University of California, Berkeley, California, USA.
  2. Goodfellow IJ, Abadie JP, Mirza M, Xu B, Farley DB, et al. (2014) Generative adversarial nets. Department of Computer Science and Operations Research, Universite de Montreal, Montreal, Canada.
  3. Bao F, Neumann M, Thang Vu N (2019) CycleGAN-based emotion style transfer as data augmentation for speech emotion recognition. Institute for Natural Language Processing (IMS), University of Stuttgart, Stuttgart, Germany.
  4. Zhu JY, Park T, Isola P, Efros AA (2020) Unpaired image-to-image translation using cycle-consistent adversarial networks. Berkeley AI Research (BAIR) laboratory, University of California, Berkeley, California, USA.
  5. Januszewski M, Jain V (2019) Segmentation-enhanced CycleGAN. bioRxiv.
  6. Brownlee J (2019) How to develop a cyclegan for image-to-image translation with keras.
  7. Cordts M, Omran M, Ramos S, Rehfeld T, Enzweiler M, et al. (2016) The cityscapes dataset for semantic urban scene understanding. 2016 IEEE Conference on Computer Vision and Pattern Recognition CVPR, Las Vegas, Nevada, USA.
  8. Bousmalis K, Silberman N, Dohan D, Erhan D, Krishnan D (2017) Unsupervised pixel-level domain adaptation with generative adversarial networks. CVPR.
  9. Goodfellow I (2017) Generative adversarial networks. OpenAI.
  10. Ledig C, Theis L, Huszar F, Caballero J, Aitken A, et al. (2017) Photo-realistic single image super-resolution using a generative adversarial network. CVPR.
  11. Xiong F, Wang Q, Gao Q (2019) Consistent embedded GAN for image-to-image translation. State Key Laboratory of Integrated Services Networks, Xidian University, Xi'an, China.
  12. Wang H, Yu CN (2019) A direct approach to robust deep learning using adversarial network. Published as a conference paper at ICLR 2019.
  13. Bansal H, Rathore A. Understanding and implementing CycleGAN in TensorFlow.

© 2022 Gayathri R. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and building upon your work non-commercially.