Skip to content

AaSiKu/Illusion_craft_Assignment_3

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

README

Pokémon GAN Training and Augmentation

This project involves training a Generative Adversarial Network (GAN) to generate Pokémon images using text embeddings derived from Pokémon names. The workflow includes data preparation, BERT-based text embedding generation, and GAN training. Below is a detailed explanation of the process.

Table of Contents

  1. Dataset Class
  2. Data Augmentation
  3. BERT Embeddings
  4. GAN Training
  5. Results Visualization

Dataset Class

A custom dataset class is implemented to handle:

BERT Embeddings

Text embeddings for Pokémon names are generated using a pre-trained BERT model. The text is tokenized and then passed through the BERT model to obtain a high-dimensional embedding, which captures the semantic meaning of the Pokémon names.

GAN Training

The training process involves a GAN setup with two stages:

Stage 1

  1. Stage 1 Generator: This network takes text embeddings and random noise as inputs and generates low-resolution images (64x64 pixels).
  2. Stage 1 Discriminator: This network evaluates the authenticity of the generated images by comparing them to real images, conditioned on the text embeddings.

Both the generator and discriminator are trained iteratively. The generator aims to produce images that can fool the discriminator, while the discriminator aims to correctly distinguish between real and fake images.

Stage 2

  1. Stage 2 Generator: This network refines the images produced by the Stage 1 generator, improving their resolution and quality.
  2. Stage 2 Discriminator: This network evaluates the authenticity of the higher-resolution images produced in Stage 2.

Similar to Stage 1, the generator and discriminator are trained iteratively, but now with the refined images.

Results Visualization

After training, the generated images are visualized to assess the performance of the GAN. These images are expected to look similar to real Pokémon images, capturing various details and features. These are results produced in Stage-1

Stage 1

Screenshot 2024-06-21 000009

And these are the corresponding original images

Screenshot 2024-06-21 000017

Stage 2

image

And these are the corresponding original images

image

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published