README

Pokémon GAN Training and Augmentation

This project trains a Generative Adversarial Network (GAN) to generate Pokémon images conditioned on text embeddings derived from Pokémon names. The workflow covers data preparation, BERT-based text embedding generation, and two-stage GAN training. Each step is described below.

Dataset Class

A custom dataset class is implemented to handle:

Loading and transforming images.
Generating text embeddings for Pokémon names using the BERT model.
Providing the necessary data format for the DataLoader, which is used in the training process.
Link to the dataset: https://huggingface.co/datasets/diffusers/pokemon-gpt4-captions
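
The responsibilities listed above can be sketched as a small PyTorch `Dataset`. This is an illustration only, not the project's actual class: the names `PokemonTextImageDataset`, `to_model_input`, and `embed_fn`, the 64×64 target size, and the [-1, 1] normalization are all assumptions. Images are assumed to be already decoded into H×W×3 uint8 tensors.

```python
import torch
import torch.nn.functional as F
from torch.utils.data import Dataset, DataLoader


def to_model_input(image_hwc_uint8, size=64):
    """Convert an HxWx3 uint8 tensor into a 3 x size x size float tensor in [-1, 1]."""
    x = image_hwc_uint8.permute(2, 0, 1).float().unsqueeze(0)  # 1 x 3 x H x W
    x = F.interpolate(x, size=(size, size), mode="bilinear", align_corners=False)
    return x.squeeze(0) / 127.5 - 1.0


class PokemonTextImageDataset(Dataset):
    """Hypothetical dataset pairing each image with the embedding of its name."""

    def __init__(self, images, names, embed_fn, transform=to_model_input):
        assert len(images) == len(names), "need one name per image"
        self.images = images
        self.names = names
        self.transform = transform
        # Embed every name once up front so the text model is not run per batch.
        self.embeddings = [embed_fn(name) for name in names]

    def __len__(self):
        return len(self.names)

    def __getitem__(self, idx):
        # Returns (image tensor, text embedding), which DataLoader can collate.
        return self.transform(self.images[idx]), self.embeddings[idx]
```

Wrapping an instance in `DataLoader(dataset, batch_size=..., shuffle=True)` then yields batches of image tensors together with their matching embeddings.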

BERT Embeddings

Text embeddings for Pokémon names are generated using a pre-trained BERT model. Each name is tokenized and passed through the model to obtain a fixed-size, high-dimensional embedding vector, which serves as the text conditioning input for the GAN.
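
One possible way to obtain such embeddings with the Hugging Face `transformers` library is sketched below. The README does not state which checkpoint or pooling strategy the project uses, so the `bert-base-uncased` checkpoint, the masked mean pooling, and the helper names `mean_pool`, `embed_names`, and `load_bert` are assumptions made for illustration.

```python
import torch


def mean_pool(last_hidden_state, attention_mask):
    """Average token vectors per sequence, ignoring padding positions."""
    mask = attention_mask.unsqueeze(-1).to(last_hidden_state.dtype)  # B x T x 1
    summed = (last_hidden_state * mask).sum(dim=1)                   # B x H
    counts = mask.sum(dim=1).clamp(min=1.0)                          # B x 1
    return summed / counts


def embed_names(names, tokenizer, model):
    """Tokenize a list of names and return one pooled vector per name."""
    batch = tokenizer(list(names), padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():  # embeddings are used as fixed inputs, so no gradients
        out = model(**batch)
    return mean_pool(out.last_hidden_state, batch["attention_mask"])


def load_bert(checkpoint="bert-base-uncased"):
    """Load a tokenizer and encoder; the checkpoint name is an assumed default."""
    from transformers import BertModel, BertTokenizer  # imported lazily

    tokenizer = BertTokenizer.from_pretrained(checkpoint)
    model = BertModel.from_pretrained(checkpoint).eval()
    return tokenizer, model
```

With the assumed checkpoint, `embed_names(["pikachu", "eevee"], *load_bert())` would return a tensor with one 768-dimensional row per name. Using the `[CLS]` token or the model's pooled output instead of mean pooling would be an equally plausible choice.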

GAN Training

The training process involves a GAN setup with two stages:

Stage 1

Stage 1 Generator: This network takes text embeddings and random noise as inputs and generates low-resolution images (64x64 pixels).
Stage 1 Discriminator: This network evaluates the authenticity of the generated images by comparing them to real images, conditioned on the text embeddings.
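
To make the two roles concrete, here is a minimal sketch of a text-conditioned generator and discriminator for 64×64 images, written in a common DCGAN-like style. The layer layout, the class names, and the sizes (`embed_dim=768`, `noise_dim=100`, `cond_dim=128`, `base=64`) are assumptions and may differ from the networks actually used in this project.

```python
import torch
import torch.nn as nn


class Stage1Generator(nn.Module):
    """Maps (text embedding, noise) to a 3 x 64 x 64 image in [-1, 1]."""

    def __init__(self, embed_dim=768, noise_dim=100, cond_dim=128, base=64):
        super().__init__()
        self.project = nn.Sequential(nn.Linear(embed_dim, cond_dim), nn.LeakyReLU(0.2))
        self.fc = nn.Sequential(
            nn.Linear(cond_dim + noise_dim, base * 8 * 4 * 4),
            nn.BatchNorm1d(base * 8 * 4 * 4),
            nn.ReLU(True),
        )

        def up(c_in, c_out):  # doubles the spatial resolution
            return nn.Sequential(
                nn.ConvTranspose2d(c_in, c_out, 4, 2, 1, bias=False),
                nn.BatchNorm2d(c_out),
                nn.ReLU(True),
            )

        self.net = nn.Sequential(
            up(base * 8, base * 4),  # 4 -> 8
            up(base * 4, base * 2),  # 8 -> 16
            up(base * 2, base),      # 16 -> 32
            nn.ConvTranspose2d(base, 3, 4, 2, 1),  # 32 -> 64
            nn.Tanh(),
        )

    def forward(self, text_emb, noise):
        h = self.fc(torch.cat([self.project(text_emb), noise], dim=1))
        return self.net(h.view(h.size(0), -1, 4, 4))


class Stage1Discriminator(nn.Module):
    """Scores an (image, text embedding) pair; returns one logit per sample."""

    def __init__(self, embed_dim=768, cond_dim=128, base=64):
        super().__init__()

        def down(c_in, c_out, norm=True):  # halves the spatial resolution
            layers = [nn.Conv2d(c_in, c_out, 4, 2, 1, bias=False)]
            if norm:
                layers.append(nn.BatchNorm2d(c_out))
            layers.append(nn.LeakyReLU(0.2, inplace=True))
            return nn.Sequential(*layers)

        self.features = nn.Sequential(
            down(3, base, norm=False),  # 64 -> 32
            down(base, base * 2),       # 32 -> 16
            down(base * 2, base * 4),   # 16 -> 8
            down(base * 4, base * 8),   # 8 -> 4
        )
        self.project = nn.Linear(embed_dim, cond_dim)
        self.head = nn.Sequential(
            nn.Conv2d(base * 8 + cond_dim, base * 8, 1),
            nn.LeakyReLU(0.2, inplace=True),
            nn.Conv2d(base * 8, 1, 4),  # 4 x 4 feature map -> single logit
        )

    def forward(self, image, text_emb):
        h = self.features(image)
        # Broadcast the projected text vector over the 4 x 4 feature map.
        c = self.project(text_emb)[:, :, None, None].expand(-1, -1, h.size(2), h.size(3))
        return self.head(torch.cat([h, c], dim=1)).flatten()
```

Concatenating the projected text vector with the image features is one simple way to condition the discriminator on the text; other fusion schemes are possible.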

Both the generator and discriminator are trained iteratively. The generator aims to produce images that can fool the discriminator, while the discriminator aims to correctly distinguish between real and fake images.
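
The alternating update described above might look like the following single training step. This sketch assumes a standard binary cross-entropy GAN loss on logits, a generator called as `G(text_emb, noise)`, and a discriminator called as `D(image, text_emb)`; the function name `gan_train_step` and these call signatures are illustrative, not taken from the project.

```python
import torch
import torch.nn as nn


def gan_train_step(G, D, opt_g, opt_d, real, text_emb, noise_dim=100):
    """Run one discriminator update followed by one generator update."""
    bce = nn.BCEWithLogitsLoss()
    batch = real.size(0)
    ones = torch.ones(batch, device=real.device)
    zeros = torch.zeros(batch, device=real.device)

    noise = torch.randn(batch, noise_dim, device=real.device)
    fake = G(text_emb, noise)

    # Discriminator: label real pairs 1 and generated pairs 0.
    # detach() keeps this loss from sending gradients into the generator.
    d_loss = bce(D(real, text_emb), ones) + bce(D(fake.detach(), text_emb), zeros)
    opt_d.zero_grad()
    d_loss.backward()
    opt_d.step()

    # Generator: try to make the updated discriminator output 1 for fakes.
    g_loss = bce(D(fake, text_emb), ones)
    opt_g.zero_grad()
    g_loss.backward()
    opt_g.step()

    return d_loss.item(), g_loss.item()
```

In a full training loop this function would be called once per batch from the DataLoader, with the two losses logged to monitor whether either network is overpowering the other.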

Stage 2

Stage 2 Generator: This network refines the images produced by the Stage 1 generator, improving their resolution and quality.
Stage 2 Discriminator: This network evaluates the authenticity of the higher-resolution images produced in Stage 2.

As in Stage 1, the generator and discriminator are trained iteratively, this time on the refined, higher-resolution images.
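
A refinement generator of this kind can be sketched as an encoder, a text-fusion step, residual blocks, and an upsampling decoder. The README does not give the Stage 2 output resolution or architecture, so the 256×256 output, the layer layout, and the names `ResidualBlock` and `Stage2Generator` below are assumptions for illustration only.

```python
import torch
import torch.nn as nn


class ResidualBlock(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, 3, 1, 1, bias=False),
            nn.BatchNorm2d(channels),
            nn.ReLU(True),
            nn.Conv2d(channels, channels, 3, 1, 1, bias=False),
            nn.BatchNorm2d(channels),
        )

    def forward(self, x):
        return torch.relu(x + self.body(x))


class Stage2Generator(nn.Module):
    """Refines a 3 x 64 x 64 image into 3 x 256 x 256 (assumed size), given text."""

    def __init__(self, embed_dim=768, cond_dim=128, base=64, n_res=2):
        super().__init__()
        self.project = nn.Sequential(nn.Linear(embed_dim, cond_dim), nn.LeakyReLU(0.2))
        self.encode = nn.Sequential(  # 64 x 64 -> 16 x 16
            nn.Conv2d(3, base, 3, 1, 1),
            nn.ReLU(True),
            nn.Conv2d(base, base * 2, 4, 2, 1, bias=False),
            nn.BatchNorm2d(base * 2),
            nn.ReLU(True),
            nn.Conv2d(base * 2, base * 4, 4, 2, 1, bias=False),
            nn.BatchNorm2d(base * 4),
            nn.ReLU(True),
        )
        self.fuse = nn.Sequential(
            nn.Conv2d(base * 4 + cond_dim, base * 4, 3, 1, 1, bias=False),
            nn.BatchNorm2d(base * 4),
            nn.ReLU(True),
        )
        self.res = nn.Sequential(*[ResidualBlock(base * 4) for _ in range(n_res)])

        def up(c_in, c_out):  # doubles the spatial resolution
            return nn.Sequential(
                nn.Upsample(scale_factor=2, mode="nearest"),
                nn.Conv2d(c_in, c_out, 3, 1, 1, bias=False),
                nn.BatchNorm2d(c_out),
                nn.ReLU(True),
            )

        self.decode = nn.Sequential(  # 16 -> 32 -> 64 -> 128 -> 256
            up(base * 4, base * 2),
            up(base * 2, base),
            up(base, base // 2),
            up(base // 2, base // 4),
            nn.Conv2d(base // 4, 3, 3, 1, 1),
            nn.Tanh(),
        )

    def forward(self, low_res, text_emb):
        h = self.encode(low_res)
        # Broadcast the projected text vector over the encoded feature map.
        c = self.project(text_emb)[:, :, None, None].expand(-1, -1, h.size(2), h.size(3))
        h = self.fuse(torch.cat([h, c], dim=1))
        return self.decode(self.res(h))
```

Feeding the Stage 1 output back in together with the same text embedding lets the second stage correct defects in the low-resolution image rather than generating from noise alone.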

Results Visualization

After training, the generated images are visualized to assess the GAN's performance. Ideally, they resemble real Pokémon images and capture their details and features.

Stage 1

Results produced in Stage 1:

(image omitted)

The corresponding original images:

(image omitted)

Stage 2

Results produced in Stage 2:

(image omitted)

The corresponding original images:

(image omitted)
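
A comparison of generated and original images like the ones described above could be assembled with a small helper such as the following. It is a sketch under the assumption that both batches are tensors of shape B×3×H×W with values in [-1, 1]; the function name `make_comparison_grid` is hypothetical.

```python
import torch
import torch.nn.functional as F


def make_comparison_grid(generated, originals, pad=2):
    """Return a 3 x H' x W' image: generated samples on top, originals below."""
    assert generated.shape == originals.shape, "batches must match in shape"

    def row(batch):
        imgs = (batch.clamp(-1, 1) + 1) / 2  # map [-1, 1] to [0, 1]
        imgs = F.pad(imgs, (pad, pad, pad, pad), value=1.0)  # white border
        return torch.cat(list(imgs), dim=2)  # place images side by side

    return torch.cat([row(generated), row(originals)], dim=1)
```

The result can be displayed with, for example, `matplotlib.pyplot.imshow(grid.permute(1, 2, 0))`, which expects the channel dimension last.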
