Create an image segmentation model using DeepLabV3+. The model is lightweight and works well for isolating humans, which is what we use it for here. This code is based on the [Keras tutorial](https://keras.io/examples/vision/deeplabv3_plus/) for image segmentation.
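For orientation, this is the network the linked tutorial builds, in condensed form: an ImageNet-pretrained ResNet50 backbone, a dilated (atrous) spatial pyramid pooling head, and a light decoder. The sketch below follows the tutorial's code and is not necessarily line-for-line what the training container runs:

```python
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers


def convolution_block(x, num_filters=256, kernel_size=3, dilation_rate=1):
    # Conv -> BatchNorm -> ReLU, the basic unit used throughout the head.
    x = layers.Conv2D(
        num_filters, kernel_size, padding="same", dilation_rate=dilation_rate,
        use_bias=False, kernel_initializer=keras.initializers.HeNormal(),
    )(x)
    x = layers.BatchNormalization()(x)
    return tf.nn.relu(x)


def dilated_spatial_pyramid_pooling(dspp_input):
    # Parallel atrous convolutions at several dilation rates, plus global
    # average pooling, concatenated and projected back to 256 channels.
    dims = dspp_input.shape
    pooled = layers.AveragePooling2D(pool_size=(dims[-3], dims[-2]))(dspp_input)
    pooled = convolution_block(pooled, kernel_size=1)
    pooled = layers.UpSampling2D(
        size=(dims[-3] // pooled.shape[1], dims[-2] // pooled.shape[2]),
        interpolation="bilinear",
    )(pooled)
    branches = [pooled] + [
        convolution_block(dspp_input, kernel_size=1 if rate == 1 else 3, dilation_rate=rate)
        for rate in (1, 6, 12, 18)
    ]
    return convolution_block(layers.Concatenate(axis=-1)(branches), kernel_size=1)


def deeplabv3_plus(image_size, num_classes):
    model_input = keras.Input(shape=(image_size, image_size, 3))
    # Mid-level backbone features feed the ASPP head; low-level features
    # are merged back in by the decoder for sharper boundaries.
    resnet50 = keras.applications.ResNet50(
        weights="imagenet", include_top=False, input_tensor=model_input
    )
    x = dilated_spatial_pyramid_pooling(resnet50.get_layer("conv4_block6_2_relu").output)
    input_a = layers.UpSampling2D(
        size=(image_size // 4 // x.shape[1], image_size // 4 // x.shape[2]),
        interpolation="bilinear",
    )(x)
    input_b = convolution_block(
        resnet50.get_layer("conv2_block3_2_relu").output, num_filters=48, kernel_size=1
    )
    x = convolution_block(convolution_block(layers.Concatenate(axis=-1)([input_a, input_b])))
    x = layers.UpSampling2D(
        size=(image_size // x.shape[1], image_size // x.shape[2]),
        interpolation="bilinear",
    )(x)
    # One logit map per class; argmax over channels yields the segmentation.
    return keras.Model(model_input, layers.Conv2D(num_classes, 1, padding="same")(x))
```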
- Clone the repo if you haven't already and navigate to the `training-image-segmentation` folder.
- Install dependencies. The file `mask.py` will be used at the end to remove the background from an image and inpaint a new background using Stable Diffusion.

  ```
  pip install gdown opencv-python scipy matplotlib tensorflow diffusers transformers
  ```
- Download the data.

  ```
  gdown "1B9A9UCJYMwTL4oBEo4RZfbMZMaZhKJaz"
  unzip -q instance-level-human-parsing.zip
  ```
- Copy the data to GCS. We'll set up some environment variables to use in the next steps; replace the project values with yours.

  ```
  PROJECT_BUCKET_NAME=<your-bucket-name> # Ex: jfacevedo-demos-bucket
  PROJECT_ID=<your-project-id>
  REGION=us-central1
  gsutil -m cp -r instance-level_human_parsing gs://$PROJECT_BUCKET_NAME/datasets/segmentation_data/
  ```
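  Vertex AI mounts Cloud Storage buckets into training containers under `/gcs/<bucket>` via Cloud Storage FUSE, which is why the `--gcs-datadir` value in the training step below starts with `/gcs/`. The training code can then read the dataset with ordinary file paths. Here is a minimal input pipeline in the style of the linked tutorial; the `Images`/`Category_ids` directory names follow the CIHP archive layout, and the exact paths are assumptions:

  ```python
  import os
  from glob import glob

  import tensorflow as tf

  IMAGE_SIZE = 512
  BATCH_SIZE = 32
  # Path as seen from inside the Vertex AI container (Cloud Storage FUSE mount).
  DATA_DIR = "/gcs/<your-bucket-name>/datasets/segmentation_data/instance-level_human_parsing/instance-level_human_parsing/Training"


  def read_image(path, mask=False):
      image = tf.io.read_file(path)
      if mask:
          # Masks are single-channel PNGs of per-pixel class ids;
          # nearest-neighbor resizing keeps the ids intact.
          image = tf.image.decode_png(image, channels=1)
          image = tf.image.resize(image, [IMAGE_SIZE, IMAGE_SIZE], method="nearest")
      else:
          image = tf.image.decode_jpeg(image, channels=3)
          image = tf.image.resize(image, [IMAGE_SIZE, IMAGE_SIZE]) / 127.5 - 1
      return image


  def load_pair(image_path, mask_path):
      return read_image(image_path), read_image(mask_path, mask=True)


  images = sorted(glob(os.path.join(DATA_DIR, "Images/*")))
  masks = sorted(glob(os.path.join(DATA_DIR, "Category_ids/*")))
  dataset = (
      tf.data.Dataset.from_tensor_slices((images, masks))
      .map(load_pair, num_parallel_calls=tf.data.AUTOTUNE)
      .batch(BATCH_SIZE, drop_remainder=True)
  )
  ```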
- Build the training image and push it.

  ```
  docker build . -t gcr.io/$PROJECT_ID/image_segmentation_train:latest
  docker push gcr.io/$PROJECT_ID/image_segmentation_train:latest
  ```
- Run the training job. Here we use roughly a third of the data (10,000 training images) and 4 T4 GPUs. Training takes about 2.5 hours and reaches roughly 83% validation accuracy.

  ```
  python gcp_deploy.py \
    --project-id $PROJECT_ID \
    --accelerator-count 4 \
    --image-uri gcr.io/$PROJECT_ID/image_segmentation_train:latest \
    --gcs-datadir /gcs/$PROJECT_BUCKET_NAME/datasets/segmentation_data/instance-level_human_parsing/instance-level_human_parsing/Training \
    --num-train-images 10000 \
    --num-eval-images 500 \
    --model-output-dir gs://$PROJECT_BUCKET_NAME/models/segmentation/ \
    --batch-size 32
  ```

  The final metrics should look like:
"������������������������������������������������������������������������������������ 312/312 [==============================] - 351s 1s/step - loss: 0.2066 - accuracy: 0.9360 - val_loss: 0.6632 - val_accuracy: 0.8335 "
- After the job completes, copy the final model to this directory.

  ```
  gsutil -m cp -r gs://$PROJECT_BUCKET_NAME/models/segmentation .
  ```
- Run the inference script. The script creates a black-and-white mask that isolates humans (white) from everything else (black). If you don't have a GPU, you can comment out lines 80 onwards, which run the Stable Diffusion inpainting.

  ```
  python mask.py
  ```
- `mask.py` runs the segmentation model and then uses cv2's `findContours` function to fill in the parts of the human the model didn't mask; the sketch below shows the overall flow. As we can see, the mask is not perfect, but it works.
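  A condensed sketch of that flow, under a few assumptions (the copied `segmentation` directory loads as a Keras model, class 0 is background as in CIHP, and the `stabilityai/stable-diffusion-2-inpainting` checkpoint stands in for whatever `mask.py` actually uses):

  ```python
  import cv2
  import numpy as np
  import tensorflow as tf
  from diffusers import StableDiffusionInpaintPipeline
  from PIL import Image

  IMAGE_SIZE = 512

  # Load the model copied down from GCS in the previous step.
  model = tf.keras.models.load_model("segmentation")

  image = cv2.cvtColor(cv2.imread("person.jpg"), cv2.COLOR_BGR2RGB)
  resized = cv2.resize(image, (IMAGE_SIZE, IMAGE_SIZE)) / 127.5 - 1
  logits = model.predict(np.expand_dims(resized, axis=0))
  labels = np.argmax(logits.squeeze(), axis=-1).astype(np.uint8)

  # Class 0 is background, so any other class counts as "human".
  mask = np.where(labels > 0, 255, 0).astype(np.uint8)

  # Fill holes the model missed by drawing the outer contours as solid shapes.
  contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
  cv2.drawContours(mask, contours, -1, 255, thickness=cv2.FILLED)

  # Stable Diffusion inpainting repaints the *white* region of the mask,
  # so invert it: keep the person, regenerate the background.
  pipe = StableDiffusionInpaintPipeline.from_pretrained(
      "stabilityai/stable-diffusion-2-inpainting"
  ).to("cuda")
  result = pipe(
      prompt="RAW photo, a photograph of a beach, ocean, sunset, highly detailed, close up shot.",
      image=Image.fromarray(cv2.resize(image, (IMAGE_SIZE, IMAGE_SIZE))),
      mask_image=Image.fromarray(255 - mask),
  ).images[0]
  result.save("inpainted.png")
  ```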
*Original image · mask · inpainted result (prompt: "RAW photo, a photograph of a beach, ocean, sunset, highly detailed, close up shot.")*