- Workshop Tutorial
- August 2, 2018
...Object detection has a long history in computer vision community, usually considered as adifficult task in terms of computation and accuracy. However , in the recent years, it has becomevery common and easy to implement, thanks to the deep learning and convolutions. The difference from the classification tasks, in general, if we aim to identify the location of objects ordetect the number of instances in an image, we use object detection.
...There are two main approaches to apply the object detection which are single-stage andtwo-stage detection. Both have even several ways to be implemented. In this workshop , we are mostly going to talk about RCNN-Family(Faster-RCNN , Fast-RCNN,RCNN). In the implementation , we will use Google's Tensorflow Object Detection API which covers both singleand two-stage models at whole.
...The TensorFlow Object Detection API is an open source framework built on top of TensorFlowthat makes it easy to construct, train and deploy object detection models. This API is capable ofidentifying many types of objects like cars, pedestrians, person, kite, dog and many more. Weare going to learn how to use this API for our specific problems by preparing our own datapipeline and apply on different models (including two-stages like YOLO and SSD).
- About the Repo
- API : Tensorflow's Object Detection API
- ckpt : trained model parameters saved
- config : Configuration settings
- eval : Evaluation result folder
- figures : helper figures for understanding Faster-RCNN
- frozen_model : Inference graph of trained model
- images :
- train/val/test data
- converted csv files
- converted tfrecord files
- labelmap.pbtxt : defines labels
- pretrained : Pretrained model parameters
--> Datasets
- COCO
- KITTI
- Pascal
- Oid
- AVA v2.1 trained models
- Open Images-trained models
-
2-Stage Detectors
-
1-Stage Detectors
-
A Comprehensive Review
- VGG
- ZFNet
- MobileNet
- Inception
- ResNet
- DenseNet
-
Google Advanced Image Search
- birds , tigers , rabbits etc..
-
Downloader Tools (Chrome Extensions):
- Download All Images 2.0.4
- Fatkun Batch Download Image 2.23
- Bounding Box
- Polygon
- Semantic Segmentation
- Bounding Box
- Line
- xml_to_csv
- csv_to_tfrecord
- resizer
- extension check
- size check
- Install Requirements
- Connect Drive
- Clone Repository
- Set python environment
- train
- ..
├───data
├───images
├───src
│ ├───chainercv
│ │ └───train
│ ├───data
│ ├───keras_retinanet
│ │ ├───keras_retinanet
│ │ │ ├───backend
│ │ │ ├───bin
│ │ │ ├───callbacks
│ │ │ ├───layers
│ │ │ ├───models
│ │ │ ├───preprocessing
│ │ │ └───utils
│ │ ├───solar_panel
│ │ └───tests
│ │ ├───backend
│ │ ├───bin
│ │ ├───layers
│ │ ├───models
│ │ ├───preprocessing
│ │ └───utils
│ └───utils
└───tf_od_eval
└───protos
└───__pycache__