
# DEVELOP

## Registry Mechanism

This is a common design adopted by several mainstream deep learning repositories such as MMDetection and Detectron2.

The main idea is to build a dictionary whose keys are module names and whose values are the module class objects, and then construct the whole pipeline (e.g. tracker/segmenter/their trainers/etc.) by retrieving module class objects and instantiating them with predefined configurations.

An example demonstrating the usage of the registry is given below:

```python
# In XXX_base.py
from videoanalyst.utils import Registry

TRACK_TEMPLATE_MODULES = Registry('TRACK_TEMPLATE_MODULE')
VOS_TEMPLATE_MODULES = Registry('VOS_TEMPLATE_MODULE')

TASK_TEMPLATE_MODULES = dict(
    track=TRACK_TEMPLATE_MODULES,
    vos=VOS_TEMPLATE_MODULES,
)
```

```python
# In XXX_impl/YYY.py
@TRACK_TEMPLATE_MODULES.register
@VOS_TEMPLATE_MODULES.register
class TemplateModuleImplementation(TemplateModuleBase):
    ...
```
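
To make the mechanism concrete, here is a minimal sketch of how such a registry and builder could work. This is an illustration only, not videoanalyst's actual implementation; the `build` helper and `MyModule` class are hypothetical names:

```python
# A minimal registry sketch (illustration only, not videoanalyst's actual code).
class Registry(dict):
    def __init__(self, name: str):
        super().__init__()
        self.name = name

    def register(self, module_class):
        # Key by class name; return the class unchanged so the
        # decorator leaves the decorated class intact.
        self[module_class.__name__] = module_class
        return module_class


MODULES = Registry('MODULE')


@MODULES.register
class MyModule:
    def __init__(self, hidden_dim: int = 256):
        self.hidden_dim = hidden_dim


def build(registry, name, **kwargs):
    # Retrieve the module class object by name and instantiate it
    # with a predefined configuration.
    return registry[name](**kwargs)


module = build(MODULES, 'MyModule', hidden_dim=128)
```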

Please refer to docs/TEMPLATES/README.md for details.

## Configuration Tree

Based on YAML and yacs, videoanalyst arranges its configuration hierarchically.
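
As a minimal sketch (with illustrative keys, not the repository's actual schema), a hierarchical configuration built with yacs looks like this:

```python
from yacs.config import CfgNode

# A minimal sketch of a hierarchical configuration tree
# (illustrative keys, not videoanalyst's actual schema).
cfg = CfgNode()
cfg.task = 'track'
cfg.model = CfgNode()
cfg.model.backbone = CfgNode()
cfg.model.backbone.name = 'AlexNet'
cfg.model.head = CfgNode()
cfg.model.head.name = 'DenseboxHead'

# A CfgNode dumps to / merges from YAML, which is how the experiment
# .yaml files override defaults.
print(cfg.dump())
```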

### Hyper-parameters

Developers are recommended to take the default .yaml configuration files as examples and start from them. Additionally, the hyper-parameter definitions, together with their descriptions, can be found under the XXX_impl directory of each module.
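
For instance, a module implementation under XXX_impl carries a `default_hyper_params` attribute; the sketch below uses hypothetical names and values:

```python
# A hypothetical module implementation under XXX_impl, illustrating
# where hyper-parameters are defined and described.
class MyModuleImpl:
    r"""
    Hyper-parameters
    ----------------
    total_epoch : int
        number of training epochs
    minibatch : int
        batch size per iteration
    """
    default_hyper_params = dict(
        total_epoch=20,
        minibatch=32,
    )
```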

Please refer to docs/TEMPLATES/README.md for details.

P.S.

* For serialization purposes, do not use bare dict objects in default_hyper_params; wrap them with yacs.config.CfgNode.
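
For example (a hypothetical module attribute; only the wrapping pattern matters here):

```python
from yacs.config import CfgNode

class MyModuleImpl:
    # Wrong: a bare nested dict breaks serialization.
    # default_hyper_params = dict(thresholds=dict(low=0.1, high=0.9))

    # Right: wrap the nested dict with yacs.config.CfgNode.
    default_hyper_params = dict(
        thresholds=CfgNode(dict(low=0.1, high=0.9)),
    )
```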

## Add Your Own Module

Please refer to docs/TEMPLATES/README.md for details.

Additionally, users are recommended to contribute their code under contrib. It is good practice to separate the stable and experimental parts of the project.

## Structure

### Trainer

#### Trainer Structure

```
Trainer
├── Dataloader (pytorch)              # makes batches of training data
│   └── AdaptorDataset (pytorch)      # adaptor class (pytorch index dataset)
│       └── Datapipeline              # integrates data sampling, data augmentation, and target making
│           ├── Sampler               # defines the sampling strategy
│           │   ├── Dataset           # dataset interface
│           │   └── Filter            # defines rules to filter out invalid samples
│           ├── Transformer           # data augmentation
│           └── Target                # target making
├── Optimizer
│   ├── Optimizer (pytorch)           # pytorch optimizer
│   │   ├── lr_scheduler              # learning rate scheduling
│   │   └── lr_multiplier             # learning rate multiplication ratio
│   ├── Grad_modifier                 # grad clip, dynamic freezing, etc.
│   └── TaskModel                     # model, subclass of pytorch module (torch.nn.Module)
│       ├── Backbone                  # feature extractor
│       ├── Neck                      # mid-level feature map operation (e.g. cross-correlation)
│       └── Head                      # task head (bbox regressor, mask decoder, etc.)
└── Monitor                           # monitoring (e.g. pbar.set_description, tensorboard, etc.)
```
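
To illustrate how the data branch of this tree fits together, here is a minimal sketch of a Datapipeline composing a sampler, transformers, and a target maker. The class and method names are illustrative, not videoanalyst's actual interfaces:

```python
# A minimal sketch of the data pipeline composition (illustration only).
class Datapipeline:
    def __init__(self, sampler, transformers, target_maker):
        self.sampler = sampler              # sampling strategy (with filtering)
        self.transformers = transformers    # list of augmentation callables
        self.target_maker = target_maker    # builds training targets

    def __getitem__(self, index):
        sample = self.sampler[index]        # data sampling
        for transform in self.transformers:  # data augmentation
            sample = transform(sample)
        return self.target_maker(sample)    # target making
```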

#### Trainer Building Process (Functional Representation)

```python
model = builder.build(model_cfg)  # loss is built inside the model (model_cfg.loss_cfg -> model.loss)
optimizer = builder.build(optim_cfg, model)
dataloader = builder.build(data_cfg)
trainer = builder.build(trainer_cfg, optimizer, dataloader)
```

### Tester

#### Tester Structure

```
Tester
├── Benchmark implementation          # depends on the concrete benchmark (e.g. VOT / GOT-10k / LaSOT / etc.)
└── Pipeline                          # manipulates the underlying nn model and performs pre/post-processing
    └── TaskModel                     # underlying neural network
        ├── Backbone                  # feature extractor
        ├── Neck                      # mid-level feature map operation (e.g. cross-correlation)
        └── Head                      # task head (bbox regressor, mask decoder, etc.)
```
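
As a rough sketch of how these parts interact during evaluation (the `init`/`update` method names and the `sequence` object are assumptions for illustration, not necessarily the repository's actual pipeline API):

```python
# A minimal sketch of a tester loop over one benchmark sequence
# (illustrative API; hypothetical names).
def run_tracking_test(pipeline, sequence):
    boxes = []
    for frame_idx, frame in enumerate(sequence.frames):
        if frame_idx == 0:
            # Initialize the pipeline with the first frame and its box.
            pipeline.init(frame, sequence.init_box)
            boxes.append(sequence.init_box)
        else:
            # The pipeline drives the underlying TaskModel and performs
            # pre/post-processing around it.
            boxes.append(pipeline.update(frame))
    return boxes
```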

Remarks:

* Training
* Regex
* Scheduler
* Grad modifier

## Misc

### Logging

The names of the currently used loggers are listed below:

* global
  * built at each place where it is used
* data
  * built at: videoanalyst/data/builder.py
  * log file stored by default at: snapshots/EXP_NAME/logs/data.log
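
Assuming the standard library `logging` module is used (an assumption; check the builders for the actual setup), the loggers can be retrieved by name:

```python
import logging

# Basic configuration for this demo; the project configures its own
# handlers (e.g. the file handler writing data.log mentioned above).
logging.basicConfig(level=logging.INFO)

# Retrieve the shared loggers by name.
global_logger = logging.getLogger("global")
data_logger = logging.getLogger("data")

global_logger.info("visible wherever the 'global' logger is configured")
```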

### Tips for debugging

#### Debugging in-tree files

For debugging / unit testing purposes, insert the following code at the beginning of the source file.

Note that this requires that the file being debugged contains no relative imports (which is why we always encourage absolute imports).

```python
import os.path as osp

import sys  # isort:skip

module_name = "videoanalyst"
p = __file__
# Walk up the directory tree until reaching the directory named "videoanalyst".
while osp.basename(p) != module_name:
    p = osp.dirname(p)

ROOT_PATH = osp.dirname(p)  # repository root, i.e. the parent of the package
ROOT_CFG = osp.join(ROOT_PATH, 'config.yaml')
sys.path.insert(0, ROOT_PATH)  # isort:skip
```

You can find this code in the many path.py files across this project.

### Visualization tools

Check out the script at demo/main/debug/visualize_siamfcpp_training_data.py:

```bash
python demo/main/debug/visualize_siamfcpp_training_data.py --config 'experiments/siamfcpp/train/siamfcpp_alexnet-trn.yaml'
```

Note: make sure that an X server is able to run on your terminal's side.