wandb is a free tool for logging data from machine learning training processes. It includes features for user management, team management, and project management.
[Zhihu Tutorial (in Chinese)] [中文介绍]
- Ubuntu or Red Hat
- tmux
- Python3
pip install -r requirements.txt
For a detailed tutorial on this section, see: wandb Usage Tutorial (Part 1): Basic Usage
- Example of showing training curves: test_curves.sh
- Example of showing images: test_images.sh
- Example of showing videos: test_videos.sh
- Example of showing matplotlib plots: test_matplot.sh
- Example of showing tables: test_tables.sh
- Example of showing multi-process groups: test_multi_process.sh
- Example of showing HTML: test_html.sh
- Example of PyTorch integration: test_pytorch.sh
For a detailed tutorial on this section, see: wandb Usage Tutorial (Part 2): Distributed Hyperparameter Search Using Launchpad
In machine learning tasks, we often encounter many hyperparameters that need tuning. wandb provides features for hyperparameter search. However, wandb primarily focuses on hyperparameter search scheduling and visualization, and does not inherently offer distributed capabilities. Thus, this section describes a way to combine Launchpad and wandb for parallel (or distributed) hyperparameter search.
Note: Since Launchpad doesn't offer multi-machine distributed capabilities, if you wish to perform multi-machine parallel hyperparameter searches, consider using TLaunch.
- Basic example: test_sweep.sh, this example provides a minimal setup for combining wandb and Launchpad.
- Search for dropout hyperparameter in CNN classification task: test_sweep.sh, this example is based on the MNIST classification task and focuses on searching for the best dropout parameter.
wandb also offers features for data and model backup management. For a detailed tutorial on this section, see: wandb Usage Tutorial (Part 3): Data and Model Management
- Basic example: test_artifact.sh, this example provides a way to back up MNIST training data.
wandb also offers features for local server deployment. For a detailed tutorial on this section, see: wandb Usage Tutorial (Part 4): Local Deployment of wandb
If you use wandb_tutorial in your work, please cite us:
@article{huangshiyu2022wandb,
title={wandb Tutorial},
author={Shiyu Huang},
year={2022},
howpublished={\url{https://github.com/huangshiyu13/wandb_tutorial}},
}