Tips for training #19
Comments
bro, u almost solved all the problems I met. Respect.
This should be update ada p
How to show progress while training?
Thank you for making this repository.
Custom training can be hard to run because of a few issues in the repo. Here are some tips to make it work:
Env: RTX 3080, Ubuntu 18.04, Python 3.8, PyTorch 1.7.0, CUDA 11, CUDAToolkit 11.0.221, mmcv (torch1.7.0 and cu110), mmgeneration (f6551e1)
1. Issue related to mmgeneration
"SiLU is registered" error.
Comment out line 35 (the decorator that registers SiLU) in
    mmgeneration/mmgen/models/architectures/ddpm/modules.py
then install mmgeneration.

2. Build error when compiling custom operators
"nvcc fatal : Unsupported gpu architecture 'compute_86'" on the new graphics card series. The CUDA 11.0 toolkit predates compute_86 support (added in CUDA 11.1), so target 8.0 instead. Before running, use
    export TORCH_CUDA_ARCH_LIST="8.0"

3. Pretrained weights are missing
Download the pretrained weights referenced in the configs into
    work_dirs/pre-trained/
before running.

4. Running the trainer directly with Python
Add the agilegan folder in tools/train.py, then use
    python tools/train.py PATH_TO_YOUR_CONFIGS --work-dir PATH_TO_YOUR_DIR --gpus GPU_NUMS

5. "object has no attribute 'module'" in transfer.py and log_buffer broadcasting error
Fix agilegan/transfer.py around lines 140 and 210, and around line 460.
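
A common cause of the 'module' error is code that reads model.module when the model was never wrapped by a parallel wrapper such as torch.nn.DataParallel, for example when training on a single GPU. Below is a minimal sketch of a guard, assuming that is what happens here. unwrap_model is a hypothetical helper, not code taken from agilegan/transfer.py, and the two stub classes only stand in for a real model and its wrapper.

```python
def unwrap_model(model):
    """Return model.module if the object is a parallel wrapper, else the model itself."""
    return getattr(model, "module", model)


# Stand-ins for illustration only; in practice these would be a torch.nn.Module
# and a wrapper such as torch.nn.DataParallel.
class PlainModel:
    name = "generator"


class Wrapper:
    def __init__(self, module):
        self.module = module


plain = PlainModel()
wrapped = Wrapper(plain)

# Both calls reach the same underlying object, so attribute access works either way.
print(unwrap_model(plain).name)
print(unwrap_model(wrapped).name)
```

With a guard like this, the same code path can serve both wrapped and unwrapped models, at the cost of misfiring if a plain model happens to own an attribute called module.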
6. Issue related to the logger: GPU data should go through cpu() before numpy()
Disable the logger in your config; you can still get the info from the work directory.
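
If you would rather keep the logger enabled, another option is to move logged values to the host before anything calls numpy() on them. Below is a minimal sketch, assuming the logged values are either plain numbers or scalar tensor-like objects exposing detach(), cpu() and item() the way torch.Tensor does. to_loggable and sanitize_log_vars are hypothetical helpers, not part of this repo or of mmcv, and FakeCudaTensor is a stand-in so the snippet runs without a GPU.

```python
def to_loggable(value):
    """Convert a scalar tensor-like value to a Python float; pass plain numbers through."""
    if hasattr(value, "detach") and hasattr(value, "cpu"):
        # detach() drops the autograd graph, cpu() copies to host memory,
        # item() extracts the single scalar. For non-scalar tensors the
        # equivalent chain would be value.detach().cpu().numpy().
        return float(value.detach().cpu().item())
    return float(value)


def sanitize_log_vars(log_vars):
    """Apply to_loggable to every entry of a dict of logged values."""
    return {key: to_loggable(val) for key, val in log_vars.items()}


class FakeCudaTensor:
    """Stand-in for a scalar torch.Tensor living on the GPU (illustration only)."""

    def __init__(self, value):
        self._value = value

    def detach(self):
        return self

    def cpu(self):
        return self

    def item(self):
        return self._value


log_vars = {"loss_disc": FakeCudaTensor(0.25), "iter": 100}
print(sanitize_log_vars(log_vars))
```

Where exactly such a conversion would go in the training loop depends on the repo's code, so treat the call site as something to find yourself.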
Hope this helps.