You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, I'm trying to use the train.py function to train a RIFE model with my own database. It all goes well till the last epoch where it gives me a "Segmentation fault (core dumped)". I'm using --nproc_per_node=1 and --world_size=1, so maybe that's the issue? It doesn't matter how many epoch I use it always gives this error at the last epoch. Also by launching the code like this: sudo -E /usr/bin/python3 -m torch.distributed.launch --nproc_per_node=1 train.py --epoch=1 --world_size=1 the error becomes simply "Segmentation fault" without the "(core dumped)" part. Any ideas of why is giving me this issue?
The text was updated successfully, but these errors were encountered:
Hello, I'm trying to use the train.py function to train a RIFE model with my own database. It all goes well till the last epoch where it gives me a "Segmentation fault (core dumped)". I'm using --nproc_per_node=1 and --world_size=1, so maybe that's the issue? It doesn't matter how many epoch I use it always gives this error at the last epoch. Also by launching the code like this: sudo -E /usr/bin/python3 -m torch.distributed.launch --nproc_per_node=1 train.py --epoch=1 --world_size=1 the error becomes simply "Segmentation fault" without the "(core dumped)" part. Any ideas of why is giving me this issue?
The text was updated successfully, but these errors were encountered: