-
Notifications
You must be signed in to change notification settings - Fork 134
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Running problem on Linux server #342
Comments
Hi @looperalt, My guess is something went wrong when trying to preprocess the dataset - did you check your terminal before that last error message to see if preprocessing was actually successful (e.g., the progress bar for preprocessing showed up, was completed, etc)? Remember, you have to preprocess the dataset for the first time and anytime you change key parameters that would affect preprocessing in your config file (refer to the example config files for more details). Also, note the project README and how datasets are expected to be organized (in most cases, the way they are downloaded). If it seems like preprocessing somehow starts and then abruptly stops, you could try seeing if the issue is the default |
Yes, a progress bar appears during preprocessing, but when the progress bar reaches the end, this issue occurs. However, it's strange that when I run the code locally on Windows, there is no problem. When I run it on a Linux remote server, this issue arises. I have followed the process outlined in the README file without any operational errors, and I am puzzled as to why there would be different results when running the code on different systems. I would like to know if you have encountered this issue and how to adjust the parameters when running this code on a Linux remote server |
If you can run it locally on a Windows machine but somehow not on a Linux server, are you sure there isn't something going wrong with how your dataset (whether the dataset itself or the preprocessed folder) is being pointed to in the Linux case? I guess if the Linux remote server has too much CPU usage going on, that could also lead to stuck or dead processes that prevent successful preprocessing, but if you use Feel free to share the config you're trying to run here, perhaps there is something up with the file paths that I can identify. |
When running on the Linux server, the progress bar advances normally, but after the progress bar is full, it displays a |
If it was read and not preprocessed or saved, again, try two things 1) adjust If 1) does not make a difference at all, let me know, and we can dig into other things that may be specific to your situation and causing issues. I should note, the majority of toolbox users (i.e., hundreds of people, myself included) use Linux and the default multi-process setting without any issue, so troubleshooting with respect to your particular remote server is the way to go. |
I have tried the methods you suggested, continuously adjusting the parameters of the multi-process function, but unfortunately, the error still persists. |
Ok, so we've ruled out multi-processing as far as too many processes being the issue. Here's a few more things to try:
All the best, Akshay |
I have confirmed that my preprocessing folder has write permissions. In fact, there is something very strange: when I first tried to run the code, .npy files appeared in the preprocessing folder, but when I closed the program and ran it again, the .npy files never appeared again. This is a very confusing situation for me. |
I would troubleshoot that a bit more, that does sound strange and it's really hard for me to tell what the issue might be since it sounds quite specific to your remote server / your environment. Is it safe to assume, using |
When I run iBVPnet on a Linux server, the following problems occur:
TypeError: Binding inputs to tf.function wrapped_fn failed due to Can not cast TensorSpec(shape=(1, 1024, 1365, 4), dtype=tf.float32, name=None) to TensorSpec(shape=(None, None, None, 3), dtype=tf.float32, name=None). Received args: (array([[[[149. , 155. , 163. , 0. ]
and
ValueError: ('train', 'No files in file list')
The text was updated successfully, but these errors were encountered: