-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Problems met when trying the code #2
Comments
This is because your machine is not using latest driver. Please try this tag of docker instead, |
sorry, how to use visualjoyce/chengyubert:1.6.0-cuda10.1-cudnn7-devel ? |
|
docker run --gpus '"'device=$CUDA_VISIBLE_DEVICES'"' --ipc=host --rm -it
But,What is this mean? |
It was this before --mount src="${WORK_DIR}",dst=/src,type=bind \
--mount src="$OUTPUT",dst=/storage,type=bind \
--mount src="$PRETRAIN_DIR",dst=/pretrain,type=bind,readonly \
--mount src=$ANNOTATION_DIR,dst=/annotations,type=bind,readonly \
--mount src="$TXT_DB",dst=/txt,type=bind \ Now is that
Is my understanding right? 3 rows are deleted |
A diff file shows editing made, the line starts with My last post means, you need to change |
Thanks for your patience. parser.add_argument("--model", default='paired',
choices=['snlive'],
help="choose from 2 model architecture") What are paired and snlive mean? Here is my another error. File "train_official.py", line 304, in main
raise ValueError(f"No such model [{opts.model}] supported!")
ValueError: No such model [paired] supported! |
This is due to copy-paste from an earlier code. Now you may ignore the parameter as it's overwritten by I will fix that on my next version. |
Sorry that there are so many errors I met... Primary job terminated normally, but 1 process returned |
That's OK. Thank you for pointing these out! The code supports multiple GPU. If you only have one GPU, then use |
ok! |
Our next paper is under review which supports more than 30k idioms. So it's a parameter for future compatibility. |
The mode I choose is 'train'. Your code in train_official.py is this The error is that AttributeError: 'Namespace' object has no attribute 'train_txt_db' |
I can run the code on my machine with
Can you post the command line? |
This is mine. And I run your command, it has the same error... |
Have you done the preprocessing without error? |
Oh, I am so sorry for making wrong with that before. |
Also, your |
Copy that |
After preprocessing, I have the error too.... |
Locked is because they are generated from docker using |
ok, And..
|
Do you have |
Or change the value of |
It did work! What is that mean? |
You need to paste the full log, it's hard to tell where the problem might be. When you post the log, try using |
[1,0]<stderr>:01/04/2021 16:11:34 - INFO - __main__ - Waiting on git info....
[1,0]<stderr>:fatal: not a git repository (or any parent up to mount point /)
[1,0]<stderr>:Stopping at filesystem boundary (GIT_DISCOVERY_ACROSS_FILESYSTEM not set).
[1,0]<stderr>:01/04/2021 16:11:34 - INFO - __main__ - Git branch:
[1,0]<stderr>:fatal: not a git repository (or any parent up to mount point /)
[1,0]<stderr>:Stopping at filesystem boundary (GIT_DISCOVERY_ACROSS_FILESYSTEM not set).
[1,0]<stderr>:01/04/2021 16:11:34 - INFO - __main__ - Git SHA:
[1,0]<stderr>:fatal: not a git repository (or any parent up to mount point /)
[1,0]<stderr>:Stopping at filesystem boundary (GIT_DISCOVERY_ACROSS_FILESYSTEM not set).
[1,0]<stderr>:Traceback (most recent call last):
[1,0]<stderr>: File "train_official.py", line 468, in <module>
[1,0]<stderr>: main(args)
[1,0]<stderr>: File "train_official.py", line 317, in main
[1,0]<stderr>: best_ckpt = train(model, dataloaders, opts)
[1,0]<stderr>: File "train_official.py", line 49, in train
[1,0]<stderr>: save_training_meta(opts)
[1,0]<stderr>: File "/src/chengyubert/utils/save.py", line 45, in save_training_meta
[1,0]<stderr>: cwd=git_dir, universal_newlines=True).strip()
[1,0]<stderr>: File "/opt/conda/lib/python3.7/subprocess.py", line 411, in check_output
[1,0]<stderr>: **kwargs).stdout
[1,0]<stderr>: File "/opt/conda/lib/python3.7/subprocess.py", line 512, in run
[1,0]<stderr>: output=stdout, stderr=stderr)
[1,0]<stderr>:subprocess.CalledProcessError: Command '['git', 'status', '--short']' returned non-zero exit status 128. Thanks you very much! |
Are you using a cloned repo or downloaded the zip from master? |
Yes, I downloaded the zip before |
The code is trying to query the git info and failed. Either you clone the repo or comment out the line which is trying to query git status. |
It finally works! |
Glad that works! I will add more details on next release. |
The text was updated successfully, but these errors were encountered: