Dimension mismatch while loading model from checkpoint #5

Rajrup · 2023-02-22T08:04:34Z

Thanks for sharing this great work!

I am currently hitting an issue while running the evaluation for the pointgroup detector using the checkpoint file you shared.
python scripts/eval.py --folder <output_folder> --task detection

Output:
Traceback (most recent call last):
File "scripts/eval.py", line 522, in
model = init_model(cfg, dataset)
File "scripts/eval.py", line 121, in init_model
model.load_state_dict(checkpoint["state_dict"], strict=False)
File "/home/rajrup/miniconda3/envs/d3net-original/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1406, in load_state_dict
raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
RuntimeError: Error(s) in loading state_dict for PipelineNet:
size mismatch for embeddings: copying a param with shape torch.Size([3441, 300]) from checkpoint, the shape in current model is torch.Size([3535, 300]).
size mismatch for speaker.caption.embeddings: copying a param with shape torch.Size([3441, 300]) from checkpoint, the shape in current model is torch.Size([3535, 300]).
size mismatch for speaker.caption.classifier.2.weight: copying a param with shape torch.Size([3441, 512]) from checkpoint, the shape in current model is torch.Size([3535, 512]).
size mismatch for speaker.caption.classifier.2.bias: copying a param with shape torch.Size([3441]) from checkpoint, the shape in current model is torch.Size([3535]).

The dimension of the tensors in checkpoint doesn't match the one required in the code. Before the model load step, the val splits, and the vocabulary loads fine. I might be missing something here. Can you please help me solve this issue?

Thanks!

The text was updated successfully, but these errors were encountered:

STAR-ALG · 2023-04-25T13:08:11Z

I meet the very similar error while running the evaluation for the pointgroup captioning.

Here is the error:
Traceback (most recent call last):
File "scripts/eval.py", line 523, in
model = init_model(cfg, dataset)
File "scripts/eval.py", line 122, in init_model
model.load_state_dict(checkpoint["state_dict"], strict=False)
File "/home/niexing/anaconda3/envs/D3Net/lib/python3.7/site-packages/torch/nn/modules/module.py", line 1407, in load_state_dict
self.class.name, "\n\t".join(error_msgs)))
RuntimeError: Error(s) in loading state_dict for PipelineNet:
size mismatch for embeddings: copying a param with shape torch.Size([3441, 300]) from checkpoint, the shape in current model is torch.Size([3433, 300]).
size mismatch for speaker.caption.embeddings: copying a param with shape torch.Size([3441, 300]) from checkpoint, the shape in current model is torch.Size([3433, 300]).
size mismatch for speaker.caption.classifier.2.weight: copying a param with shape torch.Size([3441, 512]) from checkpoint, the shape in current model is torch.Size([3433, 512]).
size mismatch for speaker.caption.classifier.2.bias: copying a param with shape torch.Size([3441]) from checkpoint, the shape in current model is torch.Size([3433]).

CurryYuan · 2023-04-28T04:17:27Z

Delete [:self.max_des_len] here.

D3Net/lib/dataset/pipeline.py

Line 453 in b505e98

all_words = chain(*[data["token"][:self.max_des_len] for data in train_data])

STAR-ALG · 2023-04-28T09:44:55Z

@CurryYuan Thank you very much for your help! I have fixed the code as you state. But when I run the evaluation as python scripts/eval.py --folder <output_folder> --task detection, I meet a new issue as follow:

Output:

Could not import cythonized box intersection. Consider compiling box_intersection.pyx for faster training.
=> loading configurations...
=> initializing data...
=> loading train split...
100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 562/562 [00:34<00:00, 16.37it/s]
building vocabulary...
Traceback (most recent call last):
File "scripts/eval.py", line 519, in
dataset, dataloader = init_data(cfg)
File "scripts/eval.py", line 82, in init_data
cap_train_dataset = Dataset(cfg, cfg.general.dataset, mode, "train", raw_train, raw_train_scan_list, SCAN2CAD)
File "./lib/dataset/pipeline.py", line 61, in init
self._load()
File "./lib/dataset/pipeline.py", line 392, in _load
self.lang, self.lang_ids = self._tranform_des(self.max_des_len)
File "./lib/dataset/pipeline.py", line 550, in _tranform_des
embeddings[token_id] = self.glove[glove_id]
IndexError: index 3433 is out of bounds for axis 0 with size 3433

I would be very appreciated if you can help me. Thank you very much!

Rajrup · 2023-09-14T18:29:20Z

@daveredrum any suggestions will be helpful.

Rajrup changed the title ~~Dimension mismatch while loacding model from checkpoint~~ Dimension mismatch while loading model from checkpoint Feb 22, 2023

Chuan-shanjia mentioned this issue Apr 25, 2023

Cannot reproduce the PointGroup Detector performance #3

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dimension mismatch while loading model from checkpoint #5

Dimension mismatch while loading model from checkpoint #5

Rajrup commented Feb 22, 2023

STAR-ALG commented Apr 25, 2023

CurryYuan commented Apr 28, 2023

STAR-ALG commented Apr 28, 2023

Rajrup commented Sep 14, 2023

Dimension mismatch while loading model from checkpoint #5

Dimension mismatch while loading model from checkpoint #5

Comments

Rajrup commented Feb 22, 2023

STAR-ALG commented Apr 25, 2023

CurryYuan commented Apr 28, 2023

STAR-ALG commented Apr 28, 2023

Rajrup commented Sep 14, 2023