Training 2D pose model with combined datasets COCO, AI, OccHuman #858
Replies: 4 comments 1 reply
-
In my experience, finetuning the model on the target dataset (after jointly training on multiple datasets) always improves performance. Different joint-training schemes, however, produce similar results: it does not matter whether you use the union of the different keypoint sets (the first approach mentioned) or their intersection (the second approach mentioned).
-
@jin-s13 thank you for your quick response. I have one concern regarding the first approach of joint training on multiple datasets with supervision disabled on unlabeled keypoints. Do you think the lack of supervision on the unlabeled keypoints could confuse the training? Take a sample image from AI Challenger as an example: the eyes are visible in the image, but the eye keypoints are not labeled, so the supervision from the eyes is missing. In contrast, when a sample image comes from COCO during the same training run, the eye keypoints are available and supervision is enabled. In short, even when both eyes are visible, the supervision is inconsistent from image to image, sometimes available and sometimes not. Best
-
I think it is ok to skip that supervision, i.e. to set zero loss for the unlabeled keypoints.
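In practice this masking can be implemented with a per-keypoint target weight that is 1 for labeled and 0 for unlabeled keypoints. A minimal PyTorch sketch of the idea (function and tensor names here are illustrative, not the actual MMPose API):

```python
import torch

def masked_heatmap_loss(pred, target, target_weight):
    """MSE heatmap loss that zeroes out unlabeled keypoints.

    pred, target: (N, K, H, W) heatmaps
    target_weight: (N, K) tensor, 1 for labeled keypoints, 0 for unlabeled
    """
    # per-keypoint squared error, averaged over the heatmap
    loss = ((pred - target) ** 2).mean(dim=(2, 3))  # shape (N, K)
    # mask out keypoints with no annotation so they contribute zero loss
    loss = loss * target_weight
    # normalize by the number of supervised keypoints to keep the scale stable
    return loss.sum() / target_weight.sum().clamp(min=1)
```

Because the mask multiplies the loss before the backward pass, unlabeled keypoints receive no gradient at all, regardless of which dataset the sample came from.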
-
@jin-s13
-
Hi MMPose Team,
I would like to ask a question about training a 2D pose model with combined datasets COCO, AI, OccHuman, etc.
Combining these datasets yields a much larger amount of training data, so I think it would be interesting to discuss various approaches to the problem. Below, I summarize two main approaches based on my research.
One-stage training by disabling training supervision on unlabeled keypoints: first convert all datasets to the same format and mark missing keypoints as unlabeled. For example, AI has 14 keypoints while COCO has 17. The combined dataset will use the 17-keypoint format, with the COCO keypoints missing from AI set to unlabeled. During training, the supervision on unlabeled keypoints is disabled.
Two-stage training: find a shared/minimum keypoint format across all datasets and perform an initial training on this shared set. In the second stage, change the number of output channels and fine-tune on a target dataset. For example, to combine COCO and AI, we can first train a model on the 14-keypoint shared format. In the second step, we fine-tune the pre-trained model from the first step, but with 17 output channels, on the COCO dataset.
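The format conversion in the first approach amounts to an index remapping plus visibility flags, assuming COCO-style `[x, y, visibility]` triplets. A sketch of such a converter; the index map below is only illustrative and should be checked against the actual dataset definitions before use:

```python
import numpy as np

# Illustrative index map from AI Challenger (14 kpts) to COCO (17 kpts).
# Verify these indices against the real dataset metadata before relying on them.
AIC_TO_COCO = {
    0: 6,   # right shoulder
    1: 8,   # right elbow
    2: 10,  # right wrist
    3: 5,   # left shoulder
    4: 7,   # left elbow
    5: 9,   # left wrist
    6: 12,  # right hip
    7: 14,  # right knee
    8: 16,  # right ankle
    9: 11,  # left hip
    10: 13, # left knee
    11: 15, # left ankle
    # AIC head-top (12) and neck (13) have no COCO counterpart and are dropped
}

def aic_to_coco(kpts_aic):
    """Convert (14, 3) AIC keypoints [x, y, vis] to the (17, 3) COCO layout.

    Keypoints that COCO defines but AIC lacks (nose, eyes, ears) stay at
    visibility 0, i.e. unlabeled, so their loss can later be masked out.
    """
    kpts_coco = np.zeros((17, 3), dtype=np.float32)
    for src, dst in AIC_TO_COCO.items():
        kpts_coco[dst] = kpts_aic[src]
    return kpts_coco
```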
The second approach is less convenient because it requires multiple stages, but it seems to work, as tested in this research. The first approach is more convenient, but I am not sure whether it works.
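For the second approach, the channel change in stage two can be handled by loading the stage-one weights with the mismatched head filtered out. A toy PyTorch sketch (the model here is a placeholder, not an actual MMPose architecture):

```python
import torch
import torch.nn as nn

class PoseHead(nn.Module):
    """Toy stand-in for a pose model: backbone plus a 1x1 prediction head."""
    def __init__(self, in_channels=256, num_keypoints=14):
        super().__init__()
        self.backbone = nn.Conv2d(3, in_channels, 3, padding=1)  # placeholder backbone
        self.final_layer = nn.Conv2d(in_channels, num_keypoints, 1)

    def forward(self, x):
        return self.final_layer(torch.relu(self.backbone(x)))

# Stage 1: model trained on the shared 14-keypoint format
pretrained = PoseHead(num_keypoints=14)
state = pretrained.state_dict()

# Stage 2: new model with 17 output channels for COCO fine-tuning.
# Drop the mismatched head weights and load the rest non-strictly,
# leaving the new 17-channel head randomly initialized.
model = PoseHead(num_keypoints=17)
state = {k: v for k, v in state.items() if not k.startswith("final_layer")}
missing, unexpected = model.load_state_dict(state, strict=False)
```

`strict=False` reports the freshly initialized head in `missing`, which is a convenient sanity check that only the intended layers were replaced.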
Could you please share your opinion on the first approach, or any other joint-training solutions you can think of?
I would really appreciate it.
Best
Khanh Ha