About differences between configs on nuScenes and Argoverse2 #12

Open
curiosity654 opened this issue Dec 24, 2024 · 0 comments

Comments

@curiosity654

Hi, thank you for your great work. Since the FSF model for AV2 hasn't been released, I'm trying to train it on AV2 myself. I noticed that the pretrained FSD checkpoints for nuScenes and AV2 differ greatly in size (900+ MB for nuScenes versus 100+ MB for AV2). Looking into the configs, I found that the Sparse UNet backbone settings are quite different: the channel counts in the nuScenes backbone are generally twice as large as those in the AV2 backbone. Is there a design insight behind this, or was it simply tuned empirically? Thank you very much for any feedback.

config of AV2:

backbone=dict(
    type='SimpleSparseUNet',
    in_channels=64,
    sparse_shape=[32, 2048, 2048],
    order=('conv', 'norm', 'act'),
    norm_cfg=dict(type='naiveSyncBN1d', eps=1e-3, momentum=0.01),
    base_channels=64,
    output_channels=128,
    encoder_channels=((64, ), (64, 64, 64), (64, 64, 64), (128, 128, 128)),
    encoder_paddings=((1, ), (1, 1, 1), (1, 1, 1), ((0, 1, 1), 1, 1)),
    decoder_channels=((128, 128, 64), (64, 64, 64), (64, 64, 64), (64, 64, 64)),
    decoder_paddings=((1, 0), (1, 0), (0, 0), (0, 1)),
),

config of nuScenes:

backbone=dict(
    type='SimpleSparseUNet',
    in_channels=64,
    sparse_shape=sparse_shape,
    order=('conv', 'norm', 'act'),
    norm_cfg=dict(type='naiveSyncBN1d', eps=1e-3, momentum=0.01),
    base_channels=64,
    output_channels=128,
    encoder_channels=((128, ), (128, 128, 128), (128, 128, 128), (256, 256, 256), (512, 512, 512)),
    encoder_paddings=((1, ), (1, 1, 1), (1, 1, 1), ((0, 1, 1), 1, 1), (1, 1, 1)),
    decoder_channels=((512, 512, 256), (256, 256, 128), (128, 128, 128), (128, 128, 128), (128, 128, 128)),
    decoder_paddings=((1, 1), (1, 0), (1, 0), (0, 0), (0, 1)),
),
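
For a rough sense of how much of the size gap the channel settings alone could explain, here is a minimal sketch that counts convolution weights for the two `encoder_channels` tuples above. It is only an estimate under stated assumptions: every encoder layer is treated as a bias-free 3x3x3 sparse convolution, the first layer takes `base_channels` as input, and norm layers, the input convolution, and the decoder are ignored. The helper names `conv_weights` and `encoder_weights` are hypothetical, not part of the repository, and the real `SimpleSparseUNet` may be built differently.

```python
# Rough estimate only. Assumptions (not confirmed by the repository):
# each encoder layer is a bias-free 3x3x3 sparse convolution, chained in
# order, starting from base_channels. Norm layers and the decoder are ignored.

def conv_weights(c_in, c_out, kernel=3):
    """Weight count of one 3D convolution with a cubic kernel and no bias."""
    return kernel ** 3 * c_in * c_out


def encoder_weights(base_channels, encoder_channels):
    """Sum conv weights over all encoder stages, chaining channel widths."""
    total = 0
    c_in = base_channels
    for stage in encoder_channels:
        for c_out in stage:
            total += conv_weights(c_in, c_out)
            c_in = c_out
    return total


# encoder_channels copied from the two configs in this issue.
AV2_ENCODER = ((64,), (64, 64, 64), (64, 64, 64), (128, 128, 128))
NUSC_ENCODER = ((128,), (128, 128, 128), (128, 128, 128),
                (256, 256, 256), (512, 512, 512))

av2 = encoder_weights(64, AV2_ENCODER)
nusc = encoder_weights(64, NUSC_ENCODER)

print(f"AV2 encoder weights (estimate):      {av2:,}")
print(f"nuScenes encoder weights (estimate): {nusc:,}")
print(f"ratio: {nusc / av2:.1f}x")
```

Under these assumptions, doubling both the input and output channels of a layer quadruples its weight count, and the nuScenes config also adds a fifth 512-channel stage, so the gap grows faster than the 2x channel ratio suggests. Checkpoint file size can also include data other than model weights, such as optimizer state, so this estimate is not by itself a full account of the 900+ MB versus 100+ MB difference.
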
curiosity654 changed the title from "About differences of config on nuScenes and Argoverse2" to "About differences between configs on nuScenes and Argoverse2" on Dec 24, 2024