-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
about performance at resolution 272x736 #92
Comments
Besides changing the image resolution, we changed the conv module for bev map. I will get back to you today. If not, please remind me. |
Hi, In our 272x736 setting, we do not change the map size, which is 200x200. However, the map encoder should be changed to output 34x96 latent, which should be the same as the images. Basically, we change all padding in convs to (1, 1), and change the first stride=2 conv to stride=1, and add an AdaptiveAvgPool2d. FYI (
(conv_in): Conv2d(8, 16, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(blocks): ModuleList(
(0): Conv2d(16, 16, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(1): Conv2d(16, 32, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(2): Conv2d(32, 32, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(3): Conv2d(32, 96, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1))
(4): Conv2d(96, 96, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(5): Conv2d(96, 256, kernel_size=(3, 3), stride=(2, 1), padding=(1, 1))
(6): AdaptiveAvgPool2d(output_size=[34, 92])
)
(conv_out): Conv2d(256, 320, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
) |
@flymin Thank you very much, I will retrain the MagicDrive as your setting and report the results here. |
@flymin Urgent distress, I retrained the 172x736 model (350ep for 5 days), but the mAP of bevfusion is only 13.5. Is there anything else that needs to be modified in the inference process to adapt to the resolution? |
272x736 |
Did you use CFG for inference? The default should be 2, and we use 2.5 to report better results in the table. Another thing I could think of, could you verify the mAP of bevfusion on real data? According to my old logs, 150 epochs should give you ~18 mAP. This took us 62.5h of training on 8*V100. |
@flymin When we verify the bevfusion on real data, we get mAP 35.30 and NDS 42.25. When setting CFG to 2.5 we get mAP 13.59,NDS 24.50. Beisdes, in dataset of config, we set |
Yes, you also need to change them, but the parameters are not correct. First, you should have dataset:
image_size: [272, 736]
augment2d:
resize: [[0.5, 0.5]]
rotate: null They come from bevfusion and should be set before training MagicDrive. dataset:
back_resize: [544, 1472]
back_pad: [64, 356, 64, 0] They are aligned with the training transforms. |
This issue is stale because it has been open for 7 days with no activity. If you do not have any follow-ups, the issue will be closed soon. |
Hello, I retrain and get the result as your paper of the AP、NDS and FID at 224x400 resolution, but failed using 272x736 resolution. Can your give me more detail about 272x736 exp?
The text was updated successfully, but these errors were encountered: