Our GQA experiment took ~24h on 64 16GB V100 GPUs with the default hyperparameters. It takes this long because training is end-to-end: MCAN and the backbone are trained jointly. You could instead pretrain the backbone with the RelViT objective and freeze it while training MCAN, though some performance degradation is expected.
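For anyone budgeting a run on fewer GPUs, here is a rough back-of-the-envelope sketch in Python. It converts the figures above (~24h on 64 GPUs) into GPU-hours and rescales them to a smaller machine. The helper names are hypothetical, and the estimate assumes perfectly linear scaling on the same GPU type with the same global batch size, which real runs rarely achieve, so treat the result as a loose guide only.

```python
def gpu_hours(wall_clock_hours: float, num_gpus: int) -> float:
    """Total compute budget: wall-clock hours multiplied by GPU count."""
    return wall_clock_hours * num_gpus


def estimated_wall_clock(total_gpu_hours: float, num_gpus: int) -> float:
    """Naive wall-clock estimate, ASSUMING perfectly linear scaling."""
    if num_gpus <= 0:
        raise ValueError("num_gpus must be positive")
    return total_gpu_hours / num_gpus


# Figures quoted in this thread: ~24h on 64 V100 GPUs (approximate).
budget = gpu_hours(24, 64)

# Hypothetical smaller setup: a single 8-GPU node.
hours_on_8 = estimated_wall_clock(budget, 8)

print(f"GQA budget: ~{budget:.0f} GPU-hours")
print(f"Naive estimate on 8 GPUs: ~{hours_on_8:.0f} h")
```

In practice a smaller setup may also need a smaller batch size or gradient accumulation, which changes the wall-clock time in ways this sketch does not capture.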
Hello, could you please tell me the training time for HICO and GQA, and on which GPUs? Thanks!