How can you use a Transformer model as a backbone? No documentation available #4505

Open
deshwalmahesh opened this issue Aug 28, 2022 · 1 comment
Labels
enhancement Improvements or good new features

Comments

@deshwalmahesh

Hey,
How can I use a pre-trained Transformer model as a backbone for object detection? I want to use Microsoft's DiT. It seems they have used it with Detectron2, but no clear steps are described. Could you point me to a blog post, tutorial, or documentation on how to fine-tune that Transformer architecture as a detection backbone with Faster R-CNN / Cascade R-CNN?

Thanks in advance :)

deshwalmahesh added the enhancement label Aug 28, 2022
@ZawarK7

ZawarK7 commented Nov 27, 2024

Hey,
It has been a long time. Did you get the Transformer backbone working? I am currently using ViT with Faster R-CNN. Let me know if you have implemented it or if you need any help.
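
For anyone landing on this thread, here is a rough sketch of the bookkeeping a plain ViT-style backbone needs before a Faster R-CNN / Cascade R-CNN head can consume it. It is not taken from DiT or from either poster's code. Detectron2's `Backbone` interface expects `forward` to return a dict of named feature maps and `output_shape()` to report the channels and stride of each one; the code below only mirrors that idea in plain Python. All names (`FeatureSpec`, `vit_token_grid`, `tokens_to_feature_map`, `backbone_output_spec`, `"vit_last"`) are hypothetical, and the sketch assumes the class token has already been dropped and that the image size is divisible by the patch size.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FeatureSpec:
    """Hypothetical stand-in for the (channels, stride) metadata a detector head needs."""
    channels: int
    stride: int


def vit_token_grid(image_h, image_w, patch_size):
    """Return the (rows, cols) grid of patch tokens a ViT produces for an image."""
    if image_h % patch_size or image_w % patch_size:
        raise ValueError("image size must be divisible by the patch size")
    return image_h // patch_size, image_w // patch_size


def tokens_to_feature_map(tokens, grid_h, grid_w):
    """Reshape a flat, row-major list of patch tokens into a grid_h x grid_w map.

    Assumes the class token was removed, so len(tokens) == grid_h * grid_w.
    """
    if len(tokens) != grid_h * grid_w:
        raise ValueError("token count does not match the patch grid")
    return [tokens[r * grid_w:(r + 1) * grid_w] for r in range(grid_h)]


def backbone_output_spec(embed_dim, patch_size, name="vit_last"):
    """Describe the single feature map of a plain ViT: stride equals the patch size."""
    return {name: FeatureSpec(channels=embed_dim, stride=patch_size)}


# Example: a 224x224 image with 16x16 patches and 768-dim embeddings.
grid_h, grid_w = vit_token_grid(224, 224, 16)
feature_map = tokens_to_feature_map(list(range(grid_h * grid_w)), grid_h, grid_w)
spec = backbone_output_spec(embed_dim=768, patch_size=16)
```

In a real setup the tokens would be tensors and the reshape would be a permute plus view, but the constraint is the same: the detector needs a spatial map with a known channel count and stride, not a token sequence.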
