Integrated DINOv3 and ConvNext models into Faster RCNN pipeline

View profile for Sovit Ranjan Rath

Lead Software Engineer (LLMs & Generative AI) @ Indegene | debuggercafe.com | github.com/sovit-123

Added all DINOv3 Transformer (except the 7B backbone one) and ConvNext based Faster RCNN models to Faster RCNN Training Pipeline. All the Transformer based Faster RCNN models use multi-level features from the backbone to build the Faster RCNN head and the ConvNext ones use features from the last layer. Surprisingly, multi-level feature selection for ConvNext Faster RCNN is giving an inferior result compared to the last layer-only feature selection. Maybe need to debug a bit more. Now, just need the compute to pretrain all the adapter heads. Link to the project in the comments.

  • No alternative text description for this image
Sovit Ranjan Rath

Lead Software Engineer (LLMs & Generative AI) @ Indegene | debuggercafe.com | github.com/sovit-123

2w
Like
Reply

To view or add a comment, sign in

Explore content categories