Our model is built with 8000+ train images from 196 classes. It will detect the car model with bounding box(localization). Model is trained with MobileNet as base feature extractor and two paths were taken from it one for classification and another for bounding box.
Stanford Cars Dataset ⇨ https://www.kaggle.com/datasets/jessicali9530/stanford-cars-dataset
- Dataset contains around 8000+ train and 8000+ test images bwlonging to 196 classes
- Bounding box information about the car object in the images is provided in the csv format
flowchart TD
A[Input Layer] --> |None,224,224,3| B[mobilenet.preprocess_input]
B --> |None,224,224,3| C[MobileNet]
C --> |None,7,7,1024| D[GlobalAvragePooling2D]
D --> |None,1024| E[box_out]
D --> |None,1024| F[class_out]
E --> |None,4| G[Output]
F --> |None,196| G[Output]
Input Image | Predicted Image |
---|---|
https://car-object-detection.herokuapp.com/
App predicts the model of the car among 196 clases and plots the bounding box around the car of the given image