Image Captioning

Overview

Image captioning model which predicts the caption of a given image.
The dataset used is COCO 2014.
Faiss Library (https://faiss.ai/) is used for efficient similarity search and clustering.

Image captioning works by using KNN to find the 5 closest image to our given image and then picking from the image caption which is most similar to our image.
For getting 5 closest neighbours, we use Faiss library which is an efficient way for similarity searching and clustering.

The notebook accesses the COCO dataset through mounted google drive, change it to point to where the COCO dataset is available.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
notebook.ipynb		notebook.ipynb