Caption generation is a challenging artificial intelligence problem where a textual description must be generated for a given photograph.
It requires methods from computer vision to understand the content of the image, and a language model from the field of natural language processing to turn that understanding into words in the right order. Recently, deep learning methods have achieved state-of-the-art results on examples of this problem.
I would like to thank Jason Brownlee for his wonderful blog, which helped me learn how to build an Image Caption Generator.
This project needs a lot of RAM: 32 GB/64 GB. You can use either an AWS EC2 instance or Google Colaboratory [the one I used].
This project requires a lot of modules and packages. These can be installed from the requirements.txt file using the following command:
pip install -r requirements.txt for Python 2.x
pip3 install -r requirements.txt for Python 3.x
All the helper functions needed for this project are in the utility.py file.
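utility.py is not reproduced here, but to give a flavour of what such a helper might look like, below is a minimal sketch of a caption-loading function. The name `load_descriptions` and the tab-separated file format are assumptions for illustration, not the project's confirmed API:

```python
# Hypothetical example of a helper that could live in utility.py:
# parse a Flickr-style descriptions file into {image_id: [caption, ...]}.

def load_descriptions(text_path):
    """Map each image ID to its list of captions.

    Assumes each line looks like: '<image_id>.jpg#<n>\t<caption>'.
    """
    descriptions = {}
    with open(text_path, 'r') as f:
        for line in f:
            line = line.strip()
            if not line:
                continue
            image_token, caption = line.split('\t', 1)
            image_id = image_token.split('.')[0]  # drop '.jpg#n'
            descriptions.setdefault(image_id, []).append(caption)
    return descriptions
```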
The dataset can be downloaded from Kaggle from here. You can use the already created feature file [features extracted from images] located in the Features folder. It is compressed, so you need to unzip it first. A sketch of how such features can be extracted appears after the list below.
The dataset consists of 2 files:
- Images
- Description and Image IDs
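For reference, here is a hedged sketch of how a feature file like the one in the Features folder could be produced, assuming a VGG16 encoder with its classification layer removed. The directory name `Flickr8k_Dataset` and the output name `features.pkl` are illustrative; the actual extractor used for the provided file may differ:

```python
# Illustrative feature extraction: encode every image as the 4096-d
# output of VGG16's second-to-last (fc2) layer.
import os
import pickle

from keras.applications.vgg16 import VGG16, preprocess_input
from keras.models import Model
from keras.preprocessing.image import load_img, img_to_array

def extract_features(image_dir):
    # Drop the final classification layer; keep the 4096-d fc2 output.
    base = VGG16()
    model = Model(inputs=base.inputs, outputs=base.layers[-2].output)
    features = {}
    for name in os.listdir(image_dir):
        image = load_img(os.path.join(image_dir, name), target_size=(224, 224))
        array = preprocess_input(img_to_array(image).reshape((1, 224, 224, 3)))
        features[name.split('.')[0]] = model.predict(array, verbose=0)
    return features

features = extract_features('Flickr8k_Dataset')  # hypothetical folder name
pickle.dump(features, open('features.pkl', 'wb'))
```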
Now comes the training part. To train the model defined in model.py, run the train.py file.
Remember, training may take a long time to run depending on the configuration of the machine; each epoch takes around 15-20 minutes.
Training needs 4 arguments:
- textPath
- trainPath
- devPath
- features
python train.py --textPath /Path to Textfile/ --trainPath /Path to trainfile/ --devPath /Path to valimages/ --features /Path to features/
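train.py itself is not shown here; as an illustration, the four arguments above could be wired up with argparse roughly like this (a sketch, not the file's actual contents):

```python
# Hypothetical sketch of the argument parsing train.py would need
# to accept the four paths shown above.
import argparse

parser = argparse.ArgumentParser(description='Train the caption model.')
parser.add_argument('--textPath', required=True,
                    help='Path to the descriptions text file')
parser.add_argument('--trainPath', required=True,
                    help='Path to the file listing training image IDs')
parser.add_argument('--devPath', required=True,
                    help='Path to the file listing validation image IDs')
parser.add_argument('--features', required=True,
                    help='Path to the pickled image-feature file')
args = parser.parse_args()
```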
After training, we will evaluate our model on the test dataset. Run the following command:
python evaluate.py --testPath /Path to testfile/
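evaluate.py typically scores the generated captions against the reference captions with BLEU. Below is a minimal sketch using NLTK's corpus_bleu; the `generate_caption` helper is hypothetical and stands in for whatever caption-generation routine the project actually uses:

```python
# Illustrative BLEU scoring over a test set.
from nltk.translate.bleu_score import corpus_bleu

def evaluate_model(model, descriptions, features, tokenizer, max_length):
    actual, predicted = [], []
    for image_id, captions in descriptions.items():
        # generate_caption() is a hypothetical helper producing one caption string.
        yhat = generate_caption(model, tokenizer, features[image_id], max_length)
        actual.append([c.split() for c in captions])
        predicted.append(yhat.split())
    print('BLEU-1: %f' % corpus_bleu(actual, predicted, weights=(1.0, 0, 0, 0)))
    print('BLEU-4: %f' % corpus_bleu(actual, predicted, weights=(0.25, 0.25, 0.25, 0.25)))
```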
The Model Architecture:
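model.py is not reproduced in this README; as a hedged sketch, a common choice for this task is the "merge" architecture, which combines a dense projection of the image features with an LSTM over the partial caption. The layer sizes below are typical defaults, not necessarily the ones used in model.py:

```python
# Sketch of the common "merge" caption-generation architecture,
# assuming a 4096-d image feature vector (e.g. from VGG16 fc2).
from keras.layers import Input, Dense, Dropout, Embedding, LSTM, add
from keras.models import Model

def define_model(vocab_size, max_length):
    # Image feature branch.
    inputs1 = Input(shape=(4096,))
    fe1 = Dropout(0.5)(inputs1)
    fe2 = Dense(256, activation='relu')(fe1)
    # Caption sequence branch.
    inputs2 = Input(shape=(max_length,))
    se1 = Embedding(vocab_size, 256, mask_zero=True)(inputs2)
    se2 = Dropout(0.5)(se1)
    se3 = LSTM(256)(se2)
    # Merge the two branches and predict the next word.
    decoder1 = add([fe2, se3])
    decoder2 = Dense(256, activation='relu')(decoder1)
    outputs = Dense(vocab_size, activation='softmax')(decoder2)
    model = Model(inputs=[inputs1, inputs2], outputs=outputs)
    model.compile(loss='categorical_crossentropy', optimizer='adam')
    return model
```

The two branches are merged by element-wise addition, and the final softmax predicts the next word of the caption one step at a time.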
