Image_Captioning_RaspberryPi

In this project we tried to compress a full model for image captioning and put it on Raspberry Pi. The original model is created for computer and we get it from this repository: https://github.com/sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning

To make the project work you need to download the models from the following link: https://drive.google.com/drive/folders/1U9LJyL4YBXLel-nP237_6U8ypZHd4oxH?usp=sharing

In order to run the model you need to choose among the offering models which one you want to use for inferencing an image. The options are quantized, pruned, or normal models.

In case you want to change the models you gave to change the path in execute files, and then you can run the execute python files.

For running this project you need to have a camera, you can also use this project to caption an image from the internet but for that you need to do some modifications.

After adjusting the paths and choosing the models of choice the program to execute is execute_double.

execute_double.py

At the beginning you get a welcoming message.
Next you are prompted to choose between doing image captioning or video captioning: (i) If you chose image captioning you can enter 'c' or 'C' to capture an image from the PiCamera and you should get the captionining sentence along with a describing picture. (ii) If you chose video captioning you should enter a positive integer and later you should get a paragraph describing what was happening in front of the PiCamera with a number of sentences equal to the input integer.
You can enter 'x' or 'X' to escape from the current mini program and go to point 2).
Press ctrl + 'c' to exit the program definitely.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
Sample_Images		Sample_Images
__pycache__		__pycache__
_efforts_for_pruning_quantizing		_efforts_for_pruning_quantizing
PruningQuantizationInference.ipynb		PruningQuantizationInference.ipynb
README.md		README.md
ResnetQuant.py		ResnetQuant.py
WORDMAP_coco_5_cap_per_img_5_min_word_freq.json		WORDMAP_coco_5_cap_per_img_5_min_word_freq.json
camera.py		camera.py
caption.py		caption.py
create_input_files.py		create_input_files.py
dataset_flickr30k.json		dataset_flickr30k.json
dataset_flickr8k.json		dataset_flickr8k.json
datasets.py		datasets.py
eval.py		eval.py
execute.py		execute.py
execute_double.py		execute_double.py
models.py		models.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image_Captioning_RaspberryPi

About

Releases

Packages

Contributors 2

Languages

hejazifar/Image_Captioning_RaspberryPi

Folders and files

Latest commit

History

Repository files navigation

Image_Captioning_RaspberryPi

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages