Skip to content

quocanh34/captionize-it-app

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Captionize it app

Table Of Contents

Description

This project implements an image captioning system using PyTorch and Flask. It generates descriptive captions for input images using a deep learning encoder-decoder model and provides a web interface for captioning images.

Features

  • Trained model for image captioning and utilization of vast.ai GPU.
  • Web interface for captioning images using Flask.
  • Adding cross attention for models (not completed)
  • Docker package (not completed)
  • Cloud server deployment (not completed)

For more details of model training architectures and webapp

Requirements

  • Flask, nltk, numpy, tqdm, python-dotenv, torch, torchvision.

Installation

First, clone and set up virtual environment

# clone project   
git clone https://github.com/quocanh34/captionize-it-app.git
cd captionize-it-app

# set up virtual env   
python3 -m venv captionize

# activate the env
source captionize/bin/activate  # for Unix/Linux
captionize\Scripts\activate  # for Windows

Second, install dependencies.

pip install -r requirements.txt

Next, download the trained model

Now run the app

# run flask 
python app.py

Citation

@{article{Anh Pham},
  title={Captionize it app},
  author={Anh Pham},
  year={2023}
}

About

An image captioning system using PyTorch and Flask.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published