CAPTCHA Prediction using Convolutional Recurrent Neural Network

This project focuses on training a Convolutional Recurrent Neural Network (CRNN) model to accurately predict 5-letter and 5-number CAPTCHA images. The goal is to develop a robust model that can reliably identify the text content within CAPTCHA images, which are commonly used as a security measure to distinguish between human users and automated bots.

Dataset

The dataset used in this project consists of CAPTCHA images, which were downloaded from the Kaggle dataset. The dataset is divided into the following proportions:

Training set: 75% (approximately 803 images)
Validation set: 12.5% (approximately 135 images)
Test set: 12.5% (approximately 135 images)

Project Structure

The file structure of the project is as follows:

.
├── data
│   └── CAPTCHA Images
│       ├── test
│       ├── train
│       └── val
├── dataset.py
├── model.py
├── output
│   ├── log.txt
│   ├── loss.png
│   └── weight.pth
├── predict.py
├── README.md
├── split_train_val_test.py
├── train.py
└── utils.py

Getting Started

To run this project after cloning, follow these steps:

Update the folder location of the CAPTCHA images in the following files:
- dataset.py: Line 56
- predict.py: Lines 23 and 27
- split_train_val_test.py: Line 38
- train.py: Lines 80 and 67
Run the train.py file to train the model:
```
python train.py
```
Run the predict.py file to get the final prediction accuracy:
```
python predict.py
```

Model Performance

The current prediction accuracy of the model is 92.5%.

Training Visualization

Training and Validation Loss

The image above shows the training and validation loss over the course of model training. The blue line represents the training loss, while the orange line represents the validation loss. Both losses decrease rapidly in the initial epochs and then stabilize, indicating that the model is learning effectively without overfitting.

Docker Container

When creating a Docker container for this project, make sure to use the 'all-gpu' tag to utilize GPUs if available. For example:

docker run --gpus all -it your_image_name:all-gpu

If you're running on a Mac or need more information on using GPUs with Docker, please consult the Docker documentation for specific instructions.

Personal Notes

This project is currently running in a Docker container on a server with an NVIDIA GTX 3060 Ti GPU.
It's fascinating to observe how methods originally designed to distinguish between robots and humans are becoming less effective. As we make progress in machine learning and hardware computation improves, training these models becomes easier and faster.

As a developer aspiring to build more in the AI/ML space, it's remarkable to realize that with readily available computing power and a few hours of work, we can build a model that achieves 92.5% accuracy in predicting CAPTCHAs – a task once thought to be an effective way to distinguish between humans and robots.

Contributing

Contributions to this project are welcome. Please feel free to submit a Pull Request.

Citations: [1] https://pplx-res.cloudinary.com/image/upload/v1719281556/user_uploads/afidsgvnv/loss.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CAPTCHA Prediction using Convolutional Recurrent Neural Network

Dataset

Project Structure

Getting Started

Model Performance

Training Visualization

Docker Container

Personal Notes

Contributing

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
data		data
output		output
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
dataset.py		dataset.py
dockerfile		dockerfile
model.py		model.py
predict.py		predict.py
pytorch_video.mp4		pytorch_video.mp4
requirements.txt		requirements.txt
split_train_val_test.py		split_train_val_test.py
train.py		train.py
utils.py		utils.py

shreyashguptas/CAPTCHA-Recognition-using-CRNN

Folders and files

Latest commit

History

Repository files navigation

CAPTCHA Prediction using Convolutional Recurrent Neural Network

Dataset

Project Structure

Getting Started

Model Performance

Training Visualization

Docker Container

Personal Notes

Contributing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages