Skip to content

tejas-1911/Fraud-Detection-Using-Machine-Learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 

Repository files navigation

Fraud Detection Using Machine Learning

Overview

This project explores machine learning techniques to detect fraudulent transactions in a highly imbalanced dataset. The dataset contains anonymized transaction details with the goal of identifying fraudulent transactions (Class 1).

Dataset

The dataset consists of:

  • Time: Time elapsed since the first transaction.
  • V1-V28: Anonymized features.
  • Amount: Transaction amount.
  • Class: Target variable indicating fraud (1) or non-fraud (0).

Class Distribution:

  • Fraudulent Transactions: 492 instances (0.17%)
  • Non-Fraudulent Transactions: 284,315 instances (99.83%)

Key Models and Results

Model ROC AUC Precision Recall Notes
Logistic Regression 0.9559 0.83 0.65 Simple and interpretable.
Decision Tree 0.8875 0.75 0.78 Prone to overfitting.
Random Forest 0.9578 0.94 0.82 Robust and feature-rich.
SVM 0.9646 0.96 0.67 Effective but complex.
XGBoost 0.9711 0.89 0.83 Best performance overall.

Files

  • analysis/DSC478_Final_Project.ipynb: Jupyter notebook containing the analysis and code.
  • report/Finalreport-PML.pdf: Comprehensive project report.

How to Use

  1. Clone the repository:
    git clone https://github.com/tejas-1911/Fraud-Detection-Using-Machine-Learning.git

Releases

No releases published

Packages

No packages published