Credit Card Default Prediction

This classification problem aims to predict whether a customer will default on next month's credit card payment using Logistic Regression, Support Vector Machine and XGBoost Classifier.

Access the streamlit web app to delve into the detailed steps of data cleaning, preprocessing, modelling, and time series forecasting, as well as to uncover the insights derived from the analysis.

Introduction:

Credit risk refers to the cardholder's inability to make required payments on their credit card debt, leading to 'credit default.' This poses a major concern for financial institutions, as defaults can result in significant losses and, in severe cases, even bankruptcy. Conducting thorough evaluations and verifying a borrower's ability to repay can help prevent the over-issuance of credit cards to unqualified applicants, thereby minimizing credit risk.

Machine learning models can be deployed to identify risky customers and minimise lenders' losses. By using algorithms to study historical transactions and customer demographics, we can apply the findings to future customers, effectively distinguishing between risky and non-risky profiles. This approach leads to more efficient loan lending practices.

Environment

The analysis has been conducted in Python and the source code requires the following libraries: pandas, numpy, sklearn, scipy, and imblearn.

Primary libraries used for visualizations are matplotlib, seaborn, plotly, and hiplot.

streamlit must be installed to run the streamlit_code.py file. Run it using the following command in the terminal:

streamlit run streamlit_code.py

Datasets:

I. Default of Credit Card Clients

UCI dataset contains information on credit card clients in Taiwan from April 2005 to September 2005. It has 30,000 instances across 25 attributes, contains multivariate characteristics, and the attributes have both integer, categorical and real data types. The attribute summary is as follows:

ID: ID of each client

LIMIT_BAL: Amount of given credit in NT dollars (includes individual and family/supplementary credit)

SEX: Gender (male, female)

EDUCATION: Level of education (graduate school, university, high school, others)

MARRIAGE: Marital status (married, single, others)

AGE: Age in years

PAY_0: Repayment status in September, 2005 (-1=pay duly, 1=payment delay for one month, 2=payment delay for two months, … 8=payment delay for eight months, 9=payment delay for nine months and above)

PAY_2: Repayment status in August, 2005 (scale same as above)

PAY_3: Repayment status in July, 2005 (scale same as above)

PAY_4: Repayment status in June, 2005 (scale same as above)

PAY_5: Repayment status in May, 2005 (scale same as above)

PAY_6: Repayment status in April, 2005 (scale same as above)

BILL_AMT1: Amount of bill statement in September, 2005 (NT dollar)

BILL_AMT2: Amount of bill statement in August, 2005 (NT dollar)

BILL_AMT3: Amount of bill statement in July, 2005 (NT dollar)

BILL_AMT4: Amount of bill statement in June, 2005 (NT dollar)

BILL_AMT5: Amount of bill statement in May, 2005 (NT dollar)

BILL_AMT6: Amount of bill statement in April, 2005 (NT dollar)

PAY_AMT1: Amount of previous payment in September, 2005 (NT dollar)

PAY_AMT2: Amount of previous payment in August, 2005 (NT dollar)

PAY_AMT3: Amount of previous payment in July, 2005 (NT dollar)

PAY_AMT4: Amount of previous payment in June, 2005 (NT dollar)

PAY_AMT5: Amount of previous payment in May, 2005 (NT dollar)

PAY_AMT6: Amount of previous payment in April, 2005 (NT dollar)

default payment next month: Default payment (yes, no)

II. Macroeconomic Data for Taiwan

Data on labour, income, and inflation for Taiwan in 2005 have been sourced from the National Statistics Republic of China (Taiwan) and DGBAS Government Bureau.

CPI: Consumer Price Index representing the average change over time in the prices paid by consumers for a representative basket of consumer goods and services

Unemployment Rate: Percentage of people in the labour force who are unemployed (includes civilians age 15 & above who were: (i) jobless (ii) available for work (iii) seeking a job or waiting for results after job seeking during the reference week (iv) waiting for a recall after layoff (v) having a job offer but have not started to work)

Avg Income Level: Disposable income of employees (including those having: (i) full-time, part-time, or another payroll (ii) entrepreneurial income (iii) property income (iv) imputed rent income (v) current transfer receipts)

Name		Name	Last commit message	Last commit date
Latest commit History 146 Commits
.devcontainer		.devcontainer
code		code
datasets		datasets
CPI_dist.png		CPI_dist.png
IDA_bill_scatterplot.png		IDA_bill_scatterplot.png
IDA_drop_countplot.png		IDA_drop_countplot.png
IDA_education_pairplot.png		IDA_education_pairplot.png
IDA_knn_countplot.png		IDA_knn_countplot.png
IDA_marriage_pairplot.png		IDA_marriage_pairplot.png
IDA_missing_corr_plot.json		IDA_missing_corr_plot.json
IDA_missing_heatmap.png		IDA_missing_heatmap.png
IDA_pay_scatterplot.png		IDA_pay_scatterplot.png
Logistics_reg.png		Logistics_reg.png
PACF.png		PACF.png
README.md		README.md
STL.png		STL.png
SVM_balanced.png		SVM_balanced.png
SVM_imbalanced.png		SVM_imbalanced.png
Scree plot.png		Scree plot.png
XGBoost.png		XGBoost.png
age_plot.json		age_plot.json
chi-squaretest.png		chi-squaretest.png
class_imbalance.png		class_imbalance.png
correlation_heatmap.json		correlation_heatmap.json
correlation_heatmapmacro.json		correlation_heatmapmacro.json
data_income.xlsx		data_income.xlsx
data_macro.xlsx		data_macro.xlsx
density_plot.png		density_plot.png
distribution_smote.png		distribution_smote.png
education_plot.json		education_plot.json
forecast.png		forecast.png
hiplot.html		hiplot.html
imbalance_smote.png		imbalance_smote.png
kde_aug.png		kde_aug.png
kde_july.png		kde_july.png
kde_june.png		kde_june.png
marriage_plot.json		marriage_plot.json
pca_plot.json		pca_plot.json
requirements.txt		requirements.txt
sex_plot.json		sex_plot.json
streamlit_code.py		streamlit_code.py
test_set.csv		test_set.csv
ttest.png		ttest.png
xgb_model.pkl		xgb_model.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Credit Card Default Prediction

Table of Contents:

Introduction:

Environment

Datasets:

About

Releases

Packages

Languages

mahnoorsheikh16/Credit-Card-Default-Prediction

Folders and files

Latest commit

History

Repository files navigation

Credit Card Default Prediction

Table of Contents:

Introduction:

Environment

Datasets:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages