- Repository includes two files:
- Jupyter notebook with Python code written for data analysis and model building
- CSV file includes data imported into notebook
- Analyze the data of INN Hotels to find which factors have a high influence on booking cancellations, build a predictive model that can predict which booking is going to be canceled in advance, and help in formulating profitable policies for cancellations and refunds.
- Exploratory Data Analysis (Variable identification, Univariate analysis, Bi-Variate analysis)
- Data Pre-processing
- Logistic regression
- Multicollinearity
- Optimal threshold using AUC-ROC curve
- Decision trees
- Pruning