OPENCLASSROOMS - Data Scientist - Project 5
This repository contains notebooks for a machine learning project that segments customers of the Olist marketplace based on their orders history.
The dataset used for this project is the Brazilian E-Commerce Public Dataset by Olist, which includes information of 100k orders from 2016 to 2018 in Brazil.
- barbier_victor_1_notebook_exploration_102022.ipynb : Exploratory data analysis of the customers informations
- barbier_victor_2_notebook_essais_102022.ipynb : Machine learning models for the clustering of the customers
- barbier_victor_3_notebook_simulation_102022.ipynb : Analysis of the best model performance through time
- barbier_victor_4_presentation_102022.pdf: Final presentation of the project
- Python 3.x
- Jupyter Notebook
- NumPy
- Pandas
- Matplotlib / Seaborn
- Scikit-learn : PCA, KMeans, t-SNE
- HDBSCAN
- YellowBrick