MLOps Project - Team 7

This project is designed to demonstrate a range of data manipulation, preprocessing, and machine learning model development tasks using MLOps principles, ensuring reproducibility, data versioning, and efficient pipeline structuring.

Clone this repository

For clone, write the next line on terminal.

First, locate the folder where you will save the project

C:\Users\PC\Documents\GitHub

git clone https://github.com/Medicenchapin/MLOps_Equipo-7.git

cd MLOps_Equipo-7

Create and active virtual environment

Create a virtual environment named dvc-venv:

Using Conda

conda create --name dvc-venv python=3.8.11
conda activate dvc-venv

Using virtualenv

python -m venv dvc-venv
source dvc-venv/bin/activate  # On macOS/Linux
dvc-venv\Scripts\activate     # On Windows

Installtion

The Code is written in Python 3.8.11. If you don't have Python installed you can find it your link here. If you are using a lower version of Python you can upgrade using the pip package, ensuring you have the latest version of pip. To install the required packages and libraries, run this command in the project directory after cloning the repository.

After cloning the repository, install the required dependencies by running:

pip install -r requirements.txt

Running the Application

Run the application by executing:

Run app.py file

Project Organization

├── notebooks                                  <- Jupyter notebooks for experimentation
│   ├── data                                   <- Data folder containing various datasets
│   │   ├── raw                                <- Original data and source files
│   │   │   ├── codetable.txt                  <- Metadata or documentation for the dataset
│   │   │   ├── read_SouthGermanCredit.R       <- R script for data reading or analysis
│   │   │   ├── SouthGermanCredit.asc          <- Raw data in ASCII format
│   │   │── DataToModel.csv                    <- Data ready for modeling
│   │   │── Preprocess.csv                     <- Data after manipulation
│   │   │── X_train_SouthGermanCredit.csv      <- Training features
│   │   │── X_test_SouthGermanCredit.csv       <- Testing features
│   │   │── Y_train_SouthGermanCredit.csv      <- Training labels
│   │   └── Y_test_SouthGermanCredit.csv       <- Testing labels
│   ├── Fase-1                                 <- Phase 1 notebooks and related files
│   │   ├── mlflow                             <- MLflow project directory
│   │   │   └── Dockerfile                     <- Docker setup for the project
│   │   ├── .gitignore                         <- Git ignore file
│   │   ├── .gitkeep                           <- Keeps the empty directory in git
│   │   ├── config.env                         <- Environment configurations
│   │   ├── Credit_Data_RF.pkl                 <- Random Forest model
│   │   ├── data_load.py                       <- Script to load data
│   │   ├── docker-compose.yaml                <- Docker Compose file
│   │   ├── fase-1_manipulacion_and_EDA.ipynb  <- Data manipulation and EDA
│   │   ├── fase-1_Part01_v3.ipynb             <- Data manipulation and preparation (Part 01)
│   │   ├── fase-1_Part02_v3.ipynb             <- Data exploration and preprocessing (Part 02)
│   │   ├── fase-1_Part03_v3.ipynb             <- Model construction and evaluation (Part 03)
│   │   ├── fase-1_Part04_v1.ipynb             <- Additional modeling or evaluation (Part 04)
│   │   ├── fase-1_Part05_v1.ipynb             <- Further analysis or results (Part 05)
│   │   ├── Scaler_Credi_Datat.pkl             <- Scaler for data preprocessing
│   │   └── test.ipynb                         <- Test notebook for initial experimentation
│   └──fase-1_Part01_v3.ipynb                  <- Data manipulation 
├── dvc.yaml                                   <- DVC pipeline configuration file
├── params.yaml                                <- Pipeline parameter file
├── README.md                                  <- Main README for the project
├── requirements.txt                           <- Project dependencies
└── test.ipynb                                 <- Test notebook for initial experimentation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MLOps Project - Team 7

Clone this repository

Create and active virtual environment

Using Conda

Using virtualenv

Installtion

Running the Application

Project Organization

About

Releases

Packages

Contributors 5

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
notebooks		notebooks
src/stages		src/stages
test		test
.DS_Store		.DS_Store
README.md		README.md
dvc.yaml		dvc.yaml
mlflow.db		mlflow.db
params.yaml		params.yaml
params_v2.yaml		params_v2.yaml
requirements.txt		requirements.txt
test.ipynb		test.ipynb

Medicenchapin/MLOps_Equipo-7

Folders and files

Latest commit

History

Repository files navigation

MLOps Project - Team 7

Clone this repository

Create and active virtual environment

Using Conda

Using virtualenv

Installtion

Running the Application

Project Organization

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages