nguyenanhtuan1008/Machine-Learning

Machine-Learning

Machine Learning

Setup:

    conda create -n ml
    conda activate ml

## 1.1 Gather the data and build cross-validation

  • Download the Categorical Feature Encoding Challenge dataset from https://www.kaggle.com/c/cat-in-the-dat/data and save it in the input folder.

  • Check the CSV file: head train.csv

  • Create the folds: python .\src\create_folds.py

  • Train the model: python .\src\train.py
    Result: 0.7437763648654564
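The fold-creation step can be pictured with the sketch below. It is not the repository's create_folds.py: the function names and the `kfold` column name are assumptions for illustration, `target` is assumed to be the label column, and five folds are used to match the FOLD values 0 to 4 in run.bat. A library routine such as scikit-learn's StratifiedKFold would be the usual choice; this version uses only the standard library so it stays self-contained.

```python
import csv
import random
from collections import defaultdict


def assign_stratified_folds(rows, target_col="target", n_splits=5, seed=42):
    """Return copies of rows with an extra 'kfold' key, stratified by target_col."""
    rng = random.Random(seed)

    # Group row indices by class label so every class is spread across all folds.
    by_label = defaultdict(list)
    for idx, row in enumerate(rows):
        by_label[row[target_col]].append(idx)

    folds = [None] * len(rows)
    offset = 0  # carry the round-robin position across classes to balance fold sizes
    for label in sorted(by_label):
        indices = by_label[label]
        rng.shuffle(indices)
        for position, idx in enumerate(indices):
            folds[idx] = (offset + position) % n_splits
        offset += len(indices)

    # dict(row, kfold=...) copies each row, so the input list is left untouched.
    return [dict(row, kfold=fold) for row, fold in zip(rows, folds)]


def create_folds_file(src_path, dst_path, **kwargs):
    """Read a CSV, add the hypothetical 'kfold' column, and write the result."""
    with open(src_path, newline="") as f:
        rows = list(csv.DictReader(f))
    if not rows:
        raise ValueError(f"{src_path} contains no data rows")
    with_folds = assign_stratified_folds(rows, **kwargs)
    with open(dst_path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=list(with_folds[0].keys()))
        writer.writeheader()
        writer.writerows(with_folds)
```

Under these assumptions, `create_folds_file("input/train.csv", "input/train_folds.csv")` would produce the file that the training step reads.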

## 1.2 Building inference for the machine learning framework

run.bat:

    set TRAINING_DATA=input\train_folds.csv
    set TEST_DATA=input\test.csv

    set MODEL=%1

    set FOLD=0
    python -m src.train
    set FOLD=1
    python -m src.train
    set FOLD=2
    python -m src.train
    set FOLD=3
    python -m src.train
    set FOLD=4
    python -m src.train

    python -m src.predict

Run the script with the model name as its first argument:

    .\run.bat randomforest
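Because run.bat passes its settings through environment variables, the training module has to read them back. The sketch below shows one way that could look. It is an assumption about how src.train might be organized, not the repository's code: `read_run_config`, `split_by_fold`, and the `kfold` column name are hypothetical.

```python
import os

N_FOLDS = 5  # run.bat trains folds 0 through 4


def read_run_config(env=None):
    """Collect the settings that run.bat sets before calling src.train."""
    env = os.environ if env is None else env
    fold = int(env["FOLD"])  # int() also tolerates stray whitespace around the value
    if not 0 <= fold < N_FOLDS:
        raise ValueError(f"FOLD must be in 0..{N_FOLDS - 1}, got {fold}")
    return {
        "training_data": env["TRAINING_DATA"],
        "test_data": env.get("TEST_DATA"),  # only needed at prediction time
        "model": env["MODEL"],
        "fold": fold,
    }


def split_by_fold(rows, fold, fold_col="kfold"):
    """Hold out one fold for validation and train on the remaining folds."""
    train = [r for r in rows if int(r[fold_col]) != fold]
    valid = [r for r in rows if int(r[fold_col]) == fold]
    return train, valid
```

Reading the configuration in one place keeps the rest of the training code independent of how the script was launched, which helps when the same module is driven from run.bat on Windows and run.sh on Linux.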

2. Cross Validation Framework

3. Handling categorical features in ML problems

Ref:

run.bat on Windows:

    set TRAINING_DATA=input\train_folds.csv
    set FOLD=0
    python -m src.train1

run.sh on Linux: the same commands, with set replaced by export and backslashes in paths replaced by forward slashes.
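An alternative to maintaining both run.bat and run.sh is a small Python driver that sets the same variables on either platform. The sketch below is an illustration only and is not part of the repository: `build_runs` and `run_all` are hypothetical names, and it assumes the modules are started as `python -m src.train` and `python -m src.predict` with five folds.

```python
import os
import subprocess
import sys


def build_runs(model, n_folds=5):
    """Return one (env_overrides, command) pair per step, mirroring run.bat."""
    base = {
        # os.path.join picks the right path separator for the current platform.
        "TRAINING_DATA": os.path.join("input", "train_folds.csv"),
        "TEST_DATA": os.path.join("input", "test.csv"),
        "MODEL": model,
    }
    runs = [
        ({**base, "FOLD": str(fold)}, [sys.executable, "-m", "src.train"])
        for fold in range(n_folds)
    ]
    # run.bat leaves FOLD at its last value when the prediction step runs.
    runs.append(({**base, "FOLD": str(n_folds - 1)}, [sys.executable, "-m", "src.predict"]))
    return runs


def run_all(model):
    """Train every fold, then predict, stopping at the first failure."""
    for overrides, command in build_runs(model):
        env = {**os.environ, **overrides}
        subprocess.run(command, env=env, check=True)
```

Calling `run_all("randomforest")` from the repository root would then play the role of `.\run.bat randomforest`.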
