Skip to content

Solution for the challenge hosted by drivendata for predicting poverty for three different countries. Rank 381/2310 (top 17%)

Notifications You must be signed in to change notification settings

Tejash-Shah/Poverty-challenge-on-Drivendata

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Poverty-challenge-on-Drivendata

This repo contains the solution for the challenge hosted by drivendata for predicting poverty

---- Information from driven data ----

The data for this competition comes from The World Bank Development Data Group.

With funding from the World Bank's Knowledge for Change Program, this competition aims to engage data scientists from developing countries and apply a cost-effective solution to testing a diverse set of approaches to poverty prediction.

The surveys used to come from three developing countries. Each country offers a different demographic makeup, so successful poverty prediction across these countries will help identify a robust set of predictors that can be used in future poverty measurement efforts.

--- Data Preprocessing ---

  1. Standardization
  2. Encoding
  3. Column Enforcement in train and test set
  4. Replacing missing values

--- Sampling ---

SMOTE

---- Dimensionality Reduction ---

  1. SVD
  2. PCA

--- Algorithm Experimented ---

  1. Random Forest
  2. Logistic Regression
  3. AdaBoost Classifier
  4. Gradient Boosting
  5. Decision Tree
  6. Gaussian NB
  7. Extra Tree Classifier
  8. XGBoost

--- Winning Algorithm ---

XGBoost without any sampling or dimensionality reduction

About

Solution for the challenge hosted by drivendata for predicting poverty for three different countries. Rank 381/2310 (top 17%)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published