Skip to content

This repository is where I share and organize my data analyses, covering a range of topics from descriptive to predictive analytics, using mostly large, real-world datasets gathered from various sources.

Notifications You must be signed in to change notification settings

ErnieSumoso/data-analysis

Repository files navigation


Icon

Data Analysis Repository

I use this repository to store many of my data analyses, performed mostly on large datasets using Python. The repository is organized into folders by type of analytics: descriptive, diagnostic, exploratory, and predictive. If new types of analytics are performed, I will add new folders containing these analyses.


Pull Requests · Issues

About The Project

Showcase
This repository was originally created to store and track my learning process in data analysis, from descriptive to prescriptive. Over the months, I have added most of my data analyses, ranging from basic to complex, on large datasets up to 21k rows. The majority of these analyses were developed using JupyterLab for enhanced visualizations and development. Feel free to explore and make suggestions on any of my solutions!

(back to top)

Built With

(back to top)

Getting Started

As this repository contains mostly Jupyter Notebook files, you only need basic Python and Jupyter software installations. However, there are a couple of scripts that were developed using R-Studio.

Prerequisites

To run the code you need at least the following components (R-Studio is optional).

Installation

  1. Clone the repo
    git clone https://github.com/ErnieSumoso/data-analysis.git
  2. Explore the files and enjoy!

(back to top)

Usage

You can use this repository to explore the analysis yourself, try new approaches, and improve your data analysis skills!

Here are some resources that I use for help:

(back to top)

Roadmap

  • Add more advanced descriptive and diagnostic analytics.
  • Add new predictive analysis using large datasets composed of 20k+ rows.
  • Update the repository by adding previously performed data analyses that have not yet been uploaded.

I am always open for any suggestions or recommendations on any of the files. Please, add them on the issues section.

(back to top)

Contact

Ernie Sumoso - GitHub Profile - My Repositories

Project Link: https://github.com/ErnieSumoso/data-analysis

About

This repository is where I share and organize my data analyses, covering a range of topics from descriptive to predictive analytics, using mostly large, real-world datasets gathered from various sources.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published