Skip to content

ChaitanyaChilukuri663/NYC_Taxi_Trip_Duration

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Capabl_Project

Taxi Trip Fare and Duration Prediction

This repository contains the code and resources for a Machine Learning model developed during my internship period at Elite Techno Groups. The project aims to predict taxi trip fare and duration using various features and historical data.

Project Overview

The objective of this project is to build a predictive model that can accurately estimate the fare and duration of taxi trips. By leveraging machine learning techniques, we can help taxi companies optimize their pricing strategies and provide more accurate estimated trip times to their customers.

Dataset

The model is trained on a dataset that includes historical records of taxi trips, containing information such as pickup and dropoff locations, trip distance, time of day, and various other relevant features. The dataset is not included in this repository due to its large size and proprietary nature.

Requirements

To run the code in the Jupyter notebook, you need to have the following dependencies installed:

  • Python
  • Jupyter Notebook
  • Pandas
  • NumPy
  • Scikit-learn
  • Matplotlib
  • Seaborn

You can install these dependencies using pip. The versions specified in the requirements.txt file should be followed for compatibility.

Instructions

  1. Clone the repository to your local machine using the following command:

git clone https://github.com/ChaitanyaChilukuri663/Capabl_Project.git

  1. Install the required dependencies as mentioned in the Requirements section.

  2. Obtain the dataset from the appropriate source and place it in the /data directory. The notebook assumes that the data file is named taxi_data.csv.

  3. Launch Jupyter Notebook and navigate to the cloned repository.

  4. Open the Capbl_Taxi prediction.ipynb notebook.

  5. Follow the code in the notebook to preprocess the data, train the machine learning models, and evaluate their performance.

About the Models

We have experimented with multiple machine learning algorithms such as Linear Regression, Random Forest, and Gradient Boosting. The models are trained and evaluated using various metrics, including Mean Squared Error (MSE) and Mean Absolute Error (MAE).

Results

The performance of each model is analyzed and compared in the notebook. Visualizations are provided to understand the model predictions and potential areas of improvement.

Conclusion

The developed models show promising results in predicting taxi trip fare and duration. However, there is always room for improvement. Feel free to experiment with different feature engineering techniques, hyperparameter tuning, or consider incorporating additional data sources to enhance the models further.

If you have any questions or suggestions, please feel free to reach out or submit a pull request.

Happy coding!

Chaitanya Chilukuri

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published