Linear Regression model applied using statsmodels library on Bike sharing Dataset to predict the number of bike rented in a day and find important features of the data.
A US bike-sharing provider BoomBikes has recently suffered considerable dips in their revenues due to the ongoing Corona pandemic. The company is finding it very difficult to sustain in the current market scenario. So, it has decided to come up with a mindful business plan to be able to accelerate its revenue as soon as the ongoing lockdown comes to an end, and the economy restores to a healthy state.
In such an attempt, BoomBikes aspires to understand the demand for shared bikes among the people after this ongoing quarantine situation ends across the nation due to Covid-19. They have planned this to prepare themselves to cater to the people's needs once the situation gets better all around and stand out from other service providers and make huge profits.
- Which variables are significant in predicting the demand for shared bikes.
- How well those variables describe the bike demands
Dataset taken from Paper "Event labeling combining ensemble detectors and background knowledge", Progress in Artificial Intelligence (2013): pp. 1-15, Springer Berlin Heidelberg, doi:10.1007/s13748-013-0040-3.
- The bikes rented will be more with each passing year.
- The bikes rented are more in summer and winter season thus hosting events or advertisements will be beneficial.
- Month of August and October are the best months to rent bikes as they see most increase in bikes rented.
- Month of Jan is not the best month to rent bikes as it sees decrease in bikes rented.
- Days with low windspeed also see increase in bikes rented and are good opportunities for business.
- The bikes rented are decreased in rainy and cloudy weather.
We achieved the following results:
- Test Accuracy: 0.795
- RMSE : 0.0986
- Mean Absolute Error : 0.0746
- Durbin-Watson value : 1.9896
- Pandas - version 1.3.4
- NumPy - version 1.20.3
- MatplotLib - version 3.4.3
- Seaborn - version 0.11.2
- Scikit-Learn - version 0.24.2
- StatsModels - version 0.12.2
This project was inspired by UpGrad IITB Programme as a case study for the Machine Learning and Artificial Intelligence course.
Thanks to Fanaee-T, Hadi and Gama, Joao for the sharing the dataset with the world.
Created by [@sukhijapiyush] - feel free to contact me!
This project is open source and available without restrictions.