- Performed analysis on the textual movie review dataset to correctly classify positive and negative reviews of the users.
- Transformed the unstructured textual data into structured data by making use of TF-IDF, bag of words and n-grams techniques in python using Scipy library.
- Applied various machine learning algorithms for predictive analysis such as K-nearest neighbors, LinearSVC, Naive Bayes Classifier in python to classify positive and negative reviews. Achieved the best accuracy of 83.4%.
- Python
- Scikit text processing features