CinEvalu8 is a sentiment analysis project for IMDb movie reviews. Using Natural Language Processing (NLP) techniques, it analyzes the sentiment expressed in each review, helping users understand audience reactions to various films.
The dataset used for this project is the IMDb Dataset of 50K Movie Reviews available on Kaggle. It contains 50,000 movie reviews labeled as positive or negative.
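As a rough illustration of importing the dataset, the sketch below reads review/label pairs with Python's standard `csv` module. The column names `review` and `sentiment` are assumptions about the layout of the Kaggle CSV, and `load_reviews` is a hypothetical helper, not a function from this repository. A tiny in-memory sample stands in for the real file so the example runs without a download.

```python
import csv
import io
from collections import Counter


def load_reviews(csv_file):
    """Read (review, sentiment) pairs from an open CSV file object.

    Assumes a header row with 'review' and 'sentiment' columns.
    """
    reader = csv.DictReader(csv_file)
    return [(row["review"], row["sentiment"]) for row in reader]


# In-memory stand-in for the real dataset file (illustrative rows only).
sample = io.StringIO(
    "review,sentiment\n"
    '"A wonderful, moving film.",positive\n'
    '"Dull plot and wooden acting.",negative\n'
)

rows = load_reviews(sample)
label_counts = Counter(label for _, label in rows)
print(label_counts)
```

With the downloaded file, the same helper could be called as `load_reviews(open(path, newline="", encoding="utf-8"))`, where `path` points to wherever the CSV was saved.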
- Data Collection: Scripts for scraping IMDb movie reviews or importing the Kaggle dataset.
- Preprocessing: Code for cleaning and preparing the text data for sentiment analysis.
- Sentiment Analysis Model: Feature engineering and a logistic regression model for sentiment classification.
- Visualization: Tools for visualizing sentiment analysis results and trends.
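To make the preprocessing and modeling steps above concrete, here is a minimal, standard-library-only sketch: it cleans review text, builds bag-of-words counts, and fits a logistic regression with plain gradient descent. This is not the repository's code. The helper names, the toy reviews, and the hyperparameters are all assumptions chosen for illustration, and a real pipeline would more likely rely on an established machine learning library.

```python
import math
import re
from collections import Counter


def clean_review(text):
    """Lowercase, drop HTML tags such as <br />, and keep alphabetic tokens."""
    text = re.sub(r"<[^>]+>", " ", text.lower())
    return re.findall(r"[a-z']+", text)


def sigmoid(z):
    """Numerically safe logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def score(features, weights, bias):
    """Linear score: bias plus weighted token counts."""
    return bias + sum(weights[tok] * cnt for tok, cnt in features.items())


def train_logistic_regression(docs, labels, epochs=200, lr=0.5):
    """Fit bag-of-words weights with per-example gradient descent on log loss."""
    weights, bias = Counter(), 0.0
    features = [Counter(clean_review(doc)) for doc in docs]
    for _ in range(epochs):
        for x, y in zip(features, labels):
            error = sigmoid(score(x, weights, bias)) - y  # d(log loss)/d(score)
            bias -= lr * error
            for tok, cnt in x.items():
                weights[tok] -= lr * error * cnt
    return weights, bias


def predict_proba(text, weights, bias):
    """Estimated probability that a review is positive."""
    return sigmoid(score(Counter(clean_review(text)), weights, bias))


# Toy training data (made up for this example); 1 = positive, 0 = negative.
docs = [
    "A wonderful, moving film.<br />Loved it!",
    "Great acting and a great story.",
    "Dull plot and wooden acting.",
    "Terrible, boring, a waste of time.",
]
labels = [1, 1, 0, 0]

weights, bias = train_logistic_regression(docs, labels)
print(predict_proba("great film, loved it", weights, bias))
print(predict_proba("boring and dull", weights, bias))
```

The first printed probability should land above 0.5 and the second below it. On the full 50,000-review dataset, a held-out test split would be needed to measure accuracy honestly.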
- Clone the repository: https://github.com/SakshiFadnavis2003/CinEvalu8.git
- Explore the notebooks and scripts in each directory for data collection, preprocessing, model development, and visualization.
Contributions are welcome! If you have any suggestions, improvements, or new features to add, feel free to open an issue or submit a pull request.
- IMDb Dataset of 50K Movie Reviews by Lakshmi N Pathi