This project implements a Recurrent Neural Network (RNN) for the binary classification of tweets. The objective is to classify tweets into two categories, enabling sentiment analysis and understanding public opinion on various topics. This model leverages various embedding techniques and hyperparameters to optimize classification performance.
- Utilizes RNN and LSTM architectures for sequence modeling.
- Supports different word embedding techniques such as Word2Vec, FastText, and GloVe.
- Configurable training parameters including batch size, learning rate, and number of epochs.
- Logs training progress and results using the
loguru
library. - Data preprocessing and visualization capabilities.
To run this project, ensure you have the following dependencies installed. You can install them using pip or poetry.
pip install -r requirements.txt
poetry install
- pandas
- numpy
- matplotlib
- seaborn
- scikit-learn
- torch
- torchvision
- torchaudio
- opencv-python
- datasets
- nltk
- gensim
- Clone the Repository
- Navigate to the Project Directory
- Install Dependencies
- Prepare your Data
- Configure Training Parameters
- Run the Training
Execute the main script to start training model:
python main.py