This project is an LSTM-based text classification system that utilizes the IMDB dataset, which consists of 50K movie reviews for natural language processing. The dataset is suitable for binary sentiment classification and contains substantially more data than previous benchmark datasets, with 25,000 reviews provided for training and 25,000 for testing. The project's primary goal is to predict the number of positive and negative reviews using either classification or deep learning algorithms. Additionally, the project uses Flask and Gunicorn as an endpoint and has been developed using microservice architecture, making it an end-to-end project.
- Tensorflow
- Keras
- Pandas
- Numpy
- DVC
1. Python
2. shell scripting
3. aws cloud Provider
1. AWS S3
2. GitHub
3. Flask, gunicorn
conda create --prefix ./env python=3.9
conda activate ./env
pip install -r requirements.txt
dvc init
This project is production ready to be used for the similar use cases and it will provide the automated and orchesrated production ready pipelines(Training & Serving)