🐦 Twitter Sentiment Analysis using MiniLM Embeddings 🚀

📌 Project Overview

Twitter sentiment analysis using all-MiniLM-L6-v2 sentence embeddings. This project compares multiple machine learning models for sentiment classification (positive, negative, neutral) and includes comprehensive visualizations.

✨ Features

Text Preprocessing - Complete tweet cleaning pipeline
Sentence Embeddings - Using all-MiniLM-L6-v2 (384-dim vectors)
4 Classification Models - Logistic Regression, XGBoost, SVM, Random Forest
Cosine Similarity Analysis - Embedding space visualization
2D Visualization - UMAP dimensionality reduction
Custom Predictions - Test with your own tweets

📊 Results

Model	Accuracy	F1-Score
SVM (RBF)	68.20%	0.684
Logistic Regression	66.40%	0.663
XGBoost	63.50%	0.637
Random Forest	61.80%	0.618

🖼️ Visualizations

Sentiment Distribution

Confusion Matrix (SVM - Best Model)

Model Comparison

Word Clouds

Cosine Similarity Between Classes

Tweet Length Analysis

🛠️ Tech Stack

Python 3.x
sentence-transformers
scikit-learn
XGBoost
UMAP
matplotlib, seaborn

🤝 Contributing

All contributions are welcome — bug fixes, feature enhancements, or documentation improvements!

Please give appropriate credit to the original author if you use or modify this tool in your own projects.

📜 License

This project is licensed under the MIT License — see the LICENSE file for details.

👤 Author

Ayush Mani
🔗 GitHub: @m1n1v1rus

🚀 Quick Start

# Clone repository
git clone https://github.com/m1n1v1rus/Sentiment-Embeddings-Project.git

# Install requirements
pip install -r requirements.txt

# Run notebook
jupyter notebook Sentiment_Embeddings_Project.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
images		images
LICENSE		LICENSE
README.md		README.md
Sentiment_Embeddings_Project.ipynb		Sentiment_Embeddings_Project.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🐦 Twitter Sentiment Analysis using MiniLM Embeddings 🚀

📌 Project Overview

✨ Features

📊 Results