Phishing Email Detection System

This project uses Natural Language Processing (NLP) and Machine Learning techniques to detect phishing emails. By analyzing the contents of emails, the system classifies them as either "Safe" or "Phishing". The model uses Random Forest for classification and utilizes TF-IDF for text feature extraction.

Project Description

Phishing attacks are one of the most common cyber threats that target individuals and organizations. This project aims to develop a reliable system to detect phishing emails using various techniques in Natural Language Processing (NLP) and Machine Learning (ML). The model is trained using a labeled dataset of emails and uses a Random Forest classifier to make predictions.

The core of the system involves:

TF-IDF Vectorization: Transforms the email body text into numerical features that can be used by the machine learning model.
Random Forest Classifier: A robust algorithm for classification based on multiple decision trees.
Sentiment Analysis: Analyzes the sentiment of email content to assist in classification.

The system can be deployed as a web application using Streamlit, where users can input email details to get real-time classification results.

Technologies Used

Python: The main programming language used for the project.
Scikit-Learn: For machine learning models, including Random Forest classifier.
NLTK (Natural Language Toolkit): For text preprocessing and sentiment analysis.
TF-IDF (Term Frequency-Inverse Document Frequency): A feature extraction technique used for text classification.
Streamlit: A framework to create the web interface for interacting with the phishing detection system.
BeautifulSoup: For cleaning and extracting text from HTML email bodies.
IMAP: For fetching emails from Gmail for real-time classification.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
models		models
LICENSE		LICENSE
NLPphishing.ipynb		NLPphishing.ipynb
Phishing_Email.csv		Phishing_Email.csv
README.md		README.md
phishing_email_detection.py		phishing_email_detection.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Phishing Email Detection System

Table of Contents

Project Description

Technologies Used

About

Uh oh!

Languages

License

mzainxo/Phishing-email-detection-using-ML-and-NLP

Folders and files

Latest commit

History

Repository files navigation

Phishing Email Detection System

Table of Contents

Project Description

Technologies Used

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages