Multi-layered malware detection system using static analysis, dynamic browser automation, and external APIs for accurate website threat identification. Project Code, Documents and Video Implementation
This project implements a comprehensive malware detection system using multiple detection layers, dynamic analysis techniques, and external threat intelligence APIs.
Youtube Video : https://youtu.be/Zs528NqpIMs?si=Vri3rs0qkW8lnOow
This system is designed to analyze and detect malicious websites by combining static analysis, dynamic behavior inspection, and third-party API integrations. Its multi-layered architecture ensures robust and accurate threat detection across a wide range of categories.
- HTML Content Scanning: Uses Cheerio to parse and analyze HTML structure.
- JavaScript Code Analysis: Detects suspicious or obfuscated JS patterns.
- Network Request Monitoring: Extracts and analyzes outbound URLs and endpoints.
- Keyword-Based Threat Detection: Matches against known malicious keywords and patterns.
- VirusTotal β URL reputation and threat reports.
- Google Safe Browsing API β Website threat classification.
- URLScan.io β Dynamic scanning and visual analysis.
- AbuseIPDB β IP reputation and abuse data.
- PicPurify & APILayer β Image and content moderation.
- Playwright Automation: Crawls websites in headless mode.
- Hidden Element Detection: Identifies cloaking, hidden links, and iframes.
- Obfuscated Script Identification: Flags minified, encoded, or anomalous JavaScript.
- Behavioral Pattern Matching: Detects patterns common in malicious websites.
-
Initial Content Analysis
- Parse HTML
- Extract and analyze JS
- Match suspicious keywords
-
External API Verification
- Scan URLs and IPs
- Check reputation
- Analyze embedded media (images, etc.)
-
Dynamic Behavior Analysis
- Simulate user browsing
- Monitor hidden elements
- Track suspicious scripts
- Malware & viruses
- Phishing websites
- Adult content
- Gambling websites
- Crypto scams
- Suspicious JavaScript
- Hidden iframes
- Obfuscated/encoded code
- Number of threat indicators
- Severity score per threat
- VirusTotal detection count
- Keyword match confidence
- Image/content scan results
By combining static, dynamic, and third-party intelligence, this system aims to provide accurate and real-time threat detection. Itβs built for use cases like:
- Malware URL databases
- Web security monitoring
- Safe browsing tools
- Parental controls
- Automated threat research
Installation and usage instructions will be added here.
Youtube Video : https://youtu.be/Zs528NqpIMs?si=Vri3rs0qkW8lnOow
For support, feedback, or project files, please contact [vatshayan]