
Cyprus: PII Protection and Verification System. A web-based solution using Python, Django, Tesseract OCR, and AES-256 encryption to extract, mask, and securely verify PII from government documents. Improved efficiency by 60% and accuracy by 70%, replacing traditional methods with a scalable digital process.


Srujanrana07/Cyprus-PII-Protection-and-Verification-System




🛡️ Cyprus: PII Protect System

AI-Powered Sensitive Data Detection, Masking & Secure Document Management


🌟 Overview

PII Protect System is a secure web platform designed to detect, mask, encrypt, and manage Personally Identifiable Information (PII) from user-uploaded documents.

Using OCR, AI-based PII detection, and AES-256 encryption, the system enables:

  • Safe document upload
  • Automatic PII extraction
  • User-controlled masking/revealing
  • Secure encrypted storage
  • Manager-level controlled access

🔄 System Workflow

Workflow

User Upload → OCR → PII Detection → Masking → Encryption → Secure Storage → Manager Access

✨ Features

| Feature | Description |
|---|---|
| OCR Extraction | Uses Tesseract to read text from images |
| PII Detection | Detects Aadhaar, PAN, phone, email, address, etc. |
| Custom Masking | Partially or fully masks sensitive elements |
| User Access Control | User decides if the PII should be revealed or hidden |
| Manager Verification System | Manager must authenticate using ID |
| AES-256 Encryption | Every extracted PII text is encrypted |
| Secure Blob Storage | Masked files stored as encrypted blob objects |
| Audit Logs | Tracks every access request |
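The detection and masking features above can be illustrated with a small rule-based sketch. This is not the project's actual detector: the regular expressions, the `PII_PATTERNS` table, and the `find_pii`, `mask_value`, and `mask_text` helpers are all hypothetical names chosen for this example, and the patterns only cover simple, well-formed Aadhaar, PAN, email, and Indian mobile numbers in OCR text.

```python
import re

# Illustrative patterns only (assumption); real OCR output needs more tolerant rules.
PII_PATTERNS = {
    "aadhaar": re.compile(r"\b\d{4}[ -]?\d{4}[ -]?\d{4}\b"),
    "pan": re.compile(r"\b[A-Z]{5}\d{4}[A-Z]\b"),
    "email": re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"),
    "phone": re.compile(r"(?<!\d)(?:\+91[ -]?)?[6-9]\d{9}(?!\d)"),
}


def find_pii(text):
    """Return (kind, start, end) spans for every match, sorted by position."""
    spans = []
    for kind, pattern in PII_PATTERNS.items():
        for match in pattern.finditer(text):
            spans.append((kind, match.start(), match.end()))
    return sorted(spans, key=lambda span: span[1])


def mask_value(value, keep_last=4, mask_char="X"):
    """Hide letters and digits except the last `keep_last`; keep_last=0 masks fully."""
    hidden = max(sum(ch.isalnum() for ch in value) - keep_last, 0)
    out = []
    for ch in value:
        if ch.isalnum() and hidden > 0:
            out.append(mask_char)
            hidden -= 1
        else:
            out.append(ch)  # separators and the trailing characters stay visible
    return "".join(out)


def mask_text(text, reveal=frozenset(), keep_last=4):
    """Mask each detected span unless the user chose to reveal that PII kind."""
    out, cursor = [], 0
    for kind, start, end in find_pii(text):
        if start < cursor:  # skip matches that overlap an earlier span
            continue
        out.append(text[cursor:start])
        value = text[start:end]
        out.append(value if kind in reveal else mask_value(value, keep_last))
        cursor = end
    out.append(text[cursor:])
    return "".join(out)


sample = "Aadhaar: 1234 5678 9012, PAN: ABCDE1234F, Phone: 9876543210"
print(mask_text(sample))
# Aadhaar: XXXX XXXX 9012, PAN: XXXXXX234F, Phone: XXXXXX3210
```

Passing `reveal={"pan"}` leaves PAN values untouched, which is one way to model the user-controlled reveal/hide choice; `keep_last=0` gives full masking.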

🏗️ Architecture

┌─────────────────────────────────────────────┐
│                   FRONTEND                  │
│   HTML • CSS • JavaScript (User Interface)  │
└─────────────────────────────────────────────┘
                    │
                    ▼
┌─────────────────────────────────────────────┐
│                    API LAYER                │
│      Django Backend + Node.js Services      │
└─────────────────────────────────────────────┘
                    │
                    ▼
┌─────────────────────────────────────────────┐
│                OCR & PII ENGINE             │
│   Tesseract OCR + Rule/Model-based PII DET  │
└─────────────────────────────────────────────┘
                    │
                    ▼
┌─────────────────────────────────────────────┐
│          ENCRYPTION & STORAGE LAYER         │
│      AES-256 Encryption + Blob Storage      │
└─────────────────────────────────────────────┘
                    │
                    ▼
┌─────────────────────────────────────────────┐
│              MANAGER ACCESS SYSTEM          │
│       Identity Validation + Secure Access   │
└─────────────────────────────────────────────┘
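For the Manager Access System layer, identity validation could look roughly like the sketch below. The README does not describe how manager IDs are actually checked, so this assumes a secret tied to the manager ID is stored as a salted PBKDF2 hash and compared in constant time. `hash_secret`, `verify_secret`, and `ITERATIONS` are hypothetical names for this example.

```python
import hashlib
import hmac
import os

ITERATIONS = 200_000  # assumed work factor; tune for the deployment hardware


def hash_secret(secret, salt=None):
    """Derive a salted PBKDF2-HMAC-SHA256 digest for storage."""
    if salt is None:
        salt = os.urandom(16)  # fresh random salt per manager
    digest = hashlib.pbkdf2_hmac("sha256", secret.encode("utf-8"), salt, ITERATIONS)
    return salt, digest


def verify_secret(secret, salt, expected_digest):
    """Recompute the digest and compare in constant time."""
    _, digest = hash_secret(secret, salt)
    return hmac.compare_digest(digest, expected_digest)


# Enrolment: store (salt, digest), never the raw secret.
salt, stored = hash_secret("MGR-001-passphrase")

# Login attempt: access is granted only when verification succeeds.
print(verify_secret("MGR-001-passphrase", salt, stored))  # True
print(verify_secret("wrong-passphrase", salt, stored))    # False
```

In a Django project the built-in authentication system and its password hashers would usually be preferred over hand-rolled hashing; the sketch only shows the underlying idea.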

🛠️ Technology Stack

Frontend

  • HTML
  • CSS
  • JavaScript

Backend

  • Django
  • Node.js
  • MySQL

Security

  • AES-256 encryption
  • Blob storage
  • Access control
  • Audit logs
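One possible way to make the audit logs listed above tamper-evident is to chain entries by hash, so that editing or deleting an earlier record invalidates every later one. The README does not say how logs are stored, so this is only a sketch under that assumption; `append_entry`, `verify_log`, and the entry fields are hypothetical.

```python
import hashlib
import json
from datetime import datetime, timezone

GENESIS = "0" * 64  # placeholder "previous hash" for the first entry


def _entry_hash(entry):
    """SHA-256 over a canonical JSON encoding of the entry body."""
    payload = json.dumps(entry, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def append_entry(log, actor, action, document_id):
    """Append an access record linked to the hash of the previous record."""
    entry = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "actor": actor,
        "action": action,
        "document_id": document_id,
        "prev_hash": log[-1]["hash"] if log else GENESIS,
    }
    entry["hash"] = _entry_hash(entry)  # hash is computed before the key is added
    log.append(entry)
    return entry


def verify_log(log):
    """Return True only if every entry's hash and back-link are intact."""
    prev_hash = GENESIS
    for entry in log:
        body = {k: v for k, v in entry.items() if k != "hash"}
        if entry["prev_hash"] != prev_hash or _entry_hash(body) != entry["hash"]:
            return False
        prev_hash = entry["hash"]
    return True


log = []
append_entry(log, "manager-17", "view", "doc-1")
append_entry(log, "manager-17", "download", "doc-1")
print(verify_log(log))  # True

log[0]["action"] = "none"  # simulate tampering with an old record
print(verify_log(log))  # False
```

A chain like this only detects modification; it does not prevent someone with write access from rebuilding the whole log, so the latest hash would still need to be anchored somewhere the manager cannot change.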

🎥 Demo Screenshots

Replace these with real app screenshots.

Upload UI

OCR + PII Detection

Manager Login

Masked Document


🔒 Security & Compliance

  • DPDP Act 2023 compliant
  • IT Act and SPDI Rules
  • Industry-standard AES-256 encryption
  • Zero-trust workflow

⚠️ Challenges

  1. DPDP compliance complexity
  2. Low user awareness
  3. Managerial misuse risks
  4. Cyberattacks despite encryption

🎯 Impact

  • Protects sensitive Indian identity documents
  • Reduces fraud & identity theft
  • Builds trust in digital document workflows
  • Useful for universities, HR, banks, govt offices

🚀 Future Enhancements

  • AI-based PII classification
  • Multi-language OCR
  • Role-based dashboards
  • Cloud storage encryption
  • Real-time redaction

📜 License

MIT License.

Languages

  • CSS 40.1%
  • Python 30.6%
  • HTML 27.7%
  • JavaScript 1.6%