- Team Name: shecodes
- Team Members: Sahithi Vemishetty, Sree Manaswini, Manogna Belli
- Submission Date: July 28, 2025
This repository contains our solutions for the Adobe India Hackathon 2025 under the theme "Connecting the Dots". The project focuses on intelligent PDF processing and persona-driven document analysis.
- Objective: Extract structured outlines (titles, headings, subheadings) from PDF documents.
- Folder:
round1a/ - Key Features:
- Fast and lightweight PDF parsing
- Hierarchical heading detection
- Structured JSON output
- Objective: Extract and rank relevant sections from multiple PDFs based on user persona and job-to-be-done.
- Folder:
round1b/ - Key Features:
- Multi-document ingestion and processing
- Persona-based section filtering
- Relevance-based ranking of extracted content
Ensure Docker is installed on your system.
cd round1a
docker build --platform linux/amd64 -t round1a-solution .
docker run --rm -v $(pwd)/input:/app/input -v $(pwd)/output:/app/output --network none round1a-solution