Scrapely - A Web Scraping Tool

Scrapely is a simple and customizable web scraper built using Java and Jsoup. It allows users to extract structured data from web pages based on CSS selectors and stores the results in a JSON file.

🚀 Features

✅ Extract Specific Elements – Users can define which elements to scrape using CSS selectors.
✅ Attribute Extraction – Option to extract specific attributes (e.g., href, src, alt).
✅ Automatic JSON Output – Saves extracted data in a structured JSON file.
✅ User Input Handling – Fully interactive, taking inputs via the command line.

🛠️ Tech Stack

Java – Core programming language
Jsoup – HTML parsing & web scraping library

📌 How to Use

1️⃣ Clone the Repository

git clone https://github.com/yourusername/scrapely.git
cd scrapely

2️⃣ Compile and Run

Make sure you have Java 8+ installed.

javac -cp .:jsoup-1.13.1.jar Main.java WebScraper.java
java -cp .:jsoup-1.13.1.jar Main

3️⃣ Provide Inputs

When prompted, enter the required details:

Website URL
CSS Query (class/tag)
Element to extract
(Optional) Attribute to extract

4️⃣ Check Output

The extracted data is stored in a JSON file named after the website title. Example output:

[
  { "text": "The Great Gatsby" },
  { "text": "1984" },
  { "text": "To Kill a Mockingbird" }
]

📜 License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.idea		.idea
src/main/java		src/main/java
.gitignore		.gitignore
All products \| Books to Scrape - Sandbox.json		All products \| Books to Scrape - Sandbox.json
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scrapely - A Web Scraping Tool

🚀 Features

🛠️ Tech Stack

📌 How to Use

1️⃣ Clone the Repository

2️⃣ Compile and Run

3️⃣ Provide Inputs

4️⃣ Check Output

📜 License

About

Releases

Packages

Languages

adarshpandey18/java-web-scrapper

Folders and files

Latest commit

History

Repository files navigation

Scrapely - A Web Scraping Tool

🚀 Features

🛠️ Tech Stack

📌 How to Use

1️⃣ Clone the Repository

2️⃣ Compile and Run

3️⃣ Provide Inputs

4️⃣ Check Output

📜 License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages