
Add Python actor templates for LLM-based web scraping frameworks #739

@vdusek

Summary

Add new Python actor templates for modern LLM-based web scraping frameworks. These frameworks leverage large language models for intelligent data extraction, making it easier to scrape unstructured or semi-structured web content.

Templates to add

  • Crawl4AI – LLM-friendly web crawler and scraper with built-in support for structured extraction
  • Scrapling – high-performance, adaptive web scraping library with intelligent content extraction
  • Browser Use – AI agent framework for browser automation using LLMs
  • ScrapeGraphAI – LLM-powered scraping pipelines using graph-based logic

Notes

  • Each template should follow the existing Python template conventions in this repo
  • Templates should include a reasonable default use case / example configuration
  • Consider which base Docker image is most appropriate for each (e.g. Playwright-based for browser-heavy frameworks)
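
A rough sketch of how the base-image note above could be reasoned about: mark each framework as browser-driven or not, then pick the image from that flag. The image names `apify/actor-python` and `apify/actor-python-playwright` are the ones I believe the existing Python templates build on (tags omitted on purpose). `TemplatePlan`, `PLANS`, `base_image`, and every `needs_browser` value are hypothetical and would need to be verified against each framework's actual requirements.

```python
from dataclasses import dataclass

# Base images assumed to match what existing Python templates use (tags omitted).
PLAIN_IMAGE = "apify/actor-python"
PLAYWRIGHT_IMAGE = "apify/actor-python-playwright"


@dataclass(frozen=True)
class TemplatePlan:
    """Hypothetical planning record for one proposed template."""

    name: str
    needs_browser: bool  # assumption, to be confirmed per framework


# The needs_browser flags below are illustrative guesses, not confirmed facts.
PLANS = [
    TemplatePlan("Crawl4AI", needs_browser=True),
    TemplatePlan("Scrapling", needs_browser=False),
    TemplatePlan("Browser Use", needs_browser=True),
    TemplatePlan("ScrapeGraphAI", needs_browser=True),
]


def base_image(plan: TemplatePlan) -> str:
    """Pick the Playwright image only when the framework drives a browser."""
    return PLAYWRIGHT_IMAGE if plan.needs_browser else PLAIN_IMAGE


if __name__ == "__main__":
    for plan in PLANS:
        print(f"{plan.name}: {base_image(plan)}")
```

If a framework turns out to bundle or download its own browser, the plain image plus an explicit install step in the Dockerfile may be a better fit than the Playwright image.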

🤖 Generated with Claude Code

Metadata

    Labels

    t-tooling: Issues with this label are in the ownership of the tooling team.
