[Feature] Web & YouTube Content Loader #12

Open
@Shen-po-heng

Description

  • Input: a website URL or a YouTube video URL.
  • For websites, extract the page text with BeautifulSoup or trafilatura.
  • For YouTube videos, fetch captions with youtube_transcript_api.
  • Parse the extracted text and split it into chunks for embedding and QA.
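
A minimal sketch of how the steps above might fit together, offered as a starting point rather than the intended design. The function names (`youtube_video_id`, `chunk_text`, `load_text`), the word-based chunking, and the default chunk sizes are all hypothetical. The YouTube branch assumes `youtube_transcript_api` 1.0 or later, where transcripts are fetched through an instance method; older releases expose a different call. The website branch uses trafilatura only; a BeautifulSoup variant would replace that branch.

```python
from typing import List, Optional
from urllib.parse import parse_qs, urlparse


def youtube_video_id(url: str) -> Optional[str]:
    """Return the video ID if `url` looks like a YouTube watch or youtu.be link."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if host == "youtu.be":
        return parsed.path.lstrip("/") or None
    if host in {"youtube.com", "www.youtube.com", "m.youtube.com"} and parsed.path == "/watch":
        return parse_qs(parsed.query).get("v", [None])[0]
    return None


def chunk_text(text: str, max_words: int = 200, overlap: int = 20) -> List[str]:
    """Split text into word-based chunks; consecutive chunks share `overlap` words."""
    if max_words <= 0 or not 0 <= overlap < max_words:
        raise ValueError("require max_words > 0 and 0 <= overlap < max_words")
    words = text.split()
    step = max_words - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        if start + max_words >= len(words):
            break  # the last chunk already reaches the end of the text
    return chunks


def load_text(url: str) -> str:
    """Fetch raw text for a URL: captions for YouTube, extracted body text otherwise."""
    video_id = youtube_video_id(url)
    if video_id:
        # Third-party import kept local so the helpers above work without it.
        # Assumption: youtube_transcript_api >= 1.0 (instance-based fetch).
        from youtube_transcript_api import YouTubeTranscriptApi

        transcript = YouTubeTranscriptApi().fetch(video_id)
        return " ".join(snippet.text for snippet in transcript)

    import trafilatura

    downloaded = trafilatura.fetch_url(url)  # None if the download fails
    if downloaded is None:
        return ""
    return trafilatura.extract(downloaded) or ""
```

With this shape, `chunk_text(load_text(url))` would yield the list of chunks to pass on to the embedding step. Word-based splitting is only one option; sentence-aware or token-aware chunking may suit the embedding model better.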
