enabled pdf scraper and fixed some bugs #109

TigranTigranTigran · 2025-08-08T12:45:47Z

Summary 📝

Enabled the PDF scraper and fixed some bugs (the PDF scraper now uses the correct input schema, plus a fix in the base scraper class).

Bugfixes 🐛

Enabled the scraper to be called in _execute_searches:

            try:
                assessment_output = await self.link_relevancy_assessor.arun(
                    assessor_input,
                )
                if self.debug:
                    logger.debug(
                        f"Relevancy assessment summary: {assessment_output.assessment_summary}",
                    )
                if self.web_scraper and assessment_output.filtered_results:
                    assessment_output.filtered_results = (
                        await self._fetch_full_content_for_high_relevancy(
                            assessment_output.filtered_results,
                        )
                    )
                return assessment_output.filtered_results
            except Exception as e:
                logger.warning(f"Error in relevancy assessment: {e}")

The PDF scraper now uses the correct input schema, PDFScraperInputSchema (previously it used ScraperToolInputSchema).
The httpx.AsyncClient call now follows redirects automatically (follow_redirects=True), so moved URLs no longer fail with a 301 response.

Checks

Closed #798
Tested Changes
Stakeholder Approval

enabled pdf scraper and fixed some bugs

50916c4

TigranTigranTigran requested review from NISH1001 and muthukumaranR August 8, 2025 12:45
