Skip to content

Problems crawling buildcanada.com #755

Open
@mveytsman

Description

@mveytsman

I have tried archiving the page with ArchiveWeb.page and it replays successfully within the app/ReplayWeb.page. When trying to crawl it with Browsertrix, the page does not replay succesfully:

Here's a screenshot of what I expect at https://buildcanada.com/memos
Image

Here's what I see in the replay

Image

I also experienced some text problems on a page that does load (https//buildcanada.com):

Image

You can see some of these in QA with the extracted text difference on this crawl

I ran this by @Shrinks99 and he ran a crawl using the beta channel and it delivered improved results but did not complete as successfully as ArchiveWeb.page.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    Status

    Triage

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions