Is the scraper pulling the full text of each post? #27

@ezramechaber

Here's the WH text:
https://www.whitehouse.gov/the-press-office/2017/02/03/presidential-executive-order-core-principles-regulating-united-states

And here's the scraper's text:
https://newsdiffs-wh.herokuapp.com/diff/78/71/https:/www.whitehouse.gov/the-press-office/2017/02/03/presidential-executive-order-core-principles-regulating-united-states

The version displayed on our site is missing most of the text of the order itself. Is that because the diff view hides passages that haven't changed (so they aren't worth displaying), or because the scraper isn't capturing the full page?
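To illustrate the first possibility, here is a minimal sketch of how a diff with limited context can hide unchanged text even when the full page was scraped. It uses Python's standard `difflib`; I don't know how this site actually renders its diffs, so the helper name `visible_diff_lines` and the sample text are hypothetical and only show the general behavior.

```python
import difflib


def visible_diff_lines(old_text, new_text, context=1):
    """Return the lines a context-limited unified diff would display.

    Hypothetical helper: this is NOT taken from the scraper's code base.
    """
    return list(
        difflib.unified_diff(
            old_text.splitlines(),
            new_text.splitlines(),
            lineterm="",
            n=context,  # number of unchanged lines kept around each change
        )
    )


# Ten-line stand-in for a scraped page, with one line edited in the new version.
old = "\n".join(f"Section {i}" for i in range(1, 11))
new = old.replace("Section 5", "Section 5 (revised)")

shown = visible_diff_lines(old, new)
for line in shown:
    print(line)
```

If the site works along these lines, most of an unchanged order would not appear in the diff view even though the stored text is complete. Comparing the length of the stored text for a version against the live page would separate the two explanations.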
