Fix Nature #887

Open
addie9800 wants to merge 4 commits into master from fix-nature

@addie9800 (Collaborator) commented Feb 7, 2026

Unfortunately, because the publisher coverage did not fail and the archive keeps redirecting, I can't accurately determine the valid_until date. All we know for sure is that it has not worked since the release of v0.5.5, due to qse errors.

@MaxDall (Collaborator) left a comment

@addie9800 Thanks a lot for updating the layout 👍


class V1_1(V1):
    _paragraph_selector = XPath(
        "//div[@data-test='main-content' or @class='main-content']//p"
    )

Collaborator

In this article [1], the title of the related article is extracted as a paragraph. In general, related-article titles are being extracted as paragraphs.

This article [2] couldn't be parsed.
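
A minimal sketch of how a broad `//p` selection can pick up related-article titles, assuming a simplified page structure. The sample markup and the `related-article` class are hypothetical and not taken from the actual pages; the standard-library ElementTree is used only to keep the sketch self-contained.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified markup; the real page structure may differ.
SAMPLE = """
<html><body>
<div data-test="main-content">
<p>First body paragraph.</p>
<aside class="related-article"><p>Title of a related article</p></aside>
<p>Second body paragraph.</p>
</div>
</body></html>
"""

root = ET.fromstring(SAMPLE.strip())
content = root.find(".//div[@data-test='main-content']")


def text_of(element):
    """Return the concatenated, stripped text of an element."""
    return "".join(element.itertext()).strip()


# Broad selection: every <p> below the content container,
# which also matches the <p> inside the related-article block.
broad = [text_of(p) for p in content.iter("p")]

# Narrow selection: collect the <p> elements inside the (hypothetical)
# related-article blocks and skip them.
excluded = {
    p
    for aside in content.iter("aside")
    if aside.get("class") == "related-article"
    for p in aside.iter("p")
}
narrow = [text_of(p) for p in content.iter("p") if p not in excluded]

print(broad)
print(narrow)
```

With a full XPath engine, the same idea could be expressed as a predicate on the paragraph step, for example `//p[not(ancestor::aside)]`, again assuming the related articles sit in a distinguishable wrapper element.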

Collaborator Author

Thanks for catching that, it should be fixed now 👍

Collaborator

@addie9800 Unfortunately, the plaintext of [1] is now cut off.

Collaborator Author

@MaxDall Unfortunately, I cannot reproduce that. The last extracted sentence in my test is “It’s not crazy. It’s really exciting.”, as expected. Or what do you mean by “cut off”?

Collaborator

@addie9800 Hmm, that's weird. For me the last paragraph is the one before the Reinventing MRI subheadline, ending with “It’s a tiny change on a tiny scale.”
