
prerelease: minor version #74

Merged
BobBorges merged 94 commits into main from dev on Oct 6, 2025

@BobBorges
Contributor

No description provided.

BobBorges and others added 30 commits August 29, 2025 14:14
@mandlilaast
Contributor

Is the check failing because of the pyriksdagen.utils import of version_number_is_valid?

Contributor

@mandlilaast left a comment

Great work overall! Just a few notes (I am probably repeating myself, but just in case):

  1. “Lika lydande…” – This phrase should not be part of any motion title in the sampled cases (e.g., data/1962/mot-1962--fk--00552.xml, data/1953/mot-1953--fk--00110.xml). It can be safely ignored when extracting titles.
  2. mot-1946--fk--00217.xml (diff at line 148) – The content in the PDF (pages 3–4: link, link) refers to attachments (“bilagor”), not the actual motion title. So, in my opinion, we shouldn’t classify it as the title?
  3. Split titles over multiple segments – Some motion titles span multiple <p> elements (e.g., mot-1907--fk--00042.xml). Right now, only one segment is classified as the title, although the title itself spans two segments.

Otherwise, everything looks really solid! I count 16/17, treating points 1 and 3 as improvements rather than mistakes.
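
Points 1 and 3 above could be handled in a post-processing step along these lines. This is only a rough sketch, not the pipeline's actual code: the (label, text) pair layout, the "title" label name, and the function merge_title_segments are all hypothetical, and it assumes that a segment starting with "Lika lydande" is never part of a title.

```python
import re

# Assumption: a segment that starts with "Lika lydande" is a cross-reference
# to an identical motion and should never count as part of the title.
LIKA_LYDANDE = re.compile(r"^\s*lika\s+lydande", re.IGNORECASE)


def merge_title_segments(segments):
    """Join runs of consecutive title segments into single titles.

    `segments` is a hypothetical list of (label, text) pairs in document
    order, one pair per <p> element.
    """
    titles, current = [], []
    for label, text in segments:
        is_title = label == "title" and not LIKA_LYDANDE.match(text)
        if is_title:
            # Normalize internal whitespace before joining segments.
            current.append(" ".join(text.split()))
        elif current:
            # A non-title segment closes the current run.
            titles.append(" ".join(current))
            current = []
    if current:
        titles.append(" ".join(current))
    return titles


example = [
    ("title", "Om ändring i"),
    ("title", "  lagen om  skatt"),
    ("title", "Lika lydande motion har väckts"),
    ("body", "Text"),
]
merged = merge_title_segments(example)
```

In the made-up example, the two adjacent title segments are joined into one title and the "Lika lydande" segment is dropped. Words hyphenated across segments are not rejoined here and would need separate handling.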

add goldstandard quality estimate test and results...
@MansMeg
Contributor

MansMeg commented Oct 3, 2025

Tests are still failing.

@BobBorges BobBorges merged commit e794e3b into main Oct 6, 2025
3 participants