Skip to content

Releases: swerik-project/riksdagen-records

v1.6.1alpha

24 Mar 11:13
dc6a269

Choose a tag to compare

v1.6.1alpha Pre-release
Pre-release

Data formats

records.zip includes the complete records in the ParlaClarin XML format.

records_speeches_DECADE.ndjson.gz includes the speeches in the records in newline delimited JSON format, aggregated by decade. These are compressed via gzip.

Quality estimates

The quality estimates are available in the quality.zip archive.

References

A list of references can be found in the reference-list.bib file.

v1.6.0

13 Mar 09:47
dc6a269

Choose a tag to compare

Data formats

records.zip includes the complete records in the ParlaClarin XML format.

records_speeches_DECADE.ndjson.gz includes the speeches in the records in newline delimited JSON format, aggregated by decade. These are compressed via gzip.

Quality estimates

The quality estimates are available in the quality.zip archive.

References

A list of references can be found in the reference-list.bib file.

What's Changed

New features and data

  • Add modern pagenumbers (applying the add_modern_pagenumbers.py to the curation) by @mandlilaast in #163
  • Add modern pagenumbers (applying the add_modern_pagenumbers.py to the curation) by @mandlilaast in #163
  • Curate protocols step 2: fix add_uuid.py related problems. by @mandlilaast in #165
  • Protocol curation step 3: add links to pdf pages by @mandlilaast in #167
  • Curation of 2023-2025 protocols step 4: find the dates by @mandlilaast in #168
  • Feat: classify_note_seq based on the found script from older branch by @mandlilaast in #170
  • 20232425 protocols (step 8): Feat: add uuid after classifying paragraphs into notes and utterances. by @mandlilaast in #171
  • Protocol 202324 and 202425 curation (step 9): map introductions to the speaker in the metadata. by @mandlilaast in #172
  • 2023-25 Protocol curation step 10: split protocols into
    sections by @mandlilaast in #173
  • Feat: classify titles in protocols by @mandlilaast in #174
  • Last step of curating the 20232425 protocols by @mandlilaast in #175
  • Merge utterances by @mandlilaast in #186
  • Feat: annotate-speeches.py by @mandlilaast in #194
  • Add 2023/24 and 2024/25 protocols to the corpus by @mandlilaast in #176
  • feat: merge consecutive utterances and add next/prev tags by @ninpnin in #198
  • feat: Add newline delimiter JSON as a release file format by @ninpnin in #201

Bug fixes

Misc. chores

Full Changelog: v1.5.0...v1.6.0

v1.5.1alpha3

12 Mar 12:26

Choose a tag to compare

v1.5.1alpha3 Pre-release
Pre-release

Data formats

records.zip includes the complete records in the ParlaClarin XML format.

records_speeches_DECADE.ndjson.gz includes the speeches in the records in newline delimited JSON format, aggregated by decade. These are compressed via gzip.

Quality estimates

The quality estimates are available in the quality.zip archive.

References

A list of references can be found in the reference-list.bib file.

v1.5.1alpha2

12 Mar 10:16

Choose a tag to compare

v1.5.1alpha2 Pre-release
Pre-release

Preprocessed easy-to-use formats

Download persons_csv.zip, persons.xlsx or persons.sqlite to easily access preprocessed data.

Normal form DB for more complex processing

The persons.zip archive contains the original unmerged tables as CSVs for more complex processing.

Quality estimates

The quality estimates are available in the quality.zip archive.

References

A list of references can be found in the reference-list.bib file.

v1.5.1-alpha

27 Feb 09:15
35dcd99

Choose a tag to compare

v1.5.1-alpha Pre-release
Pre-release

Full Changelog: v1.5.0...v1.5.1-alpha

v1.5.0

18 Dec 13:42
4b454e4

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v1.4.2...v1.5.0

v1.4.2

16 Apr 08:09
084e840

Choose a tag to compare

What's Changed

  • feat: heuristically split merged margin notes and bodytext into two paragraphs by @ninpnin in #98
  • feat: heuristically split merged margin notes and bodytext into two paragraphs by @ninpnin in #100
  • prerelease: patch version by @BobBorges in #101

Full Changelog: v1.4.1...v1.4.2

v1.4.1

21 Mar 08:19
adfa693

Choose a tag to compare

What's Changed

Full Changelog: v1.4.0...v1.4.1

v1.4.0

14 Feb 15:03
449605d

Choose a tag to compare

What's Changed

Full Changelog: v1.3.0...v1.4.0

v1.3.0

15 Jan 15:57
569ba31

Choose a tag to compare

What's Changed

New Contributors

  • @ljo made their first contribution in #55

Full Changelog: v1.2.0...v1.3.0