Skip to content

Latest commit

 

History

History
15 lines (9 loc) · 263 Bytes

File metadata and controls

15 lines (9 loc) · 263 Bytes

swe-article-extraction-benchmark

Benchmark over extractions of articles in Swedish

Usage

prepare-raw-html

This tool us used to prepare raw html to dataset.

Example

prepare-raw-html raw-html/files.jsonl data/dataset_sample.jsonl