This repository contains the results of an analysis of long tails for outputs of the STARK tool for dependency extraction. The files contained are as follows:
-
all_treebanks_dataframe.tsv- A table containing for all UD v2.13 languages counts of dependency trees with a frequency of 1, counts of dependency trees with a frequency of 2 or more, total number of extracted dependency trees and percent of all extracted trees that have a frequency of 1 -
SL_SSJ_STARK_output.txt- Output of the STARK extraction tool for the Slovenian SSJ treebank -
config.ini- Config file used with STARK to obtain the desired results