Maybe put all into a single data-set (data.frame), with additional attributes 'category' (success, failure, ...) and 'score' (indicating how well it fits the category, to be used as sampling weights).
Also need to find a place for the code to update / construct the proverbs data-set.