Judge the error rate of the different filters: we know that they are blocking a portion of the URLs that have been submitted, but is this in line with the filter settings? Could do this by taking a random sample of blocked URLs (say 100) and getting people to check whether they should be blocked (or not) and under what conditions.