Information on how these datasets were chosen

Hi there,

I realize this benchmark is a few years old now, but can you explain how these datasets from OpenML were selected for this benchmark?  If they were not randomly selected (using a seed, sampling from OpenML ids), then it would be good to know how/why each dataset was chosen to be included in the benchmark.  Thanks!