Hi there,
I realize this benchmark is a few years old now, but could you explain how the OpenML datasets were selected for it? If they were not randomly selected (e.g., by sampling OpenML dataset IDs with a fixed seed), it would be good to know how and why each dataset was chosen for inclusion. Thanks!