Because we are harvesting prefixes from all over the place and we can not guarantee these will not collide, we need some unique identifiers at the level of the dataset upon which we can hang the metadata. Wherever possible we will present these in the UI using only existing identifiers (eg. those from identifiers.org, biosharing, etc.).