-
Notifications
You must be signed in to change notification settings - Fork 41
Description
IYP currently integrates a lot of datasets from OpenINTEL but these have a lot of overlap because they are in fact from the same data source. Which is kind of contradicting with our guidelines and make the database unnecessarily big..
For example we have three forward dns datasets:
openintel.tranco1mopenintel.umbrella1mopenintel.crux
These could be merged intp only one dataset, that could be nammed openintel.toplist following OpenINTEL taxonomy (https://openintel.nl/download/forward-dns/basis=toplist/).
To be more consistent with OpenINTEL dataset, we can also consider adding Majestic and Cloudflare top 1M rankings to IYP. (to be addressed in separate issues)
The dns graph datasets could also be consolidated into one dataset (openintel.dnsgraph):
openintel.dnsgraph_nlopenintel.dnsgraph_rdnsopenintel.dnsgraph_crux
The openintel.infra_ns shouldn't be consolidated with others because it is already unique.