Spatial Adaptive Agglomerative Aggregation (SA3) clustering #482
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Hi all,
We implemented an algorithm to delineate contiguous areas within cities that have identical characteristics and configurations of buildings and streets, but we thought it might be useful for other applications since the procedure is quite generic.
The idea is that is a kind of spatially restricted HDBSCAN, so there is only one parameter to specify - the minimum number of observations to form a cluster. The procedure basically consists of two steps: first, carrying out a full spatially, restricted
sklearn.cluster.AgglometariveClustering
clustering; and second, extracting clusters from the resulting linkage matrix, using density-clustering extraction algorithms - Excess of Mass or Leaf. This results in multiscale (clusters have varying ranges of internal similarity), contiguous clusters with noise (some observations are not attached to any clusters).I try to explain more how it works, examples and advantages and disadvantages in the
sa3.ipynb
notebook.