Skip to content

Commit 26b46ce

Browse files
KMeans clustering algorithm (apache#13604)
Implement Kmeans clustering algorithm for vectors. Knn algorithms that further reduce memory usage of vectors (such as Product Quantization, RaBitQ etc) require clustering of vectors. This implements KMeans clustering algorithm. Co-authored-by: Jim Ferenczi [email protected]
1 parent cb8bc75 commit 26b46ce

File tree

6 files changed

+692
-0
lines changed

6 files changed

+692
-0
lines changed

lucene/CHANGES.txt

+2
Original file line numberDiff line numberDiff line change
@@ -137,6 +137,8 @@ New Features
137137
* GITHUB#13597: Align doc value skipper interval boundaries when an interval contains a constant
138138
value. (Ignacio Vera)
139139

140+
* GITHUB#13604: Add Kmeans clustering on vectors (Mayya Sharipova, Jim Ferenczi, Tom Veasey)
141+
140142
Improvements
141143
---------------------
142144

lucene/sandbox/src/java/module-info.java

+1
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,7 @@
2222

2323
exports org.apache.lucene.payloads;
2424
exports org.apache.lucene.sandbox.codecs.idversion;
25+
exports org.apache.lucene.sandbox.codecs.quantization;
2526
exports org.apache.lucene.sandbox.document;
2627
exports org.apache.lucene.sandbox.queries;
2728
exports org.apache.lucene.sandbox.search;

0 commit comments

Comments
 (0)