@@ -107,6 +107,59 @@ Sample output:
107107- HNSW Orphan Labels - this must always be an empty set; if you see anything else, report it
108108 in [Discord](https://discord.gg/MMeYNTmh3x).
109109
110+ #### How to Read the Output
111+
112+ **General Info**
113+
114+ This section presents general information about the Chroma persistent dir.
115+
116+ - Chroma Version - the currently installed Chroma version.
117+ - Number of Collection - the number of collections in the persistent dir.
118+ - Persist Directory - the path to the persistent dir (shown only if privacy mode is off).
119+ - Persist Directory Size - the size of the persistent dir.
120+ - SystemDB size - the size of the system database (if privacy mode is off, the full path to the sqlite3 file is also shown).
121+ - Orphan HNSW Directories - a list of orphan HNSW directories. These directories are present in the persistent dir but
122+ are not associated with any collection.
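
The orphan directory check described above can be pictured as a set difference. The following is a minimal sketch, not the tool's actual implementation: it assumes each HNSW segment lives in a subdirectory of the persistent dir named after its segment ID, and the function name `find_orphan_dirs` and its parameters are hypothetical.

```python
from pathlib import Path
from typing import Iterable, List


def find_orphan_dirs(persist_dir: Path, known_segment_ids: Iterable[str]) -> List[str]:
    """Return names of subdirectories that match no known segment ID.

    Assumption: segment directories are named after their segment ID.
    """
    known = set(known_segment_ids)
    return sorted(
        entry.name
        for entry in persist_dir.iterdir()
        # Plain files (for example the sqlite3 database) are ignored.
        if entry.is_dir() and entry.name not in known
    )
```

Any name returned by such a check would correspond to an entry in the Orphan HNSW Directories list.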
123+
124+ **Collections**
125+
126+ - ID - the collection ID.
127+ - Name - the collection name.
128+ - Metadata - the metadata associated with the collection.
129+ - Dimension - the dimension of the embeddings in the collection (this can be None if the collection is newly created
130+ and no vectors are present).
131+ - Tenant - the tenant of the collection.
132+ - Database - the database of the collection.
133+ - Records - the number of records in the collection.
134+ - WAL Entries - the number of WAL entries in the collection (as of 0.5.5, for new instances, Chroma periodically cleans
135+ the WAL for each collection).
136+
137+ **Metadata Segment**
138+
139+ - Segment ID - the segment ID.
140+ - Type - the segment type.
141+ - Scope - the segment scope.
142+ - SysDB Max Seq ID - the maximum sequence ID in the system database.
143+
144+ **HNSW Segment**
145+
146+ - Segment ID - the segment ID.
147+ - Type - the segment type.
148+ - Scope - the segment scope.
149+ - Path - the path to the HNSW directory.
150+ - SysDB Max Seq ID - the maximum sequence ID in the system database.
151+ - HNSW Dir Size - the size of the HNSW directory.
152+ - HNSW Metadata Max Seq ID - the maximum sequence ID in the HNSW metadata.
153+ - HNSW Metadata Total Labels - the total number of labels in the HNSW metadata.
154+ - WAL Gap - the difference between the maximum sequence ID in the system database and the maximum sequence ID in the
155+ HNSW metadata. The gap usually represents the number of WAL entries that have not yet been committed to the HNSW
156+ index.
157+ - HNSW Raw Total Active Labels - the total number of active labels in the HNSW index.
158+ - HNSW Raw Allocated Labels - the total number of allocated labels in the HNSW index.
159+ - HNSW Orphan Labels - a set of orphan labels in the HNSW index. These are labels in the HNSW index that are not visible
160+ to Chroma because they are not part of the metadata. This set should always be empty; if it is not, please report it.
161+ - Fragmentation Level - the fragmentation level of the HNSW index.
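
Two of the fields above are derived values, and a small sketch may make them easier to read. It only illustrates the definitions given in the list, under the assumption that the sequence IDs are plain integers and the labels are sets of integers. The function names and sample numbers are hypothetical and are not taken from the tool.

```python
from typing import Set


def wal_gap(sysdb_max_seq_id: int, hnsw_metadata_max_seq_id: int) -> int:
    """WAL Gap: SysDB Max Seq ID minus HNSW Metadata Max Seq ID."""
    return sysdb_max_seq_id - hnsw_metadata_max_seq_id


def orphan_labels(index_labels: Set[int], metadata_labels: Set[int]) -> Set[int]:
    """Labels present in the HNSW index but missing from the HNSW metadata."""
    return index_labels - metadata_labels


# Illustrative values only.
print(wal_gap(1200, 1150))                   # 50
print(orphan_labels({1, 2, 3}, {1, 2, 3}))   # set()
```

A gap of 0 would suggest that the index has caught up with the WAL, and an empty set is the expected result for orphan labels.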
162+
110163### WAL Commit
111164
112165This command ensures your WAL is committed to binary vector index (HNSW).