
Commit 22fa63b

committed
More information about how kafka minion works
1 parent 8386b38 commit 22fa63b


1 file changed

+2
-2
lines changed


README.md

Lines changed: 2 additions & 2 deletions
@@ -106,13 +106,13 @@ Below metrics have a variety of different labels, explained in this section:

## How does it work

109-
At a high level Kafka Minion fetches source data in two different ways.
109+
At a high level, Kafka Minion fetches data from two different sources (see below) and derives many of its metrics by combining these datasets. For instance, it compares a partition's high water mark with a consumer group's current offset to calculate the lag on that partition. Invoking the `/metrics` endpoint starts the calculation of these metrics on a snapshot of the current data.
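
The lag calculation mentioned above can be sketched in a few lines of Go. This is a minimal illustration, not Kafka Minion's actual code: the function name `partitionLag` is hypothetical, and clamping negative results to zero is an assumption made here because the two inputs may be read at slightly different times.

```go
package main

import "fmt"

// partitionLag is a hypothetical helper: it returns how far a consumer
// group is behind on a single partition.
//   highWaterMark   - the partition's high water mark reported by the broker
//   committedOffset - the group's current (committed) offset for that partition
func partitionLag(highWaterMark, committedOffset int64) int64 {
	lag := highWaterMark - committedOffset
	if lag < 0 {
		// Assumption: the two values come from different sources and may be
		// slightly out of sync, so a negative difference is reported as 0.
		return 0
	}
	return lag
}

func main() {
	fmt.Println(partitionLag(1500, 1420)) // prints 80
}
```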

- **Consumer Group Data:** Since Kafka version 0.10, ZooKeeper is no longer in charge of maintaining the consumer group offsets. Instead, Kafka itself uses an internal topic called `__consumer_offsets`. Messages in that topic are binary, and the protocol may change with broker upgrades. On each successful offset commit from a consumer group member, a message is produced to that topic. The message key is a combination of the `groupId`, `topic` and `partition`. The value is the offset index.

The `__consumer_offsets` topic is a compacted topic. Once an offset expires, Kafka produces a tombstone for the given key, which Kafka Minion uses to delete the offset information as well. Kafka Minion therefore has to consume all messages from the earliest offset, so that it gets all consumer group offsets which have not yet expired.

115-
2. **Broker requests:** Brokers are being queried to get topic metadata information, such as partition count, topic configuration, low & high water mark.
115+
- **Broker requests:** Brokers are queried for topic metadata, such as partition count, topic configuration, and low & high water marks.
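
To illustrate how the compacted `__consumer_offsets` data might be tracked in memory, here is a rough Go sketch. The types `offsetKey` and `offsetStore` and the method `apply` are hypothetical names for this example, and it assumes messages have already been decoded from their binary form; a `nil` value stands in for a tombstone.

```go
package main

import "fmt"

// offsetKey mirrors the message key described above:
// a combination of group id, topic and partition.
type offsetKey struct {
	Group     string
	Topic     string
	Partition int32
}

// offsetStore (hypothetical) holds the latest known offset per key.
type offsetStore map[offsetKey]int64

// apply processes one already-decoded message in log order.
// A nil value represents a tombstone and removes the key.
func (s offsetStore) apply(key offsetKey, value *int64) {
	if value == nil {
		delete(s, key)
		return
	}
	s[key] = *value
}

func main() {
	store := offsetStore{}
	key := offsetKey{Group: "billing", Topic: "orders", Partition: 0}

	first, second := int64(10), int64(42)
	store.apply(key, &first)
	store.apply(key, &second) // a later commit overwrites the earlier one
	fmt.Println(store[key])   // prints 42

	store.apply(key, nil) // tombstone: the offset has expired
	_, found := store[key]
	fmt.Println(found) // prints false
}
```

Replaying messages from the earliest offset through a function like `apply` leaves exactly the offsets that have not been removed by a tombstone, which is why the full topic has to be consumed.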

## FAQ
