We are using Kafka Indexing Service to ingest data.
Now we want to move the data from one Druid cluster to another Druid cluster (with a different metadata DB and deep storage).
The steps are listed here:
- Shut down the Kafka supervisor
- Copy the segments to the new Druid cluster
- Update the metadata using the InsertSegment tool
- Start the Kafka supervisor on the new Druid cluster
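
For reference, the first and last steps can be sketched roughly as follows against the Overlord's supervisor HTTP API. This is only a minimal sketch: the Overlord URLs and the helper names `shutdown_request` and `submit_request` are placeholders of mine, the supervisor ID is assumed to be the dataSource name, and newer Druid releases expose `/terminate` in place of `/shutdown`. The functions only build the requests; nothing is sent.

```python
import json
import urllib.request

# Hypothetical Overlord addresses; replace with the real hosts.
OLD_OVERLORD = "http://old-overlord:8090"
NEW_OVERLORD = "http://new-overlord:8090"


def shutdown_request(overlord_url, supervisor_id):
    """Build the POST that asks an Overlord to stop a running supervisor."""
    url = f"{overlord_url}/druid/indexer/v1/supervisor/{supervisor_id}/shutdown"
    return urllib.request.Request(url, data=b"", method="POST")


def submit_request(overlord_url, supervisor_spec):
    """Build the POST that submits a supervisor spec (a dict) to an Overlord."""
    body = json.dumps(supervisor_spec).encode("utf-8")
    return urllib.request.Request(
        f"{overlord_url}/druid/indexer/v1/supervisor",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Passing either request to `urllib.request.urlopen(...)` would actually send it, for example `urllib.request.urlopen(shutdown_request(OLD_OVERLORD, "my_datasource"))`, where `my_datasource` is a made-up name.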
The problem is that the two Druid clusters use the same Kafka cluster. How can I find out which offsets have already been consumed, and how do I set the starting offsets for the new Kafka supervisor?
I believe this is saved in the metadata DB, but I can only find the start offsets, in the druid_tasks table, like this:
"startPartitions" : {
  "partitionOffsetMap" : {
    "11" : 3,
    "2" : 3,
    "5" : 3,
    "8" : 3
  },
  "topic" : "topic1"