Our initial testing has been going great, especially since the new Kafka indexing service. The tests have been running on a single computer, and now we want to take this a step further by moving to a production-like environment.
I’ve read the cluster documentation and my understanding is something like this:
2x Historical+MiddleManager (CPU+RAM)
1x Broker (CPU+RAM)
We’re not going for full fault tolerance in the beginning, but it’s my understanding that we could easily expand by cloning one of the above services when needed?
Not mentioned yet are:
1x MySQL metadata storage
ZooKeeper cluster (use the Coordinator+Overlord machines at the beginning?)
Kafka cluster (use the Coordinator+Overlord machines? The Historical+MiddleManager machines?)
How will Druid handle the new Kafka indexing service when it runs across several nodes?