Hi,
I am currently using docker-druid image for demo purposes. I have deployed this on a Kubernetes cluster on IBM cloud. My application workflow works fine in which events are persisted to druid. However, In the kubernetes environment the pods keep going down and comes back up automatically (handled by kubernetes) resulting in loss of data. This could be tacked by using HDFS for storage. Not sure how zookeeper needs to be handled.
I learned that docker-druid image isn’t suitable for production. So is the imply docker image as per the official Imply documentation:
“This is a Dockerized version of Imply (https://imply.io/get-started) designed for easily running the quickstart (single-machine, non-clustered). It isn’t currently supported for use in production; for that, we still recommend using the downloadable distribution”
Given this, can we deploy druid reliably using docker on a kubernetes cluster? Are there any best practices for doing this?
Thanks,
Prathamesh