I am evaluating Druid for large-scale OLAP alongside tools like Caravel or Metabase, but I can't find any way to secure the cluster. I can imagine isolating the cluster behind firewalls and relying on Caravel for authentication and permissions, but my problem is data ingestion.
I am currently using Spark and Kafka, but neither of them lets me implement secure data ingestion. The Kafka connector doesn't support Kafka's secured API, and Spark's BeamRDD is a push-based approach, so I would need to allow access from the Spark workers, which is insecure for multi-tenant Spark clusters.
How do you deploy Druid and use it in a secure way?