I am trying to ingest data into Druid and using HDFS as deep storage. I have 5 nodes in the cluster and my hdfs storage directory is “/data” on every node ( so there are directories like ‘/data/dfs’ , /data/dfs/nn’, ‘/data/dfs/dn’ on different nodes).
What should be the value of the property druid.storage.storageDirectory ?
Does Druid automatically understand that HDFS is spread over 5 nodes and also the namenodes and datanodes.
How does Druid interact with the HDFS?
Where is the data stored by the Realtime and Historical nodes?