I want to set up a cluster providing indexing service. After reading the Druid doc and questions/answers in the Druid User group, I still have the following questions:
- If middlemanager and overlord are in separate nodes, do we have to use HDFS or S3? My HDFS is not ready yet so I want to see if a shared NFS directory can be used instead.
- Middlemanager can spawn peons to ingest different datasources. To post event to peon, should we always use the url pattern as http://peonHost:port/druid/worker/v1/chat//push-events/, where peon’s host/port is the same as middlemanager’s host/port?
- Can we post json array to the url in case even producer buffers events somehow.