I understood that the following approaches to load from kafka are available, perhaps more:
ingest into realtime nodes (not sure how)
use map reduce from kafka InputFormat to druid batch side
use tranquility? ( I understood it is an adaptor for push based streaming solutions such as storm, not sure how it would work with a pull based one such as kafka)
Related to that, I understood realtime nodes are optional, and more recent options are available.
We are not keen on realtime, but every 10 minutes would be fine (focusing on batch and low cost).
Input wil likely go thru kafka, directly or indirectly, as we will likely use it as a bus for all systems.