Local Batch Ingestion issues?

We’re looking to deploy a more meaty prototype of our system that uses Druid, and were wondering what the issues around local batch ingestion are. The proof of concept I built over the summer used kafka ingestion, but we really don’t need real-time ingestion at this point. It seems to me that local batch ingestion would be the quickest, simplest method for pulling in large chunks of data that other parts of our system generate.

So … what are the problems with that plan?

Thanks in advance, you guys have been great while I’ve been exploring druid!

Ron Hay

Batch ingestion doesn’t really have any hidden gotchas. Batch ingestion without hadoop is slow.