Aggregating delayed data


We are collecting real time data and aggregating those data on hourly basis.

But sometimes we are getting few delayed raw data,

so is there any way we can add up those data to already ran Aggregation task…

Thanks and Regards,

Amit Kumar

Hey Amit,

In Druid, if you’re reading from streams (Kafka/Kinesis indexing) then late data is automatically incorporated. For storage efficiency’s sake, you might want to run compaction from time to time (or enable auto compaction) but this is not necessary for correctness.