Thanks Itai. Got it now. Just to clarify - in the Spark package you are ensuring that the data is ordered by time for ingestion into Druid. Correct?
Tarun
Yes, and I also remove events that are out of order.
Thus, the output from Spark (which is the input to the ingestion task) contains only the events that are in order (i.e., that match the funnel's definition).
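
To make the idea concrete, here is a minimal sketch in plain Python of what "sort by time and drop out-of-order events" could look like. It is not the actual code from the Spark package: the funnel steps, the `(user_id, timestamp, step)` event shape, and the function name `keep_ordered_events` are all made up for illustration.

```python
from collections import defaultdict

# Hypothetical funnel definition: steps must occur in this order.
FUNNEL_STEPS = ["view", "add_to_cart", "purchase"]


def keep_ordered_events(events, funnel_steps=FUNNEL_STEPS):
    """Return events sorted by time, dropping those that break funnel order.

    Each event is a (user_id, timestamp, step) tuple (an assumed shape).
    """
    next_step = defaultdict(int)  # user_id -> index of the next expected step
    kept = []
    # Sort by timestamp so the output is time-ordered for ingestion.
    for user_id, ts, step in sorted(events, key=lambda e: e[1]):
        idx = next_step[user_id]
        # Keep the event only if it is the step this user is expected to hit next.
        if idx < len(funnel_steps) and step == funnel_steps[idx]:
            kept.append((user_id, ts, step))
            next_step[user_id] = idx + 1
    return kept


events = [
    ("u1", 3, "purchase"),
    ("u1", 1, "view"),
    ("u2", 2, "add_to_cart"),  # out of order: u2 has no earlier "view"
    ("u1", 2, "add_to_cart"),
    ("u2", 4, "view"),
]
ordered = keep_ordered_events(events)
```

Here `ordered` keeps all three of u1's events in time order, drops u2's early `add_to_cart`, and keeps u2's later `view`. In Spark the same logic would presumably run per user after grouping, but that part is an assumption on my side.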