We have the following usecase which we are having problem realising in Druid.
We have a datasource called
store_visits with the dimensions
We want to be able to answer the question:
“How many times users transitioned from store “A” to store “B” (in that order and no matter if there were intermediary store visits)”
So for example if the user path was “A->B->C->A->D->B”, we would have two transitions (“A->B” and “A->D->B”).
Any ideas are appreciated.
What you are asking for is an ‘ordered funnel’ sort of analysis. If you wanted ‘unordered’ I would have asked you to check out Theta Sketches, which are great for that kind of thing. But you want ordered, so never mind that.
For the time being I’d suggest writing a Druid extension. If you write an aggregator as an extension, you can define any sort of behavior and intermediary data format that you want. For examples, check out: https://github.com/implydata/druid-example-extension
Hope this helps, and of course, stay tuned for future Druid releases that include more array functions!!