Fast Online Analytical Processing for Big Data Warehousing (Research Study benchmarking Druid)

Dear colleagues,

I recently published this paper (Fast Online Analytical Processing for Big Data Warehousing) with the main goal of benchmarking Druid and give some recommendations about its use to implement a Big Data Warehouse.

The paper tests different possible configurations of data volumes, with different schemas and different data organizations, in terms of “segment granularity”, “query granularity” and “hashed partitions”.

The paper uses the Well Know Star Schema Benchmark.

I hope the paper could fill the gap in the literature regarding Druid and help his community as well.


Please feel free to consult it and to give your feedback.

Best regards,

José Correia