Unable to connect Apache Druid with Azure HDInsight Hadoop and HDFS

I’ve two use-cases -

  1. Import data into Druid from HDFS cluster on HDInsight
  2. Use Hadoop cluster for indexing and importing for faster imports.

I was referring to following pages for setting up the connection -


I’m getting exceptions related to hadoop-azure classes. Hadoop version on HDInsight is 3.1.1. To resolve the dependency, I followed the following steps -

  1. Copied core-site.xml, hdfs-site.xml and yarn-site.xml

  2. Copied all the jars from Hadoop instance to hadoop-dependencies/hadoop-client/3.1.1/ on hadoop machines

  3. In middlemanager config, I added ruid.indexer.runner.javaOpts=… -Dhadoop.mapreduce.job.classloader=true

  4. I also set the property - druid.indexer.task.defaultHadoopCoordinates=

But I keep getting the same exception while restarting the druid service. Seems like Druid is not taking my new hadoop dependencies during start.

Please suggest some solution for this.