Hello,
am new to druid, to use Azure Data lake as deep storage, i followed this link Microsoft Azure · Apache Druid and made changes to common.runtime.properties file.
in unified console when i try to connect to data by clicking Azure data lake, and after providing the URI i get following error.
Blockquote
Error: Cannot construct instance of org.apache.druid.data.input.azure.AzureInputSource
, problem: Invalid URI scheme [https://storage.blob.core.windows.net/databsesnew/output_csv/events.csv] must be [azure] at [Source: (org.eclipse.jetty.server.HttpInputOverHTTP); line: 1, column: 190] (through reference chain: org.apache.druid.indexing.overlord.sampler.IndexTaskSamplerSpec[“spec”]->org.apache.druid.indexing.common.task.IndexTask$IndexIngestionSpec["ioConfi
Blockquote
Here is copy of my common.runtime.properties file
Blockquote
druid.extensions.loadList=[“druid-azure-extensions”, “druid-hdfs-storage”, “druid-kafka-indexing-service”, “druid-datasketches”]
druid.host=localhost
Logging
Log all runtime properties on startup. Disable to avoid logging properties on startup:
druid.startup.logging.logProperties=true
Zookeeper
druid.zk.service.host=localhost
druid.zk.paths.base=/druid
Metadata storage
For Derby server on your Druid Coordinator (only viable in a cluster with a single Coordinator, no fail-over):
druid.metadata.storage.type=derby
druid.metadata.storage.connector.connectURI=jdbc:derby://localhost:1527/var/druid/metadata.db;create=true
druid.metadata.storage.connector.host=localhost
druid.metadata.storage.connector.port=1527
Deep storage
druid.storage.type=azure
druid.azure.account=s_____orage
druid.azure.key=*************
druid.azure.container=*************
#druid.azure.prefix=“”
druid.azure.protocol=https
druid.azure.maxTries=3
druid.azure.maxListingLength=1024
druid_indexer_logs_type=file
druid_indexer_logs_directory=/opt/shared/indexing-logs
Service discovery
druid.selectors.indexing.serviceName=druid/overlord
druid.selectors.coordinator.serviceName=druid/coordinator
Monitoring
druid.monitoring.monitors=[“org.apache.druid.java.util.metrics.JvmMonitor”]
druid.emitter=noop
druid.emitter.logging.logLevel=info
Storage type of double columns
commiting this will lead to index double as float at the storage layer
druid.indexing.doubleStorage=double
Security
druid.server.hiddenProperties=[“druid.s3.accessKey”,“druid.s3.secretKey”,“druid.metadata.storage.connector.password=***”]
SQL
druid.sql.enable=true
Planning SQL query when there is aggregate distinct in the statement
druid.sql.planner.useGroupingSetForExactDistinct=true
Lookups
druid.lookup.enableLookupSyncOnStartup=false
Expression processing config
druid.expressions.useStrictBooleans=true
Http client
druid.global.http.eagerInitialization=false
Blockquote