Question about spec files for the Indexer


These are complete newbie questions.

  1. Is it necessary to start the Indexer and pass in a spec file? What’s the role of the hadoop_config.json in this example?

java -Xmx2g -Duser.timezone=UTC -Dfile.encoding=UTF-8 -classpath config/_common:config/overlord:lib/*:examples/indexing/wikipedia_hadoop_config.json io.druid.cli.Main server overlord

  2. What is the purpose of the JSON definition inside the task file that is posted to create a task? Is there any relationship between this and the config file that was used to launch the Indexer?

curl -X 'POST' -H 'Content-Type:application/json' -d @examples/indexing/wikipedia_index_task.json localhost:8090/druid/indexer/v1/task

  3. I’ve so far seen at least a couple of different ways of specifying dimensions and metrics. I’ve seen them inside the ‘parser’ section and also inside the ‘firehose’ section. Are they related?



See inline.