I am trying to create an ingestion task from my local machine on a remote Druid server using the post-index-task command. The specification file is stored on the local machine, and the JSON data to be ingested is located on the Druid server. When I run the command "post-index-task --file --url " from the local machine, it submits the task to Druid and reports success, but then it fails with the error: urllib2.URLError: <urlopen error [Errno 111] Connection refused>
Full trace is:
Beginning indexing data for cmh
Task started: index_parallel_cmh_2019-09-30T06:56:08.593Z
Task log: /druid/indexer/v1/task/index_parallel_cmh_2019-09-30T06:56:08.593Z/log
Task status: /druid/indexer/v1/task/index_parallel_cmh_2019-09-30T06:56:08.593Z/status
Task index_parallel_cmh_2019-09-30T06:56:08.593Z still running…
Task index_parallel_cmh_2019-09-30T06:56:08.593Z still running…
Task finished with status: SUCCESS
Completed indexing data for cmh. Now loading indexed data onto the cluster…
Traceback (most recent call last):
  File "/home/user/Downloads/apache-druid-0.15.0-incubating/bin/post-index-task-main", line 174, in <module>
    main()
  File "/home/user/Downloads/apache-druid-0.15.0-incubating/bin/post-index-task-main", line 171, in main
    await_load_completion(args, datasource, load_timeout_at)
  File "/home/user/Downloads/apache-druid-0.15.0-incubating/bin/post-index-task-main", line 119, in await_load_completion
    response = urllib2.urlopen(req, None, response_timeout)
  File "/usr/lib/python2.7/urllib2.py", line 154, in urlopen
    return opener.open(url, data, timeout)
  File "/usr/lib/python2.7/urllib2.py", line 429, in open
    response = self._open(req, data)
  File "/usr/lib/python2.7/urllib2.py", line 447, in _open
    '_open', req)
  File "/usr/lib/python2.7/urllib2.py", line 407, in _call_chain
    result = func(*args)
  File "/usr/lib/python2.7/urllib2.py", line 1228, in http_open
    return self.do_open(httplib.HTTPConnection, req)
  File "/usr/lib/python2.7/urllib2.py", line 1198, in do_open
    raise URLError(err)
urllib2.URLError: <urlopen error [Errno 111] Connection refused>
Can you check whether there are any network issues, such as latency, between your local machine and the Druid cluster? Could you also check the Coordinator log to see whether the issue occurs consistently?
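Since the traceback fails inside await_load_completion, after the task itself has already reported SUCCESS, one quick way to run the connectivity check is to test whether the relevant ports on the Druid server accept TCP connections from the local machine. The following is only a minimal sketch in Python 3, not part of Druid: the function names can_connect and check_endpoints are made up for illustration, and the host name and ports you pass in are assumptions that must match your own setup.

```python
import socket


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    A refused connection (Errno 111), a timeout, or a DNS failure all
    raise OSError subclasses, so each of them yields False here.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def check_endpoints(endpoints, timeout=3.0):
    """Probe each (label, host, port) tuple and return {label: reachable}.

    The labels, hosts, and ports are supplied by the caller; nothing
    here is read from the Druid configuration.
    """
    return {
        label: can_connect(host, port, timeout)
        for label, host, port in endpoints
    }
```

For example, calling check_endpoints([("overlord", "druid.example.com", 8090), ("coordinator", "druid.example.com", 8081)]) returns a dictionary of True/False values, where druid.example.com is a placeholder host and 8090/8081 are assumed ports that may differ in your cluster. A False entry means the port is not reachable from your machine, which would be consistent with the "Connection refused" error above.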