I created this demo lab using a single physical machine with plenty of RAM and CPU. I am running two ingestion tasks, and for some reason one of them keeps failing once or twice an hour. The task duration and timeout values are set to 5 minutes in the spec files for these tasks. The data generated is very limited. We are not trying to stress test the platform.
The task that generates relatively more data consistently fails between the 50th and 59th minute of the hour. The other task has no issues. I looked at the indexing logs for the failed tasks and cannot figure out what went wrong.
I am also seeing some additional task folders under the ./druid/task folder besides those of the running tasks. I thought task folders were supposed to be moved somewhere else once a task completed; then I noticed that the additional ones actually belong to the failed tasks.
I am attaching two of the failed task logs. Can someone give me a clue as to what is happening? I have been looking at these logs for the last two days and have found nothing so far.
log-1.txt (3.2 MB)
log-2.txt (3.24 MB)
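
For anyone who would rather not read through 3 MB of log by hand, a rough filter like the sketch below could help narrow things down. It is only an illustration: the function name `suspicious_lines` and the search pattern are made up for this post, and it assumes the task logs are plain text where the severity shows up as an upper-case word (ERROR, FATAL) and Java stack traces contain the word "Exception". It says nothing about what is actually in the attached logs.

```python
import re

# Assumption: severity is a standalone upper-case token on the log line,
# and Java exceptions / JVM memory errors are named in the text.
SUSPECT = re.compile(r"\b(ERROR|FATAL)\b|Exception|OutOfMemoryError")


def suspicious_lines(lines):
    """Yield (line_number, text) for every line that looks like an error."""
    for number, text in enumerate(lines, start=1):
        if SUSPECT.search(text):
            yield number, text.rstrip("\n")


def scan_file(path):
    """Return the suspicious lines of one log file as a list."""
    # errors="replace" keeps the scan going if the log has odd bytes in it.
    with open(path, encoding="utf-8", errors="replace") as handle:
        return list(suspicious_lines(handle))


# Example use (file names as attached to this post):
#   for number, text in scan_file("log-1.txt"):
#       print(number, text)
```

Running it over both failed-task logs and comparing the first flagged line in each should show whether the two failures start the same way.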