A problem when ingest data locating in S3

Hi Druid Gurus,

We are using Druid as our OLAP and I face a problem when using druid to ingesting data locating in S3. I hope I can get help from your.

We setup a ETL pipeline and transform some event log and then put it into S3. After that, a druid job will be triggered by posting a json to index node. Usually druid will submit a Hadoop job to EMR. However in some cases the job fails in the Hadoop job configuring phrase.

It seems the druid node append files into Hadoop job definition. and then it append these files again. The first time it successes and then it fails.

I read the source code and really cannot understand why this happens. If anyone can help me, thanks in advance.

Best Wishes

Gabriel

The log:

2017-06-14T06:02:02,644 INFO [task-runner-0-priority-0] io.druid.indexing.common.task.HadoopIndexTask - Starting a hadoop index generator job...
2017-06-14T06:02:02,666 INFO [task-runner-0-priority-0] io.druid.indexer.path.GranularityPathSpec - Checking path[s3n://traffic-data-processed/transformed/2017/05/15/11]
2017-06-14T06:02:02,849 INFO [task-runner-0-priority-0] io.druid.indexer.path.GranularityPathSpec - Appending path [s3n://traffic-data-processed/transformed/2017/05/15/11/part-00000-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00001-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00002-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00003-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00004-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00005-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00006-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00007-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00008-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00009-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00010-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00011-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00012-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00013-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00014-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00015-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00016-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00017-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00018-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00019-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00020-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00021-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00022-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00023-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00024-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00025-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00026-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00027-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00028-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00029-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00030-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00031-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00032-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00033-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00034-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00035-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00036-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00037-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00038-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00039-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00040-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00041-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00042-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00043-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00044-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00045-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00046-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00047-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00048-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00049-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00050-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00051-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00052-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00053-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00054-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00055-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00056-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00057-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00058-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00059-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00060-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00061-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00062-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00063-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00064-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00065-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00066-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00067-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00068-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00069-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00070-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00071-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00072-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00073-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00074-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00075-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00076-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00077-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00078-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00079-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00080-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00081-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00082-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00083-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00084-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00085-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00086-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00087-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet, s3n://traffic-data-processed/transformed/2017/05/15/11/part-00088-23d496b3-5da2-4a6b-8869-f83078ec9049.snappy.parquet]
2017-06-14T06:02:02,853 INFO [task-runner-0-priority-0] io.druid.indexer.HadoopDruidIndexerJob - No metadataStorageUpdaterJob set in the config. This is cool if you are running a hadoop index task, otherwise nothing will be uploaded to database.
2017-06-14T06:02:02,879 INFO [task-runner-0-priority-0] io.druid.indexer.path.GranularityPathSpec - Checking path[s3n://traffic-data-processed/transformed/2017/05/15/11]
2017-06-14T06:02:03,091 ERROR [task-runner-0-priority-0] io.druid.indexing.overlord.ThreadPoolTaskRunner - Exception while running task[HadoopIndexTask{id=index_hadoop_em-traffic-druid-concise-compressed_2017-06-14T05:55:34.265Z, type=index_hadoop, dataSource=em-traffic-druid-concise-compressed}]
java.lang.RuntimeException: java.lang.reflect.InvocationTargetException
	at com.google.common.base.Throwables.propagate(Throwables.java:160) ~[guava-16.0.1.jar:?]
	at io.druid.indexing.common.task.HadoopTask.invokeForeignLoader(HadoopTask.java:211) ~[druid-indexing-service-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexing.common.task.HadoopIndexTask.run(HadoopIndexTask.java:223) ~[druid-indexing-service-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexing.overlord.ThreadPoolTaskRunner$ThreadPoolTaskRunnerCallable.call(ThreadPoolTaskRunner.java:436) [druid-indexing-service-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexing.overlord.ThreadPoolTaskRunner$ThreadPoolTaskRunnerCallable.call(ThreadPoolTaskRunner.java:408) [druid-indexing-service-0.10.0-iap1.jar:0.10.0-iap1]
	at java.util.concurrent.FutureTask.run(FutureTask.java:266) [?:1.8.0_131]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [?:1.8.0_131]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [?:1.8.0_131]
	at java.lang.Thread.run(Thread.java:748) [?:1.8.0_131]
Caused by: java.lang.reflect.InvocationTargetException
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) ~[?:1.8.0_131]
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) ~[?:1.8.0_131]
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) ~[?:1.8.0_131]
	at java.lang.reflect.Method.invoke(Method.java:498) ~[?:1.8.0_131]
	at io.druid.indexing.common.task.HadoopTask.invokeForeignLoader(HadoopTask.java:208) ~[druid-indexing-service-0.10.0-iap1.jar:0.10.0-iap1]
	... 7 more
Caused by: java.lang.RuntimeException: java.util.NoSuchElementException
	at io.druid.indexer.IndexGeneratorJob.run(IndexGeneratorJob.java:215) ~[druid-indexing-hadoop-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexer.JobHelper.runJobs(JobHelper.java:349) ~[druid-indexing-hadoop-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexer.HadoopDruidIndexerJob.run(HadoopDruidIndexerJob.java:95) ~[druid-indexing-hadoop-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexing.common.task.HadoopIndexTask$HadoopIndexGeneratorInnerProcessing.runTask(HadoopIndexTask.java:276) ~[druid-indexing-service-0.10.0-iap1.jar:0.10.0-iap1]
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) ~[?:1.8.0_131]
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) ~[?:1.8.0_131]
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) ~[?:1.8.0_131]
	at java.lang.reflect.Method.invoke(Method.java:498) ~[?:1.8.0_131]
	at io.druid.indexing.common.task.HadoopTask.invokeForeignLoader(HadoopTask.java:208) ~[druid-indexing-service-0.10.0-iap1.jar:0.10.0-iap1]
	... 7 more
Caused by: java.util.NoSuchElementException
	at io.druid.indexer.hadoop.FSSpideringIterator.next(FSSpideringIterator.java:98) ~[druid-indexing-hadoop-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexer.hadoop.FSSpideringIterator.next(FSSpideringIterator.java:91) ~[druid-indexing-hadoop-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexer.hadoop.FSSpideringIterator.next(FSSpideringIterator.java:33) ~[druid-indexing-hadoop-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexer.path.GranularityPathSpec.addInputPaths(GranularityPathSpec.java:144) ~[druid-indexing-hadoop-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexer.HadoopDruidIndexerConfig.addInputPaths(HadoopDruidIndexerConfig.java:389) ~[druid-indexing-hadoop-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexer.IndexGeneratorJob.run(IndexGeneratorJob.java:193) ~[druid-indexing-hadoop-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexer.JobHelper.runJobs(JobHelper.java:349) ~[druid-indexing-hadoop-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexer.HadoopDruidIndexerJob.run(HadoopDruidIndexerJob.java:95) ~[druid-indexing-hadoop-0.10.0-iap1.jar:0.10.0-iap1]
	at io.druid.indexing.common.task.HadoopIndexTask$HadoopIndexGeneratorInnerProcessing.runTask(HadoopIndexTask.java:276) ~[druid-indexing-service-0.10.0-iap1.jar:0.10.0-iap1]
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) ~[?:1.8.0_131]
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) ~[?:1.8.0_131]
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) ~[?:1.8.0_131]
	at java.lang.reflect.Method.invoke(Method.java:498) ~[?:1.8.0_131]
	at io.druid.indexing.common.task.HadoopTask.invokeForeignLoader(HadoopTask.java:208) ~[druid-indexing-service-0.10.0-iap1.jar:0.10.0-iap1]
	... 7 more
2017-06-14T06:02:03,101 INFO [task-runner-0-priority-0] io.druid.indexing.overlord.TaskRunnerUtils - Task [index_hadoop_em-traffic-druid-concise-compressed_2017-06-14T05:55:34.265Z] status changed to [FAILED].
2017-06-14T06:02:03,104 INFO [task-runner-0-priority-0] io.druid.indexing.worker.executor.ExecutorLifecycle - Task completed with status: {
  "id" : "index_hadoop_em-traffic-druid-concise-compressed_2017-06-14T05:55:34.265Z",
  "status" : "FAILED",
  "duration" : 381058
}

I think I find the root cause. It relates to AWS emr. It caused by _temporary folder. what weird is sometimes it disappears and then it shows again.

-rw-rw-rw- 1 0 2017-06-15 01:44 s3n://traffic-data-processed/transformed/2017/05/07/13/_SUCCESS

drwxrwxrwx - 0 1970-01-01 00:00 s3n://traffic-data-processed/transformed/2017/05/07/13/_temporary

drwxrwxrwx - 0 1970-01-01 00:00 s3n://traffic-data-processed/transformed/2017/05/07/13/_temporary/0

drwxrwxrwx - 0 1970-01-01 00:00 s3n://traffic-data-processed/transformed/2017/05/07/13/_temporary/0/task_20170615003331_0003_m_000005

lsr: File s3n://traffic-data-processed/transformed/2017/05/07/13/_temporary/0/task_20170615003331_0003_m_000005 does not exist.

-rw-rw-rw- 1 43377095 2017-06-15 01:42 s3n://traffic-data-processed/transformed/2017/05/07/13/part-00000-f3f74268-9b12-4067-a6ff-0495b56c4e9a.snappy.parquet

-rw-rw-rw- 1 42635285 2017-06-15 01:41 s3n://traffic-data-processed/transformed/2017/05/07/13/part-00001-f3f74268-9b12-4067-a6ff-0495b56c4e9a.snappy.parquet

-rw-rw-rw- 1 42149495 2017-06-15 01:41 s3n://traffic-data-processed/transformed/2017/05/07/13/part-00002-f3f74268-9b12-4067-a6ff-0495b56c4e9a.snappy.parquet

-rw-rw-rw- 1 41917703 2017-06-15 01:41 s3n://traffic-data-processed/transformed/2017/05/07/13/part-00003-f3f74268-9b12-4067-a6ff-0495b56c4e9a.snappy.parquet

-rw-rw-rw- 1 41680200 2017-06-15 01:43 s3n://traffic-data-processed/transformed/2017/05/07/13/part-00004-f3f74268-9b12-4067-a6ff-0495b56c4e9a.snappy.parquet

-rw-rw-rw- 1 41534347 2017-06-15 01:42 s3n://traffic-data-processed/transformed/2017/05/07/13/part-00005-f3f74268-9b12-4067-a6ff-0495b56c4e9a.snappy.parquet

-rw-rw-rw- 1 41321052 2017-06-15 01:42 s3n://traffic-data-processed/transformed/2017/05/07/13/part-00006-f3f74268-9b12-4067-a6ff-0495b56c4e9a.snappy.parquet

-rw-rw-rw- 1 41161334 2017-06-15 01:41 s3n://traffic-data-processed/transformed/2017/05/07/13/part-00007-f3f74268-9b12-4067-a6ff-0495b56c4e9a.snappy.parquet

-rw-rw-rw- 1 44629158 2017-06-15 01:42 s3n://traffic-data-processed/transformed/2017/05/07/13/part-00008-f3f74268-9b12-4067-a6ff-0495b56c4e9a.snappy.parquet