spark-druid-connector
A library for querying Druid data sources with Apache Spark
When I tested the original code with Spark 2.4.4, I ran into some trouble, so I made 4 different changes. 1. Change the Spark, Scala, and dependency module versions in `build.sbt`. 2. Add...
I create a connection with Druid. I can access the schema of the datasource without any problem, but when I execute a SQL query and try to print the results...
Could you please provide more documentation or examples for reading from an existing Druid table? Thank you in advance.
spark.read.format("org.rzlabs.druid").option("zkHost","localhost:2181").option("druidDatasource", "wikiticker").load.createOrReplaceTempView("druid_table");

org.rzlabs.druid.DruidDataSourceException: Time boundary should include both the start time and the end time.
    at org.rzlabs.druid.client.DruidQueryServerClient.timeBoundary(DruidClient.scala:411)
    at org.rzlabs.druid.client.DruidClient.metadata(DruidClient.scala:308)
    at org.rzlabs.druid.client.DruidQueryServerClient.metadata(DruidClient.scala:417)
    at org.rzlabs.druid.metadata.DruidMetadataCache$.getDataSourceInfo(DruidMetadataCache.scala:226)
    at org.rzlabs.druid.metadata.DruidRelationInfoCache$class.druidRelation(DruidMetadataCache.scala:127)
    at org.rzlabs.druid.metadata.DruidMetadataCache$.druidRelation(DruidMetadataCache.scala:145)
    at org.rzlabs.druid.DefaultSource.createRelation(DefaultSource.scala:92)
    at...
In this case, a queryIntervals member should be added to the QueryIntervals class to hold the multiple intervals.
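To illustrate the idea of keeping several intervals in one member, here is a minimal, self-contained sketch. It is not the connector's actual QueryIntervals class: `TimeInterval`, `QueryIntervalsSketch`, `add`, and `toDruidStrings` are hypothetical names, and `java.time.Instant` is used only to avoid extra dependencies. The only assumption carried over from the text is a `queryIntervals` member that holds more than one interval.

```scala
import java.time.Instant

// Hypothetical time range; not a type from the connector.
final case class TimeInterval(start: Instant, end: Instant) {
  require(!end.isBefore(start), "end must not be before start")

  // True when the two ranges overlap or touch at a boundary.
  def overlapsOrAbuts(other: TimeInterval): Boolean =
    !start.isAfter(other.end) && !other.start.isAfter(end)
}

// Hypothetical container with a queryIntervals member holding several
// intervals, kept sorted by start time and merged when they overlap.
final case class QueryIntervalsSketch(queryIntervals: List[TimeInterval] = Nil) {

  def add(interval: TimeInterval): QueryIntervalsSketch = {
    val sorted = (interval :: queryIntervals).sortBy(_.start.toEpochMilli)
    // Accumulate in reverse: the head is always the most recent interval.
    val merged = sorted.foldLeft(List.empty[TimeInterval]) {
      case (last :: rest, next) if last.overlapsOrAbuts(next) =>
        val end = if (next.end.isAfter(last.end)) next.end else last.end
        TimeInterval(last.start, end) :: rest
      case (acc, next) =>
        next :: acc
    }
    QueryIntervalsSketch(merged.reverse)
  }

  // Render each interval as an ISO-8601 "start/end" string, the form
  // Druid's native queries accept in their "intervals" field.
  def toDruidStrings: List[String] =
    queryIntervals.map(i => s"${i.start}/${i.end}")
}
```

Merging on insert keeps the list free of overlapping ranges, so each disjoint interval in a filter can map to exactly one entry; whether the real class should merge or simply append is a design choice this sketch does not settle.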