chronix.spark
Time series analysis with Apache Spark based on Chronix
Cool showcase:
```
select metric, process from timeseries group by metric, process order by timeseries
```
With and without GROUP BY pushdown and ordering in Spark.
see https://commons.apache.org/proper/commons-math/userguide/stat.html
Based on Chronix logic, but enhanced with Johannes Weigend's solution of pre-hashing the documents to avoid shuffling during import.
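The pre-hashing idea can be illustrated with a minimal sketch. It assumes (the source does not confirm this) that each document carries a key such as the metric name and that a stable hash of this key picks the partition at import time, so that all documents of one time series already sit together and no later shuffle is needed. The names `partition_for`, `pre_partition`, and the `metric` field are hypothetical and are not part of the Chronix Spark API.

```python
import hashlib
from collections import defaultdict


def partition_for(key: str, num_partitions: int) -> int:
    """Map a key to a partition index using a stable, process-independent hash."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions


def pre_partition(documents, num_partitions, key_field="metric"):
    """Group documents by the partition their key hashes to (hypothetical helper)."""
    partitions = defaultdict(list)
    for doc in documents:
        partitions[partition_for(doc[key_field], num_partitions)].append(doc)
    return dict(partitions)


# Illustrative documents only; the field names are assumptions.
docs = [
    {"metric": "cpu.load", "start": 0},
    {"metric": "heap.used", "start": 0},
    {"metric": "cpu.load", "start": 100},
]
parts = pre_partition(docs, num_partitions=4)
```

Because the hash depends only on the key, both `cpu.load` documents end up in the same partition regardless of the order in which they arrive.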
- Support the Solr /export interface to transfer result sets from Solr to Spark (see https://cwiki.apache.org/confluence/display/solr/Exporting+Result+Sets).
- Introduce DocValues for all relevant fields to be able to export them
-...
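As a rough sketch of what a request against the `/export` handler could look like, the snippet below only builds the URL and sends nothing. The host, the collection name `chronix`, the field names, and the helper `build_export_url` are assumptions for illustration. The `q`, `sort`, and `fl` parameters are the ones the Solr export handler expects, and the fields named in `sort` and `fl` need DocValues enabled.

```python
from urllib.parse import urlencode


def build_export_url(base_url, collection, query, sort, fields):
    """Build a Solr /export URL (hypothetical helper; performs no network I/O)."""
    params = urlencode({
        "q": query,              # the query selecting the documents
        "sort": sort,            # /export requires an explicit sort
        "fl": ",".join(fields),  # /export requires an explicit field list
    })
    return f"{base_url}/{collection}/export?{params}"


# Assumed host, collection, and field names, for illustration only.
url = build_export_url(
    "http://localhost:8983/solr",
    "chronix",
    "metric:cpu.load",
    "start asc",
    ["metric", "start", "end"],
)
```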
- POJO representation
- ChronixRDD
- DataFrame
Integrate the code copied from other sub-projects back into those sub-projects and remove it from the Chronix Spark codebase.