Oliver Kennedy issues

Results 161 issues of


                                            Oliver Kennedy

Fixed-width-format data inputs

enhancement

Simplify error-aware CSV parser

Currently the error aware CSV parser is a mod of the existing Spark CSV parser. 1. There's a lot of overhead in the CSV parser for dealing with things that...

enhancement

Add Schema Documentation for the Mimir API

As reported by @heikomuller

documentation

Can't create missing-key lens over another missing-key lens

As reported by @heikomuller

bug

Propagate Schema Changes From Adaptive Schemas through Models

To see this bug in action: ``` mimir> load 'test/data/temperature.csv'; mimir> create lens repaired as select * from temperature with domain('temperature > -30'); mimir> feedback temperature 0 is real; mimir>...

bug

enhancement

lenses

adaptive schemas

Pivot Lens

Another use for the adaptive schemas, creating pivot tables. Consider for example: ``` time,location,temperature 1505958603,den,24.5 1505958604,basement,21.3 1505959204,den,24.5 1505959204,basement,21.400000000000002 1505959803,den,24.6 1505959804,basement,21.3 1505960265,office,17.5 1505960403,den,24.5 1505960404,basement,21.40000000000000 1505961003,den,24.5 1505961005,basement,21.400000000000002 1505961603,den,24.400000000000002 1505961604,basement,21.3 1505962203,den,24.400000000000002 ``` It...

enhancement

eventually

adaptive schemas

662 Project

Extend ANALYZE to detect potential data errors

As of right now, ANALYZE only detects sources of uncertainty injected by Mimir. It would be helpful if Mimir had some facility to do syntactic analysis on a dataset being...

enhancement

eventually

explain/analyze

662 Project

`UNNEST`

It would be nice to have an UNNEST operator, essentially an inverse aggregate (one row/cell to many). Examples: * Iterate over the elements of a JSON array * Regexp matches...

enhancement

compiler

PDF Table Import

Might be interesting to see if we can incorporate stuff from this system for extracting tabular PDF data: https://github.com/WZBSocialScienceCenter/pdftabextract

enhancement

help wanted

lenses

See if we can plug macrobase into Mimir's explanation system

https://github.com/stanford-futuredata/macrobase

enhancement

help wanted

backend

lenses

eventually