Cameron Joannidis
You need to run `sbt fmt`! :)
Sure @afsalthaj!
@jdegoes What implementations of Data Structures should we support? Ideally it could be abstract but because the `AnalyticsModule` currently defines the types, the implementing modules don't really have a choice...
Awesome. I've just merged in the PR that's been open for the last few weeks on dataset/data stream design. Worth branching off that for this piece since there are a...
Note that, unfortunately, we can't leverage `Arrow` from scalaz 7.x, because its `arr` method lets you lift arbitrary functions into the Arrow. Whilst this...
@afsalthaj Precisely. We want to control the user code so that we can reify it, but if users can lift arbitrary functions, we can't do that.
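To make the point about `arr` concrete, here is a minimal sketch of an arrow-like description language with no constructor for arbitrary `A => B` functions. All the names (`Pipeline`, `AndThen`, `AddOne`, `IntToString`, `Example`) are hypothetical and are not taken from the project or from scalaz; the only assumption is that each primitive is a known data constructor, so a whole program stays inspectable as data.

```scala
// Hypothetical sketch, not the project's API: a closed set of operations
// that can be composed but never extended with opaque Scala functions.
sealed trait Pipeline[A, B] {
  // Every node can describe itself because every node is a known case.
  def describe: String

  // Sequential composition, analogous to `>>>` on an Arrow.
  def >>>[C](next: Pipeline[B, C]): Pipeline[A, C] =
    Pipeline.AndThen(this, next)
}

object Pipeline {
  final case class AndThen[A, B, C](first: Pipeline[A, B], second: Pipeline[B, C])
      extends Pipeline[A, C] {
    def describe: String = s"(${first.describe} >>> ${second.describe})"
  }

  // Primitive operations supplied by the library itself.
  case object AddOne extends Pipeline[Int, Int] {
    def describe: String = "addOne"
  }

  case object IntToString extends Pipeline[Int, String] {
    def describe: String = "intToString"
  }

  // Deliberately absent: `final case class Arr[A, B](f: A => B)`.
  // A node holding a raw function could be run but not inspected,
  // so a backend could not translate it.
}

object Example {
  val program: Pipeline[Int, String] = Pipeline.AddOne >>> Pipeline.IntToString
  val rendered: String = program.describe // "(addOne >>> intToString)"
}
```

Because `Pipeline` is sealed and has no function-carrying case, anything a user builds is a tree of known nodes that an interpreter could render or translate for a particular backend.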
@SemanticBeeng If I'm understanding you correctly, I think the `data schema` you're referring to can be separated from the API design discussion. What we are aiming to do here is...
Yeah, so the low-level description language should probably be streaming-based, since streams are a superset of batch operations: https://data-artisans.com/blog/batch-is-a-special-case-of-streaming Ultimately the user-facing APIs will...
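As a rough illustration of batch being a special case of streaming, the sketch below defines one operation against a stream-like input and runs it on both a finite collection and an unbounded source. It uses only the standard library `Iterator` as a stand-in for a stream; `StreamSketch` and `runningSum` are hypothetical names, and nothing here reflects the design in the linked gist.

```scala
object StreamSketch {
  // Defined once against a (possibly unbounded) stream of values.
  // `scanLeft` on Iterator is lazy, so this also works on infinite input.
  def runningSum(in: Iterator[Int]): Iterator[Int] =
    in.scanLeft(0)(_ + _).drop(1) // drop the initial seed value

  // Batch: a finite collection is just a stream that happens to end.
  val batchResult: List[Int] = runningSum(List(1, 2, 3).iterator).toList

  // Streaming: an unbounded source, observed for its first three outputs.
  val streamPrefix: List[Int] = runningSum(Iterator.from(1)).take(3).toList
}
```

The same definition serves both cases; the batch run differs only in that its input terminates, so the full result can be materialised.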
I have a very rough API design proposal here: https://gist.github.com/camjo/10cb0f25b9da10f08f9b30cbd9419985 I've done a lot of thinking about the nature of batch and stream and what it means to do computation...
@jdegoes 1. Hmm, that's an interesting idea. I initially thought it should be `Grouped[K, List[V]]`, but perhaps you're thinking of a more general `Aggregatable` structure for `V` that can be...