Utku Demir
Joins
Implement a `join` function which joins two datasets based on a key/key function. Different join types (left/right/full outer, cartesian product) should be supported.
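As a rough illustration of the semantics being asked for, here is a minimal in-memory sketch over plain lists. It is not the `distributed-dataset` API: `JoinType`, `groupByKey`, `joinOn` and `cartesian` are hypothetical names chosen for this example, and a real implementation would have to shuffle both datasets by key rather than build a `Map` in memory.

```haskell
import qualified Data.Map.Strict as M

-- | Hypothetical join kinds; the inner join is included for completeness.
data JoinType = Inner | LeftOuter | RightOuter | FullOuter
  deriving (Eq, Show)

-- | Group rows by the key computed by a key function, preserving input order
-- within each group.
groupByKey :: Ord k => (a -> k) -> [a] -> M.Map k [a]
groupByKey f xs = M.fromListWith (flip (++)) [(f x, [x]) | x <- xs]

-- | Join two lists on keys extracted by the given key functions.
-- A missing side is represented as 'Nothing'. Results are ordered by key.
joinOn
  :: Ord k
  => JoinType -> (a -> k) -> (b -> k) -> [a] -> [b] -> [(Maybe a, Maybe b)]
joinOn jt keyA keyB as bs = concat (M.elems merged)
  where
    left  = groupByKey keyA as
    right = groupByKey keyB bs
    -- Keys present on both sides: pair every left row with every right row.
    both _ ls rs = Just [(Just l, Just r) | l <- ls, r <- rs]
    -- Keys present on one side only: keep or drop depending on the join type.
    onlyL
      | jt `elem` [LeftOuter, FullOuter] = M.map (map (\l -> (Just l, Nothing)))
      | otherwise                        = const M.empty
    onlyR
      | jt `elem` [RightOuter, FullOuter] = M.map (map (\r -> (Nothing, Just r)))
      | otherwise                         = const M.empty
    merged = M.mergeWithKey both onlyL onlyR left right

-- | Cartesian product needs no key at all.
cartesian :: [a] -> [b] -> [(a, b)]
cartesian as bs = [(a, b) | a <- as, b <- bs]
```

Returning `(Maybe a, Maybe b)` for every join type keeps the sketch short; a real API would more likely give each join type its own result type, so that an inner join returns `(a, b)` directly.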
Currently, how to start a new project using `distributed-dataset` is not really defined. We should have a `hello-world` project, along with an easy-to-use build system for both Linux and macOS. `stack`'s container...
Currently the API for writing a `Backend` is quite limited: it forces us to create a new process, and it does not look pretty. With a nicer API: * We can...
It makes sense to work on playing nicely with the Apache Spark & Hadoop ecosystem, so that people can start using `distributed-dataset` alongside their existing data pipeline. This is an...
If we implement a `ShuffleStore` and some utilities for reading from/writing to HDFS, users can more easily start using `distributed-dataset` on their existing data.
Currently we wait for a stage to finish before starting another one. It would be nicer to have a proper dependency graph where the nodes are tasks, so we can...
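To make the idea concrete, here is a small sketch of a task-level dependency graph. All names (`TaskId`, `Deps`, `readyTasks`, `schedule`) are hypothetical and not part of `distributed-dataset`. It assumes each task lists the tasks it depends on; a task becomes runnable as soon as those have finished, regardless of which stage it belongs to.

```haskell
import qualified Data.Map.Strict as M
import qualified Data.Set as S

-- | Hypothetical task identifier.
type TaskId = Int

-- | For every task, the set of tasks that must finish before it can start.
type Deps = M.Map TaskId (S.Set TaskId)

-- | Unfinished tasks whose dependencies have all finished.
-- A scheduler would call this each time a task completes.
readyTasks :: S.Set TaskId -> Deps -> [TaskId]
readyTasks done deps =
  [ t
  | (t, ds) <- M.toList deps
  , t `S.notMember` done
  , ds `S.isSubsetOf` done
  ]

-- | Simplified driver: repeatedly run everything that is ready.
-- Returns 'Nothing' if no progress can be made (a cycle, or a dependency
-- on a task that is not in the graph).
schedule :: Deps -> Maybe [[TaskId]]
schedule deps = go S.empty
  where
    go done
      | S.size done == M.size deps = Just []
      | null ready                 = Nothing
      | otherwise                  = (ready :) <$> go (S.union done (S.fromList ready))
      where
        ready = readyTasks done deps
```

`schedule` still advances in waves to keep the example short. An actual executor would instead call `readyTasks` after every individual task completion, which is what lets tasks from a later stage start early.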
See: http://hackage.haskell.org/package/aws-lambda-haskell-runtime
[Dependabot](https://dependabot.com/) is an open-source SaaS which periodically checks for dependency updates and automatically sends PRs. I think it will be a nice addition to the Nix ecosystem if we teach Dependabot...
I tried to run slack-cli in a minimal environment, and it silently failed with an exit code, but without printing any errors. I had to look at the source code...