pash
pash copied to clipboard
PaSh: Light-touch Data-Parallel Shell Processing
Printing the specific commit that produced a log could help looking at old logs and figuring out what version of PaSh they were produced from.
The microbenchmarks tests seem to be divided into two, with one file for the environment variables and one file for the script. I find this very impractical. It would be...
Add the anticipated shell (`/bin/bash`?) at the beginning of the generated parallel script
This PR includes scripts that allow running experiments on multiple ec2-instances. The infrastructure is in `evaluation/multi-instance-experiment` and assuming that the instances referred to in `main.sh` exist, we just have to...
This issue is for tracking progress on the super-optimization component. This component solves the challenge of generating a script that is likely to exhibit the highest possible performance for the...
When executing the script `minimal_grep_stdin.sh` there is no split added before the first `tr`. This has to be solved.
It seems that it might be possible to also parallelize the sequence operator in a node. For example something like: ``` { head ; grep } | ... ``` Due...
There is a bug in our compiler's compilation... At the moment we compile the following: ```sh cmd1 & cmd2 ``` to a dataflow with two nodes (cmd1, cmd2). Then we...
Currently (in `future_annotations` and soon in `future`) we install the annotation library by cloning it and then building it using pip, but we could instead just download it from PyPI...
When debugging the annotations framework, it is sometimes a bit annoying that all exceptions are caught but using `-assert_compiler_success` is not possible for all scripts. I suppose debugging for other...