Data Processing Tool

DataPipe

DataPipe is a data processing tool that can work with larger-than-memory files.

Screenshot of DataPipe

Downloading

DataPipe requires Java to be installed.

Download the latest version from the releases page. The file is named something like datapipe-0.x.x-standalone.jar.

Using

DataPipe always requires an input file and an output file. The input file is the file to be processed, typically a large one, which DataPipe reads. The output file is a new file to which DataPipe writes the processed data.

Just below the input and output files, you can type a list of commands. The commands are executed as operations on the data from the input file, and the final result is written to the output file.

At the bottom, a preview of the file is displayed. It shows only the first 100 rows of the output file.
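The workflow above (read a large input file, apply a list of commands, write an output file, preview the first 100 rows) can be illustrated with a small streaming sketch. This is not DataPipe's actual implementation or command syntax; it is a minimal Python illustration of the general idea, assuming line-oriented text files. The names `process_file`, `preview`, `drop_blank`, and `uppercase` are hypothetical and exist only for this example.

```python
from itertools import islice
from typing import Callable, Iterator, List

# A "command" here is any function that turns one stream of rows into another.
# This is an assumption for illustration, not DataPipe's command format.
Command = Callable[[Iterator[str]], Iterator[str]]


def process_file(input_path: str, output_path: str, commands: List[Command]) -> None:
    """Stream rows from input_path through each command and write them to output_path.

    Rows are handled one at a time, so the whole file never has to fit in memory.
    """
    with open(input_path, encoding="utf-8") as src, \
         open(output_path, "w", encoding="utf-8") as dst:
        rows: Iterator[str] = (line.rstrip("\n") for line in src)
        for command in commands:
            rows = command(rows)  # chain lazily; nothing is read yet
        for row in rows:
            dst.write(row + "\n")


def preview(path: str, limit: int = 100) -> List[str]:
    """Return at most the first `limit` rows of a file without reading the rest."""
    with open(path, encoding="utf-8") as f:
        return [line.rstrip("\n") for line in islice(f, limit)]


# Two hypothetical example commands.
def drop_blank(rows: Iterator[str]) -> Iterator[str]:
    return (row for row in rows if row.strip())


def uppercase(rows: Iterator[str]) -> Iterator[str]:
    return (row.upper() for row in rows)
```

Calling `process_file("in.txt", "out.txt", [drop_blank, uppercase])` followed by `preview("out.txt")` mirrors the three parts of the window: the file fields, the command list, and the preview. Because every step is a generator, memory use stays roughly constant regardless of file size.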

Building

Build the standalone jar with Leiningen:

lein uberjar

License

Copyright © 2015 Frederik De Bleser

Distributed under the Eclipse Public License, either version 1.0 or (at your option) any later version.