Track changes in data with ease

The lumberjack R package allows you to:

  • track changes in multiple data sets as they get processed;
  • use multiple loggers for each data set;
  • fully customize those loggers.

You can get started by just adding one line of code to your existing data analysis script.
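
As a minimal sketch of what that one line can look like, the example below writes a small hypothetical analysis script to a temporary file, adds a `start_log()` call right after the data is created, and executes it with `run_file()`. The script contents, the variable name `dat`, and the choice of the `simple` logger are assumptions made for illustration; the built-in `women` data set stands in for your own data.

```r
library(lumberjack)

# Hypothetical analysis script, written to a temporary file for illustration.
script <- tempfile(fileext = ".R")
writeLines(c(
  "dat <- women",
  "start_log(dat, logger = simple$new())  # the one added line",
  "dat$height <- dat$height * 2.54        # inches to centimetres",
  "dat$weight <- dat$weight * 0.4536      # pounds to kilograms"
), script)

# run_file() executes the script while tracking changes to 'dat'.
# It returns the environment in which the script was evaluated.
env <- run_file(script)

# The processed data is available in the returned environment.
head(env$dat)
```

With default settings, the `simple` logger should write its log as a CSV file in the working directory once the script finishes. The rest of the script stays free of logging code.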

Citing lumberjack

Please cite the JSS paper.

@article{,
  title = {Monitoring Data in {R} with the {lumberjack} Package},
  author = {Mark P. J. {van der Loo}},
  journal = {Journal of Statistical Software},
  year = {2021},
  volume = {98},
  number = {1},
  pages = {1--13},
  doi = {10.18637/jss.v098.i01},
  url = {}
}

lumberjack philosophy

Production scripts may contain many data transformations that clean, select, model, or augment data with new variables. Analyzing the effect of each step is cumbersome because it requires adding a lot of code that does not serve the primary goal of the script, namely analyzing and processing data.

In the lumberjack philosophy, a programmer (analyst) should only be concerned with the primary process of data analysis.


Published version from CRAN
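
Assuming a CRAN mirror is configured, the released version can typically be installed from within an R session using the standard `install.packages()` call:

```r
# Install the released version of lumberjack from CRAN.
install.packages("lumberjack")
```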


Development version.

git clone
cd lumberjack
make install

Copyright (2016) Mark van der Loo. Licensed under EUPL 1.2.