refinery icon indicating copy to clipboard operation
refinery copied to clipboard

Refinery is a tool to extract and transform semi-structured data from Excel spreadsheets of different layouts in a declarative way.

refinery (open sourced) CircleCI

Refinery is an open source tool to extract semi-structured data from Excel spreadsheets (both in .xls and .xlsx format) in a declarative way.

In the E-T-L process, refinery focuses on the E: Extract.

With declarative DRY schemas representing the expectations for a given data source, Refinery makes data pipelines more maintainable.

All documentation is located under GitHub Wiki