datapackage-pipelines
datapackage-pipelines copied to clipboard
Example showing processing zipped xls files
I'm trying to construct an example for processing zipped xls. Downloader seems to only support csv.
A couple of questions:
- Can I use
/tmp? How can I make sure it's deleted after I'm done? Even if some error occurs. Is there some kind ofwithstatement across the pipeline? - How can I spew parameters to the next processor (i.e. filenames in a zip)? Should I use yield or modify datapackage?
- Could find a link to excel processor. Is there a list of all processors available somewhere?
- Downloader should support Excel files as well - see here for example: https://github.com/openspending/os-data-importers/blob/master/america/uruguay/fiscal.source-spec.yaml
- The way to do it is to create a separate 'resource' in the datapackage for each of these files.
- I need to add proper documentation for 'standard library' processors.