RdfProcessingToolkit icon indicating copy to clipboard operation
RdfProcessingToolkit copied to clipboard

Only output used prefixes

Open Aklakan opened this issue 3 years ago • 2 comments

Output of all formats that support prefixes cause all of prefix.cc prefixes to be written out. This makes output so ugly its not funny.

  • By default, output should be deferred until a certain amount of data has been seen on which basis the set of used prefixes is determined. This is likely to work for 80-99% of the use cases. The amount of data to consider should be configurable.
  • To capture the remaining corner cases, there should be an option to provide the output prefixes directly

completed deferred output support on:

  • [x] sparql integrate
  • [ ] named graph streams (ngs)
  • [n/a] sparql binding streams (sbs)

Aklakan avatar Jan 23 '21 22:01 Aklakan

Deferred output with used prefix analysis is now implemented. Need to add the defer-output flag to all commands.

Aklakan avatar Feb 13 '21 16:02 Aklakan

There should be a --out-prefixes option where a file with prefix declarations can be specified.

Aklakan avatar Feb 19 '23 00:02 Aklakan