marklogic-contentpump
marklogic-contentpump copied to clipboard
MarkLogic Contentpump (mlcp)
CompressedRDFReader now buffers ZipEntrys in java heap to read in zipped file. Instead, it should stream the reading process, which is desired and preferrable. CompressedAggregateReader and CompressedDelimitedTextReader has done the...
The current set of tests function mostly as integration tests that require a live instance of MarkLogic, and take ~15 minutes to run. - Unit tests should run quickly, and...
After much frustration fussing, and a post to stack overflow, I finally found a syntax that would work: "^.*(/[^/]+/),'$1'" It turns out that you have to both wrap the whole...
When running MLCP transform on triple data the `context` argument does not contain any collections. I suspect this is because triplestore data uses the `-output_graph` argument instead of the `-output_collection`...
Important note: This works correct in a windows _command prompt_, but not in **PowerShell**. Steps to reproduce: Run the following two commands in PowerShell: ```powershell mlcp $LASTEXITCODE ``` The first...
This is something I noticed recently but worked around by removing bad rows (those where the column count does not match the header count) during pre-processing. However, it can still...
It would be useful to get `query_filter` working on import documents such as XML or CSV splits. This can be realized by executing reverse queries prior to ingest for filtering...
Need to ingest DAT file into DH. DAT file comes with ~|^ delimiter. I try other Field Separator with ~|^ but MLCP couldn't identify the content. Try Binary or Text,...
Can we please publish commons-csv:1.5.1-marklogic to jcenter or mavenCentral as it is now unreachable due to issues with http://developer.marklogic.com/maven2/ and/or https://developer.marklogic.com/maven2/
As a USER I frequently receive data as an array of objects [{...}, {...}, {…}] and want to be able to easily ingest it into MarkLogic using mlcp without having...