
FR: Infer schema from multiline (multiple docs) JSON files

Open livelace opened this issue 2 years ago • 0 comments

datahub version: v0.8.41

Currently there is no way to infer a JSON schema from a file that contains multiple lines/docs, for example:

{"id":1, "text": "foo"}
{"id":2, "text": "bar"}

Putting many JSON docs into a single file is very convenient: it keeps datasets well organized and easy to manage.

Currently, inferring the schema of such a file produces an error:

could not infer schema for file s3://path/to/file.json: ' 'Trailing data']
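To illustrate the request, here is a minimal sketch of per-line inference, assuming one JSON object per line (the JSON Lines layout shown above). It uses only the Python standard library; `infer_jsonl_schema` is a hypothetical helper, not part of DataHub, and the type names it reports are an illustrative choice rather than DataHub's schema types.

```python
import json
from collections import defaultdict


def infer_jsonl_schema(lines):
    """Return {field name: sorted list of JSON type names} across all documents.

    Hypothetical helper: assumes every non-blank line is a JSON object.
    """
    type_names = {
        bool: "boolean",
        int: "number",
        float: "number",
        str: "string",
        list: "array",
        dict: "object",
        type(None): "null",
    }
    schema = defaultdict(set)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between documents
        doc = json.loads(line)  # parse each line as its own document
        for key, value in doc.items():
            schema[key].add(type_names[type(value)])
    return {key: sorted(types) for key, types in schema.items()}


sample = '{"id":1, "text": "foo"}\n{"id":2, "text": "bar"}\n'
print(infer_jsonl_schema(sample.splitlines()))
# {'id': ['number'], 'text': ['string']}
```

Parsing the whole file as a single document fails in the standard library too: `json.loads(sample)` raises `json.JSONDecodeError` ("Extra data"), which is likely the same class of failure as the "Trailing data" message above.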

Slack conversation

livelace, Aug 09 '22 22:08

This issue is stale because it has been open for 30 days with no activity. If you believe this is still an issue on the latest DataHub release please leave a comment with the version that you tested it with. If this is a question/discussion please head to https://slack.datahubproject.io. For feature requests please use https://feature-requests.datahubproject.io

github-actions[bot], Sep 11 '22 02:09