Support JSONL format

Open johann-petrak opened this issue 3 years ago • 3 comments

For larger files, and generally for easier handling of example sets in JSON format, it would be very useful if the JSONL format were supported:

  • each example a JSON map, on its own line
  • no enclosing array

In other words, such files would get processed line by line, with each line getting converted to/from json as needed on export/import.

See https://jsonlines.org/
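To make the request concrete, here is a minimal sketch of the line-by-line processing described above, using only the Python standard library. It is not Label Studio's implementation; the helper names `read_jsonl` and `write_jsonl` and the `text` field are hypothetical and chosen only for illustration.

```python
import io
import json


def read_jsonl(stream):
    """Yield one parsed example per non-empty line of a text stream."""
    for line_number, line in enumerate(stream, start=1):
        line = line.strip()
        if not line:
            continue  # tolerate blank lines
        try:
            yield json.loads(line)
        except json.JSONDecodeError as err:
            raise ValueError(f"invalid JSON on line {line_number}: {err}") from err


def write_jsonl(examples, stream):
    """Write each example as a single JSON document on its own line."""
    for example in examples:
        # json.dumps without indent never emits a raw newline,
        # so every example stays on exactly one line.
        stream.write(json.dumps(example, ensure_ascii=False) + "\n")


# Hypothetical examples; real task data would have its own fields.
examples = [{"text": "first example"}, {"text": "second example"}]

buffer = io.StringIO()
write_jsonl(examples, buffer)
buffer.seek(0)
round_tripped = list(read_jsonl(buffer))
```

Because the reader is a generator, a large file can be processed one example at a time without loading an enclosing array into memory.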

johann-petrak avatar Jan 25 '22 18:01 johann-petrak

Are you talking about LS storages with jsonl import? Or about the common LS import files?

makseq avatar Jan 25 '22 21:01 makseq

Sorry, what I meant is for

  • uploading examples to annotate (Import button within a project)
  • downloading annotated examples (Export button within a project)

JSONL has become an often-used format in NLP because it is flexible yet, unlike a JSON array file, line-oriented, so other tools can be used to filter, update, sort, etc. the examples in JSONL files.
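As a rough illustration of why line orientation helps, the sketch below filters and sorts JSONL lines without ever building or rewriting an enclosing array. The helper names and the `id` and `text` fields are hypothetical, not part of any Label Studio format.

```python
import json


def filter_jsonl_lines(lines, predicate):
    """Yield only the raw lines whose parsed example satisfies predicate."""
    for line in lines:
        if line.strip() and predicate(json.loads(line)):
            yield line if line.endswith("\n") else line + "\n"


def sort_jsonl_lines(lines, key):
    """Return the non-empty raw lines ordered by key(parsed example)."""
    non_empty = (line for line in lines if line.strip())
    return sorted(non_empty, key=lambda line: key(json.loads(line)))


# Hypothetical JSONL content, one example per line.
lines = [
    '{"id": 3, "text": "short"}\n',
    '{"id": 1, "text": "a somewhat longer example"}\n',
    '{"id": 2, "text": "mid-sized text"}\n',
]

# Keep examples whose text is longer than 10 characters.
long_only = list(filter_jsonl_lines(lines, lambda ex: len(ex["text"]) > 10))

# Order examples by their id field.
by_id = sort_jsonl_lines(lines, key=lambda ex: ex["id"])
```

The filter works as a stream, so it can be applied to an open file object directly; sorting still needs all lines in memory, but each line remains an independent record that can be written back unchanged.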

johann-petrak avatar Jan 25 '22 22:01 johann-petrak

Seconding this and resurrecting it. Could we have JSONL support? There is a PR (#fea:jsonl support 5830), but it has stalled. Thanks guys :)

teddyhla avatar Jun 12 '25 09:06 teddyhla

JSONL support to load tasks from Source Cloud Storages will be released in Label Studio 1.20. Would like to gauge interest from the community about whether you generally use Cloud Storages or Data Import for this kind of data!

matt-bernstein avatar Jun 18 '25 20:06 matt-bernstein

JSONL support would be really great, actually, for both imports and exports. Our organisation tends to use Data Import rather than Cloud Storages at the moment. Thanks a lot. I look forward to the update.

teddyhla avatar Jun 18 '25 21:06 teddyhla

Hello,

LS now supports JSONL. Please test it out and let us know. I'll mark this issue closed for now.

AbubakarSaad avatar Nov 03 '25 18:11 AbubakarSaad