rasa icon indicating copy to clipboard operation
rasa copied to clipboard

Fix endoding issues

Open jzaprzalka opened this issue 1 year ago • 1 comments

Fixes encoding issues referenced in issue OSS-781 while running rasa data split nlu with entities containing non-english characters

Proposed changes:

  • Add ensure_ascii=False for json.dumps in TrainingDataWriter in order to properly encode non-ascii chatacters, e.g. Polish letters as referenced in issue OSS-781

Status (please check what you already did):

  • [ ] added some tests for the functionality
  • [ ] updated the documentation
  • [ ] updated the changelog (please check changelog for instructions)
  • [ ] reformat files using black (please check Readme for instructions)

jzaprzalka avatar Jul 10 '24 12:07 jzaprzalka

CLA assistant check
All committers have signed the CLA.

CLAassistant avatar Jul 10 '24 12:07 CLAassistant