unstructured
bug/partition_text removes the minus sign
Describe the bug
The function partition_text removes the minus sign.
To Reproduce
from unstructured.partition.text import partition_text
text = '''
net amount
-4,391,082,054.12
rate is -10%
'''
print(text)
data = partition_text(text=text)
for d in data:
    print(d.text)
    print('-' * 10)
Expected behavior
net amount
-4,391,082,054.12
rate is -10%
net amount
----------
-4,391,082,054.12
----------
rate is
----------
-10%
----------
Screenshots
It seems to be because the symbol is recognized as a bullet for an unordered list.
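To illustrate that guess, here is a minimal sketch of how a bullet-stripping step could eat a leading minus sign. The pattern `BULLET_RE` and both helper functions are hypothetical and written only for this example; they are not copied from unstructured's source, so the real cause may differ in detail.

```python
import re

# Hypothetical bullet pattern, for illustration only (an assumption,
# not unstructured's actual pattern): '-', '*' or '•' at line start.
BULLET_RE = re.compile(r"^\s*[-*\u2022]\s*")
# A '-' immediately followed by a digit is more likely a negative number.
NEGATIVE_NUMBER_RE = re.compile(r"^\s*-\d")


def strip_bullet(line: str) -> str:
    """Remove a leading bullet marker, as a naive list-item cleaner might."""
    return BULLET_RE.sub("", line, count=1)


def strip_bullet_keep_numbers(line: str) -> str:
    """Same as strip_bullet, but leave lines that start with a negative number."""
    if NEGATIVE_NUMBER_RE.match(line):
        return line
    return strip_bullet(line)


print(strip_bullet("-4,391,082,054.12"))               # minus sign is lost
print(strip_bullet_keep_numbers("-4,391,082,054.12"))  # minus sign is kept
print(strip_bullet_keep_numbers("- first item"))       # real bullet is still removed
```

A guard like this is only one possible direction; it assumes that a bullet is normally followed by whitespace or text rather than a digit.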
We'll get this fixed as soon as we're able
Is anyone working on fixing this?