unstructured
bug/Only element from partition is "Please enable JS and disable any ad blocker"
Describe the bug: When partitioning a New York Times article by URL, the only element returned from partition is (unstructured.documents.html.HTMLTitle, 'Please enable JS and disable any ad blocker').
To Reproduce
# Run in a Colab cell: "!" is notebook shell syntax and display() is the notebook built-in
!pip install "unstructured[all-docs]"
from unstructured.partition.auto import partition
url = 'https://www.nytimes.com/2024/02/19/world/europe/navalny-letters-russia.html'
elements = partition(url=url, strategy='hi_res', html_assemble_articles=True)
# Show each element's type alongside its text
display(*[(type(element), element.text) for element in elements])
Expected behavior: partition should return the article's elements (Title, NarrativeText, etc.), not a single placeholder title.
Environment Info: Google Colab
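A possible way to narrow this down, offered as a sketch and not a confirmed diagnosis: the returned text reads like an interstitial page served by the site, so it may help to check whether the partition output consists only of such a message. The helper name `looks_like_block_page` and the marker phrases below are hypothetical, chosen for illustration; they are not part of unstructured.

```python
def looks_like_block_page(texts, markers=("enable js", "ad blocker", "captcha")):
    """Heuristic: True if the element texts look like an interstitial page.

    `texts` is a list of strings, e.g. [el.text for el in elements].
    The marker phrases are assumptions for illustration, not an exhaustive list.
    """
    # Drop empty or whitespace-only entries before inspecting the rest
    cleaned = [t.strip() for t in texts if t and t.strip()]
    if not cleaned:
        return False
    # Treat the output as an interstitial only when it is very short
    # and every remaining text contains one of the marker phrases
    return len(cleaned) <= 3 and all(
        any(marker in text.lower() for marker in markers) for text in cleaned
    )
```

With the output from the reproduction above, `looks_like_block_page([el.text for el in elements])` would return True. If it does, the HTML that partition received was probably not the article itself, which would point at how the page is fetched and not at the HTML parsing.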