Steve Canny

Results 120 comments of Steve Canny

Closing as incomplete. @viboognesh if you have more information on this, please feel free to reopen :)

@MthwRobinson This appears to be either a corrupted message or a defect or limitation in `msg_parser` in how it extracts attachments. I'd say next step is either to chalk this...

One other possible behavior variation that occurred to me, not sure whether it's useful or not: When option {x} has value {y}, content within tables is partitioned like other content:...

@LucasOliveira44 You can convert `Table` elements to text roughly like this: ```python from unstructured.chunking.basic import chunk_elements from unstructured.documents.elements import Element, ElementMetadata, Table, Text elements = partition_..(...) # -- no chunking_strategy...

This approach of changing `Table` elements to another element type is the only approach that's going to work I believe. Combining tables into the same chunk as non-table chunks would...

Try setting the environment variable: ```bash $ export SCARF_NO_ANALYTICS=true ``` and see if that makes a difference. On some network configurations the analytics appear to take longer than desired. It's...

Hi @liturrig, `unstructured` does not currently have a "pdf-fast-only" install option. Can you say a bit more about the your use case and why you want something like that?

@cds-code can you describe how you are running `unstructured` in `AWS Lambda`?

Have you accounted for spin-up (cold-start) time of the Lambda instance? Like only start timing after receiving the first response? Also, can you provide some specific timings?