docxtractr issues

Numbers are lost when reading cells with numbered lists

First of all, thanks for the amazing package! I am trying to read the contents of a docx table and having issues with numbered lists. If I enter the numbers...

gorkang

Get table heading or page number for tables

Hello, this package has been incredibly helpful. Is there a way to include (or get) page numbers for each table? Or can we read in a particular number of pages and...

msundara1001

docx_extract_all_cmnts(..., include_text = TRUE) failing on edge case

First off, thank you for this package, it's really useful. I've run into an interesting scenario where the argument include_text = TRUE fails for a Word document. Here are two...

conig

Allow for OOXML where commentStartRange and commentEndRange nodes are not siblings

I've run across a valid DOCX XML structure where the commentStartRange and commentEndRange nodes are not siblings: This is an attempt to allow for accurate comment and anchor text extraction...

WilDoane

Feature request: selected_text in docx_extract_all_cmnts()

2

It would be much nicer if the `docx_extract_all_cmnts()` function added a column for `selected_text` containing each block of selected text corresponding to each comment. This would allow users to...

jooyoungseo

enhancement

Read special symbols within the tables in a .docx file

Thank you for docxtractr. While reading a .docx file, I have a special symbol (tick mark) within a table. Currently, docxtractr renders such symbols as null characters. Requesting to see...

prakashperiasamy

Alternative way of supporting doc files

Thanks a lot for such a great package. I was trying out `docxtractr::read_docx` on `doc` files in `Windows 10` using `LibreOffice Version: 6.2.5.2 (x64)`. It was horribly slow (_due to...

bedantaguru

Extract contents of document footers?

2

This package is incredibly handy, thanks! I don't know much about XML, but looking at an unzipped docx file, it appears that, if the footer exists, each section of document...

pkq

enhancement

input issue

2

Hello, everyone. When I used the function docx_extract_all_tbls() to extract data from a docx file that was output from SAS, I got an error: "Error: Must pass in...

zbq-2019

Error when assigning column names if the table has only one column

Hi there, thanks for the package, very useful! I get the following error when assigning a row as a column name if the scraped Word table has only one column....

cmzambranat
