Teller icon indicating copy to clipboard operation
Teller copied to clipboard

differing PDF file names downloaded from RBC site breaks regular expression

Open scanion opened this issue 3 years ago • 1 comments

Greetings,

Been using this and had to make some adjustments to get it to work properly in my case. RBC does not make this easy!

For example I found there are multiple places to download pdf's on RBCs site. And depending on how you do it, it names the file differently. In doing so breaks the re to search for year in pdf title.

Typically you can download it for just a year and it names them something like so: XXXXXXXXXX-2020Apr14-2020May14.pdf

If you download them from edocument manager (hard to find) you can actually download all 7 years of the pdfs. Though the same pdf's are named something like Chequing Statement 2014-Dec-12.pdf

That is an easy fix and i can make a pull request if you are up for it.

Lonnie

scanion avatar Jul 22 '21 17:07 scanion

Greetings,

Been using this and had to make some adjustments to get it to work properly in my case. RBC does not make this easy!

For example I found there are multiple places to download pdf's on RBCs site. And depending on how you do it, it names the file differently. In doing so breaks the re to search for year in pdf title.

Typically you can download it for just a year and it names them something like so: XXXXXXXXXX-2020Apr14-2020May14.pdf

If you download them from edocument manager (hard to find) you can actually download all 7 years of the pdfs. Though the same pdf's are named something like Chequing Statement 2014-Dec-12.pdf

That is an easy fix and i can make a pull request if you are up for it.

Lonnie

Would be great if you can create the adjustment suggested. Just out of curiosity, is it possible to extract the account / card number and the statement date from the PDF content as well?

izlotnik avatar Sep 30 '21 16:09 izlotnik