reader icon indicating copy to clipboard operation
reader copied to clipboard

Incomplete Markdown Conversion: Missing MSRP Cap from URL Content

Open nagendrakumar02 opened this issue 1 year ago • 2 comments

I am using r.jina.ai to read the contents of the URL and I'm encountering an issue while trying to convert the contents of the URL to markdown format. It appears that some sections are missing from the conversion, particularly the MSRP cap details from this URL. Could you please look into this?

URL: https://portal.ct.gov/deep/air/mobile-sources/cheapr/cheapr---new-eligible-vehicles Missing content (especially MSRP cap) image

nagendrakumar02 avatar Sep 23 '24 21:09 nagendrakumar02

Hi. This should be our default transformer @mozilla/readability not smart enough and removing your desired content.

Please try the other mode which does not tend to remove things:

curl https://r.jina.ai/https://portal.ct.gov/deep/air/mobile-sources/cheapr/cheapr---new-eligible-vehicles -H 'x-return-format: markdown'

nomagick avatar Sep 25 '24 01:09 nomagick

Hi. This should be our default transformer @mozilla/readability not smart enough and removing your desired content.

Please try the other mode which does not tend to remove things:

curl https://r.jina.ai/https://portal.ct.gov/deep/air/mobile-sources/cheapr/cheapr---new-eligible-vehicles -H 'x-return-format: markdown'

Thanks! that worked! However running this gives me different results each time. Do you know if I can self-host this in AWS?

nagendrakumar02 avatar Sep 25 '24 16:09 nagendrakumar02