PDF support in AI Chat
PDFs can be weird, in that sometimes text is embedded in a way that can't be directly read. It's possible the LLM is trying to read the text contents, in which there may not be any.
As a workaround, try to extract the text out of the PDF and provide that to the AI to try.
I think it is as simple as allowing the upload of PDF files and if it is embedded not correctly that will be fine
Ahh I see, the PDF contents isn't even being sent to the AI, I understand. Will keep this open as a feature request :)
I hope you see it soon it will help alot of researchers, as mainly i will use it to analyze research papers. Consider even making it a premium feature
I’m not a developer for Cursor, I just help triage some of the issues on this repo, I totally see the benefit of the feature but I have no influence on what gets coded first! :)
I've tried various ways to include a PDF in the chat, but it does not appear to work currently.
It would be a very useful feature to have.
Extremely useful for researchers i dont understand how they still dont have this feature
I think it is heavy computation to read PDF, but a workaround is to screenshot the pieces. It also make it focus more on the relevant content.
This thread is a bit old, but I've used online PDF-to-Markdown converters and saved the Markdown file in the project - it can read that just fine. Inconvenient, but it gets the job done.
The new Mistral OCR release would be the way to do this:
https://mistral.ai/news/mistral-ocr
Until this is implemented in Cursor, you can convert your PDFs yourself. I wrote a little tool bib4llm that converts PDFs into Markdown + PNGs. Either from a directory of PDF files or a Zotero BibTex file. Its currently using PyMuPDF4LLM for the conversion and leaves any RAG / indexing to Cursor. Could be easily updated to use any LLM based text extraction like llamaparse or mistral-ocr.
I just drop a bib file synced with my Zotero collection (BetterBibTex + keep bib file updated during export) into my project folder, run bib4llm on it and start chatting with the papers.
How about use docs feature, will it work?
Tried to use Cursor AI to read PDF files from JFK files and give me results, Cursor AI propose python script to read PDF files only.
Would pay extra for this
is there any MCP to achieve this feature ?
I am still having difficulty with this. Why is this so difficult to implement? It would be great to just be able to reference PDF documents in the chat. I do not want to use OCR or MCP servers. Seems like a lot of users are asking for this.
This would be a very powerful feature, allow developers and researchers to really automate and speed up work flow. Please add them!
yes please!
Need this feature badly :(
Same question, I need this feature ...
I really need this feature.
Maybe there's a way to port PDF -> Markdown or HTML while keeping scientific formulas?
I would also love this feature, particularly for scientific/analysis/parsing workflows and to give Agents more context.
This feature is really needed. Manually converting pdfs to markdown or using extra tools/MCP is just too time-consuming.