ragflow icon indicating copy to clipboard operation
ragflow copied to clipboard

[Question]: Does ragflow support retrieving data from a specified URL? Or extracting data from a specified URL as a knowledge base

Open myplxdm opened this issue 10 months ago • 4 comments

Self Checks

  • [x] I have searched for existing issues search for existing issues, including closed ones.
  • [x] I confirm that I am using English to submit this report (Language Policy).
  • [x] Non-english title submitions will be closed directly ( 非英文标题的提交将会被直接关闭 ) (Language Policy).
  • [ ] Please do not modify this template :) and fill in all the required fields.

Describe your problem

Does ragflow support retrieving data from a specified URL? Or extracting data from a specified URL as a knowledge base

myplxdm avatar Mar 13 '25 10:03 myplxdm

I would like this feature, too. Retrieving contents from academic websites might be better than parsing PDFs.

Yaping-Wang avatar Mar 13 '25 13:03 Yaping-Wang

You might want to look at Tavily-based web search.

writinwaters avatar Mar 14 '25 02:03 writinwaters

@writinwaters Thank you for your reply. Indeed, Tavily is a great supplement, but for highly specialized and authoritative information, it is best to obtain it from professional websites, such as those related to law, government affairs, etc. Therefore, I hope RAGFlow can build a knowledge base through URLs.

myplxdm avatar Mar 14 '25 03:03 myplxdm

Ugh. Apologies. This feature is not on the roadmap. As a workaround, you can save the HTML files. They can be previewed during an AI chat.

writinwaters avatar Mar 14 '25 03:03 writinwaters