自动去除重复标题(适用于多个相同订阅源的情况)
Clear and concise description of the problem
在订阅多个来自同一网站的源时,可能会连续收到内容和标题都相同的文章,导致重复内容占用列表空间,影响阅读体验。建议增加一个自动检测并且自动去除重复,仅保留一个的功能
Suggested solution
- 自动检测文章列表中重复的标题,并去除重复项,仅保留一个副本
- 提供可选的功能开关,允许用户启用或禁用此自动去重功能
Alternative
No response
Additional context
No response
Validations
- [X] Check that there isn't already an issue that request the same feature to avoid creating a duplicate.
我也遇到了这个情况,因为我博客的目录更改了,导致rss的链接也更改了,这样在rss的订阅中,会出现多个同名的文章,重建数据库后也没有用
版本: App Version: 0.0.1-alpha.21 OS: Windows User Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Follow/0.0.1-alpha.21 Chrome/128.0.6613.162 Electron/32.1.2 Safari/537.36 Env: electron Browser: Chrome
Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑🤝🧑👫🧑🏿🤝🧑🏻👩🏾🤝👨🏿👬🏿
I also encountered this situation, because the directory of my blog was changed, which caused the rss link to be changed. In this way, multiple articles with the same name will appear in the rss subscription, and it is useless after rebuilding the database.
Version: App Version: 0.0.1-alpha.21 OS: Windows User Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Follow/0.0.1-alpha.21 Chrome/128.0.6613.162 Electron/32.1.2 Safari/537.36 Env: electron Browser: Chrome
duplicated #1125
你能描述一下「订阅多个来自同一网站的源」这个场景吗,我想知道在什么情况会遇到这个问题
Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑🤝🧑👫🧑🏿🤝🧑🏻👩🏾🤝👨🏿👬🏿
Can you describe the scenario of "subscribing to multiple sources from the same website"? I want to know under what circumstances I will encounter this problem
你能描述一下「订阅多个来自同一网站的源」这个场景吗,我想知道在什么情况会遇到这个问题
比如说,我订阅了《明报》的多个不同类别
https://rsshub.app/mingpao/ins/s00001 港聞 https://rsshub.app/mingpao/ins/s00002 經濟 https://rsshub.app/mingpao/ins/s00003 地產 https://rsshub.app/mingpao/ins/s00004 兩岸 https://rsshub.app/mingpao/ins/s00005 國際
他们可能同时在多个类别推送完全相同的文章,所以条目列表就会出现好几篇封面、标题、正文一模一样的文章排列。
但是,他们也有每个类别独占的文章。既不想错过这些独占的内容,又不希望被一模一样的文章刷屏。这时去除重复内容的作用出来了。
Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑🤝🧑👫🧑🏿🤝🧑🏻👩🏾🤝👨🏿👬🏿
Can you describe the scenario of "subscribing to multiple sources from the same website"? I would like to know under what circumstances I would encounter this problem
For example, I subscribe to several different categories of Ming Pao
https://rsshub.app/mingpao/ins/s00001 Hong Kong News https://rsshub.app/mingpao/ins/s00002 Economy https://rsshub.app/mingpao/ins/s00003 Real Estate https://rsshub.app/mingpao/ins/s00004 Cross-Strait https://rsshub.app/mingpao/ins/s00005 International
They may push the exact same article in multiple categories at the same time, so there will be several articles with the same cover, title, and text in the entry list.
However, they also have articles exclusive to each category. I don’t want to miss these exclusive contents, but I also don’t want to be flooded with the same articles. This is when the role of removing duplicate content comes into play.