zoplicate icon indicating copy to clipboard operation
zoplicate copied to clipboard

Request to add the function of detecting and deleting duplicate PDFs

Open hasibagen opened this issue 1 year ago • 12 comments

Because the merged PDFs will exist at the same time, there may be many documents with the same duplicate PDFs. Can you add the function to detect these PDFs and delete them?

hasibagen avatar May 29 '24 03:05 hasibagen

Good idea! Thanks, @hasibagen, I will add this function later.

ChenglongMa avatar May 29 '24 03:05 ChenglongMa

Good idea! Thanks, @hasibagen, I will add this function later.

Really looking forward to this function!

hasibagen avatar May 29 '24 08:05 hasibagen

目前这个功能有完善吗?一方面是删除多个重复的pdf,一方面想要用正式出版的pdf去替换accept的pdf,同时将早前的pdf笔记移植到新的正式出版的pdf上,这样可能吗

fredericky123 avatar Jul 23 '24 09:07 fredericky123

或者pdf的优先级可以这样设定,保留有注释的版本;保留有卷号,期号的版本;保留最新的版本;保留最旧的版本

fredericky123 avatar Jul 23 '24 10:07 fredericky123

不好意思,最近有点忙。不过我在一点点实现这个功能,主要除了这个功能本身还要考虑和其他插件的兼容性。我会尽快更新的,多谢你的关注和宝贵意见!

ChenglongMa avatar Jul 23 '24 10:07 ChenglongMa

感谢感谢,期待!

fredericky123 avatar Jul 23 '24 10:07 fredericky123

Hi @hasibagen and @fredericky123,

Thank you for your valuable suggestions. I'm implementing this function, but I may need your help.

The built-in merge function in Zotero will remove duplicate PDFs only when they:

  1. have exactly the same content;

  2. have the same content type, e.g., application/pdf;

  3. have the same link mode, e.g., both are imported or linked, like this:

    Snipaste_2024-08-09_09-43-30

These criteria are so strict that some duplicate files cannot be recognized.

Here I sincerely ask for more help from you:

  1. @hasibagen Could you give me some examples of exceptions where they are duplicates but do not meet the above criteria?
  2. @fredericky123 您提到的笔记移植和优先级的设置非常好!不过如何区分"正式版本"和"accept的版本"?是指arxiv中preprint的版本吗?如果方便的话能否提供一个例子?

Thank you so much for your feedback and support!

Chenglong

ChenglongMa avatar Aug 08 '24 23:08 ChenglongMa

这是同一篇文章的接收版本和in print版本 Um et al_2021_Acad Manage J_The downside of CFO function-based language incongruity.pdf Uploading um-et-al-2022-the-downside-of-cfo-function-based-language-incongruity.pdf…

fredericky123 avatar Aug 09 '24 00:08 fredericky123

这是同一篇文章的接收版本和in print版本 Um et al_2021_Acad Manage J_The downside of CFO function-based language incongruity.pdf Uploading um-et-al-2022-the-downside-of-cfo-function-based-language-incongruity.pdf…

非常感谢!我对比一下

ChenglongMa avatar Aug 09 '24 00:08 ChenglongMa

@fredericky123,不好意思,您第二个链接打不开

ChenglongMa avatar Aug 09 '24 00:08 ChenglongMa

@fredericky123 收到了,非常感谢!

ChenglongMa avatar Aug 09 '24 00:08 ChenglongMa