Python-Scripts icon indicating copy to clipboard operation
Python-Scripts copied to clipboard

Adding "PDF Organizer" Script

Open Retsamic opened this issue 1 year ago • 1 comments

Summary

This pull request introduces a new utility script, "PDF Organizer," designed to automate the organization and management of PDF files within a specified directory. By analyzing PDF metadata, the script efficiently categorizes PDFs into subfolders, renames files for uniformity, and generates a summary report of the process.

Features

  • Metadata Extraction: Utilizes PyPDF2 to extract metadata (title, author) from PDFs.
  • Dynamic Organization: Automatically sorts PDFs into author-named subfolders.
  • Standardized Renaming: Renames PDF files to a consistent "Author - Title.pdf" format.
  • Summary Reporting: Generates a report detailing the organization outcome, including counts of PDFs per category.

Motivation

Managing a large collection of PDF files can be tedious, especially when dealing with academic papers, legal documents, or extensive reports. This script aims to simplify file management, making it easier for users to locate and organize their PDFs based on content relevance.

How to Use

  1. Ensure PyPDF2 is installed: pip install PyPDF2
  2. Modify the source_folder and report_path in the script to match your directory structure.
  3. Run the script: python pdf_organizer.py

Future Enhancements

  • Support for additional metadata fields (e.g., subject, keywords) for finer categorization.
  • Interactive mode allowing users to select organization criteria directly.
  • GUI for ease of use by non-technical users.

I believe this script will be a valuable addition to the repository, offering a practical solution to a common problem faced by many. I look forward to feedback and any suggestions for improvement.

Retsamic avatar Feb 10 '24 00:02 Retsamic

@Retsamic please update the main Readme.md as well

DhanushNehru avatar Apr 20 '24 13:04 DhanushNehru