Django-link-archive
Django-link-archive copied to clipboard
Self-hostable link archive
Crawlee uses file system to store some data. This might lead to errors if multiple processes use storage / crawlee os.environ["CRAWLEE_STORAGE_DIR"] = "./storage/{}".format(os.getpid()) I changed it so new instance of...
https://github.com/neonwatty/ytdatakit - It is convenient to call yt-dlp as process (interface is stable, well defined) - might be slower - we could test speed of said solution
User passes PageOptions.ssl_verify (which is default). We want the configuration to also set ssl_verify, if user does not set it manually.