BASC-Archiver
Python-based Imageboard (4chan) complete thread archiver.
Any plans to do this, or is someone working on it? It would be really nice to be able to download threads from Desustorage in some cases.
```
Collecting basc-archiver
  Using cached BASC-Archiver-0.9.1.tar.gz
    Complete output from command python setup.py egg_info:
    Traceback (most recent call last):
      File "<string>", line 20, in <module>
      File "C:\Users\Dudu\AppData\Local\Temp\pip-build-oqsbf0f3\basc-archiver\setup.py", line 17, in <module>
        long_description =...
```
I routinely run a dupe check, which once freed up as much as 9 GB, and it is weird that the archiver can't detect those duplicates itself.
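For reference, a dupe check like the one described could be sketched roughly as below. This is only an illustration, not how the archiver works: `find_duplicate_files` is a hypothetical helper, and it assumes that "duplicate" means byte-identical content, compared by SHA-256 digest.

```python
import hashlib
import os
from collections import defaultdict


def find_duplicate_files(root_dir, chunk_size=1 << 20):
    """Hypothetical helper: group files under root_dir by content hash.

    Returns a list of groups (sorted lists of paths), one group per set
    of byte-identical files. Files with unique content are left out.
    """
    by_digest = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root_dir):
        for name in filenames:
            path = os.path.join(dirpath, name)
            digest = hashlib.sha256()
            # Read in chunks so large images/webms are not loaded whole.
            with open(path, 'rb') as handle:
                for chunk in iter(lambda: handle.read(chunk_size), b''):
                    digest.update(chunk)
            by_digest[digest.hexdigest()].append(path)
    return [sorted(paths) for paths in by_digest.values() if len(paths) > 1]
```

Hashing every file is slow on a big archive; grouping by file size first and only hashing files that share a size would cut the work down, at the cost of a little more code.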
(Image: "not okay") I don't think many changes are needed (talking out of my ass). Just change the directory of the css files on the html to \board\css instead of thread\css,...
Something to just grab those without creating any other files. Use of the html is very rare (if you're just saving things for yourself) so thumbs, css, js, json are...
In `fourchan.py`, there happen to be these notes in line 246:
```
# TODO: extend BASC-py4chan to give us this number directly
self.threads[thread_id]['total_files'] = len(list(thread['thread'].filenames()))
```
And line 255: ...
We should probably create a JSON manager that loads and saves the thread json file, so we can handle it like *Fuuka does their thread grabbing. For example, being able...
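A rough sketch of what such a JSON manager might look like, purely as a starting point. `ThreadJsonManager` and its methods are made-up names, not existing archiver code, and the sketch assumes Python 3.3+ (for `os.replace`) and one JSON file per thread.

```python
import json
import os
import tempfile


class ThreadJsonManager(object):
    """Hypothetical helper that loads and saves one thread's JSON file."""

    def __init__(self, path):
        self.path = path
        self.data = {}

    def load(self):
        """Read the file if it exists; otherwise start from an empty dict."""
        if os.path.exists(self.path):
            with open(self.path, 'r') as handle:
                self.data = json.load(handle)
        else:
            self.data = {}
        return self.data

    def save(self):
        """Write to a temp file, then rename it over the target.

        The rename means a crash mid-write should not leave a
        half-written thread file behind.
        """
        directory = os.path.dirname(self.path) or '.'
        fd, tmp_path = tempfile.mkstemp(dir=directory, suffix='.tmp')
        with os.fdopen(fd, 'w') as handle:
            json.dump(self.data, handle, indent=2, sort_keys=True)
        os.replace(tmp_path, self.path)
```

Usage would be something like `manager = ThreadJsonManager('thread.json')`, then `manager.load()`, edit `manager.data`, and `manager.save()`.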
Option to use the original filenames for files when writing threads out. Need to make sure we modify the images/thumbnails, and the names we write out into the html file....
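One thing to watch with original filenames is that they can collide or contain characters that are unsafe on disk. A minimal sketch of how names could be picked, assuming a hypothetical `safe_original_filename` helper and a per-thread set of names already used (neither exists in the archiver today):

```python
import os
import re


def safe_original_filename(original_name, used_names):
    """Hypothetical helper: turn an uploader-supplied name into a safe,
    unique filename for one thread directory.

    used_names is a set of lowercased names already taken; it is updated
    in place so repeated calls never return the same name twice.
    """
    # Drop any directory components the uploader may have included.
    base = os.path.basename(original_name.replace('\\', '/'))
    # Replace anything outside a conservative whitelist.
    base = re.sub(r'[^A-Za-z0-9._ -]', '_', base).strip(' .') or 'file'
    stem, ext = os.path.splitext(base)
    candidate = base
    counter = 1
    # Compare case-insensitively, since some filesystems ignore case.
    while candidate.lower() in used_names:
        candidate = '{}_{}{}'.format(stem, counter, ext)
        counter += 1
    used_names.add(candidate.lower())
    return candidate
```

The same returned name would have to be used for the image, its thumbnail, and the links written into the html, so everything stays in sync.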
Just some crashes/errors I've been running into.
```
Exception in thread Thread-3:
Traceback (most recent call last):
  File "/usr/local/Cellar/python/2.7.10_2/Frameworks/Python.framework/Versions/2.7/lib/python2.7/threading.py", line 810, in __bootstrap_inner
    self.run()
  File "/basc-archiver/basc_archiver/sites/base.py", line 62, in run
...
```