Miguel Sozinho Ramalho

Results 42 issues of Miguel Sozinho Ramalho

**Description** > TLDR: I (and surely others) cannot use unfurl as a Python library, but refactoring core.py would fix that. The current [core.py](https://github.com/obsidianforensics/unfurl/blob/main/unfurl/core.py) contains both the logic needed to do...

Bug
Enhancement

Hi all, I've been experimenting with making an AWS Lambda function for browsertrix-crawler. I've gone some distance but hit a snag that the maintainers are probably better equipped to...

The [automated profile creation](https://github.com/webrecorder/browsertrix-crawler/issues/449) feature is super useful but, as I understand it, [the current implementation](https://github.com/webrecorder/browsertrix-crawler/blob/c3b98e5047ea219336883b0b1969da425fc43456/create-login-profile.js#L224-L236) is somewhat limited in terms of the expected login form names. For instance, it does...

Applies to telethon_archiver and tbot_archiver: the logic in https://github.com/bellingcat/auto-archiver/blob/601572d76e19d24ce1c3e8881b15d23a4bea987e/src/auto_archiver/archivers/telethon_archiver.py#L102-L105 (for telethon) is not called every time, so ghost .session files are left behind.

bug
archiver

Add a feature to the screenshot_enricher, which can be enabled or disabled, that captures a full-page PDF of the page in addition to the screenshot itself.

enhancement
enricher
nice to have

As it stands, the logic that handles the `configs()` method of each Step (like Archivers, Enrichers, ...) does not cast int/float/bool values and uses only strings. Likewise it's proven quite useful...

enhancement
nice to have
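A minimal sketch of the kind of casting this issue asks for, assuming config values arrive as plain strings. The helper name `cast_config_value` and the sample keys are hypothetical and are not part of auto-archiver:

```python
def cast_config_value(raw: str):
    """Best-effort cast of a string config value to bool, int, or float.

    Hypothetical helper for illustration; not part of auto-archiver.
    Falls back to the original string when no cast applies.
    """
    text = raw.strip()
    lowered = text.lower()
    # Booleans first, so "true"/"false" are not left as strings.
    if lowered in ("true", "false"):
        return lowered == "true"
    # Try int before float so "30" stays an int rather than 30.0.
    for caster in (int, float):
        try:
            return caster(text)
        except ValueError:
            continue
    return raw


# Example values as they might arrive from a string-only config source.
configs = {"timeout": "30", "ratio": "0.5", "enabled": "True", "name": "screenshot"}
typed = {key: cast_config_value(value) for key, value in configs.items()}
```

One caveat with this approach: strings such as "nan" or "inf" are valid input to `float()`, so a stricter version might cast based on a declared type per option rather than guessing from the value.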

Since the wacz_enricher can sometimes produce hundreds of media items, it does not scale well. One alternative is to dynamically fetch those using JS in the generated HTML file, this...

enhancement
help wanted
blocked

There's a cool new tool, https://github.com/davidnewschool/sound-delay, which should be tested with some examples and, if proven reliable, can be added as an enricher. The idea is if a video/audio is...

enhancement
help wanted
enricher

Follow-up for https://github.com/bellingcat/auto-archiver/pull/93/files

enhancement
nice to have

Some results include things like `&amp;` instead of `&`, which I believe is due to the scraping mechanism.
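If the cause is indeed HTML entities left undecoded by the scraper, one possible fix is to unescape the scraped text before returning it. The sketch below uses Python's standard `html.unescape`. The function name `clean_snippet` and the sample string are assumptions for illustration, not code from the affected project:

```python
import html


def clean_snippet(text: str) -> str:
    """Decode HTML entities such as &amp; or &lt; left over from scraping.

    Hypothetical helper for illustration only.
    """
    return html.unescape(text)


# A scraped result that still carries an undecoded entity.
cleaned = clean_snippet("Tom &amp; Jerry")
```

`html.unescape` makes a single pass, so text that was escaped twice (for example `&amp;amp;`) would need a second call.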