slurp
Slurps webpages and saves them as clean, uncluttered Markdown. Think Pocket, but better.
This issue is the place to report sites where the "author" field wasn't populated when it should have been.
This issue is the place to report sites where the content wasn't slurped properly.
- [ ] Reddit: No relevant content slurped at all
- [ ] HackerNews: Most comments...
This issue is the place to report sites where the "date" field wasn't populated when it should have been.
Gists (and likely GitHub Markdown previews, e.g. README.md) include anchors which use CSS to load an icon. These end up in Markdown like so:
```
## Some Heading [](https://.../...#some-heading)
```
...
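One possible cleanup for the empty anchor links described above is sketched here in Python. This is only an illustration, not Slurp's actual implementation: the function name `strip_empty_anchor_links` and the regular expression are assumptions, and the sketch only touches heading lines so that empty-text links elsewhere are left alone.

```python
import re

# Hypothetical pattern: a Markdown link with empty text, such as
# "[](https://example.com/page#some-heading)", plus any whitespace before it.
EMPTY_LINK = re.compile(r"\s*\[\]\([^)\s]*\)")


def strip_empty_anchor_links(markdown: str) -> str:
    """Remove empty-text links from heading lines only (illustrative sketch)."""
    cleaned = []
    for line in markdown.splitlines():
        # Only ATX headings ("#", "##", ...) are rewritten.
        if line.lstrip().startswith("#"):
            line = EMPTY_LINK.sub("", line).rstrip()
        cleaned.append(line)
    return "\n".join(cleaned)
```

Restricting the substitution to headings is a conservative choice; whether body text should be cleaned the same way is an open question for the issue.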
Continuing indirectly from #18: How about something like cleanup tasks for specific properties? Right now, we have `s|...{s}` to do simple replacements, but for instance, I'd like to add a rule...
It should fall back to HTTP if necessary, e.g. for https://people.math.harvard.edu/~ctm/home/text/others/shannon/entropy/entropy.pdf
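A minimal sketch of the requested fallback, written in Python purely for illustration. Both `fetch_with_http_fallback` and the injected `fetch` callable are hypothetical names, not part of Slurp; the sketch assumes that a failed HTTPS request surfaces as an `OSError` (which covers `urllib.error.URLError` and `ssl.SSLError` in the standard library).

```python
from typing import Callable
from urllib.parse import urlsplit, urlunsplit


def fetch_with_http_fallback(url: str, fetch: Callable[[str], bytes]) -> bytes:
    """Try the URL as given; on failure, retry an https URL over plain http."""
    try:
        return fetch(url)
    except OSError:
        parts = urlsplit(url)
        # Only https URLs have an http counterpart to fall back to.
        if parts.scheme != "https":
            raise
        return fetch(urlunsplit(parts._replace(scheme="http")))
```

With the standard library, `fetch` could be something like `lambda u: urllib.request.urlopen(u, timeout=10).read()`. Passing the fetcher in keeps the fallback logic testable without network access.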
"Abstract" should be displayed as a heading rather than the first word of the quoted abstract. eg: ```markdown > Abstract:Language models trained on internet-scale... ``` should be ```markdown ## Abstract...
* Byline ends up populated with the submission date
* The submission date isn't captured but is present in the body in a predictable format
The abstract ends up in the excerpt field, but the body is almost nothing but the abstract. Automagically embedding the PDF would probably be worthwhile. This should be an optional behaviour...
The `view email` link is wrapped with square brackets, causing Obsidian to write out `[view email](...)` in full and making that a link which goes nowhere. eg: ```markdown ## Submission...