John Vandenberg

Results 2068 comments of John Vandenberg

https://github.com/lipoja/URLExtract/pull/65 is a first cut of showing that negative hits are cached.

btw, dns-cache was built for https://github.com/jayvdb/pypidb , where I am also using urlextract ; the test suite is processing a huge dataset, and exposes quite a lot of potential improvements...

I've started that with https://github.com/lipoja/URLExtract/issues/68 , but those are less about the DNS aspects. To see DNS issues, actually it would be helpful to add some optional mechanism for URLExtract...

config: `e.target` is *really* common ```py DEBUG pypidb._pypi:_pypi.py:313 processing Webpage: http://docs.red-dove.com/cfg/python.html INFO urlextract:urlextract_core.py:567 Unknown exception during gethostbyname(e.target) gaierror(-2) INFO urlextract:urlextract_core.py:567 Unknown exception during gethostbyname(e.target) gaierror(-2) INFO urlextract:urlextract_core.py:567 Unknown exception during...

Sounds like a great idea, and should complement caching nicely. Also `pebble` seems like a good choice.

Another is `'` in https://docs.red-dove.com/cfg/python.html results in a 404 at https://freeotp.github.io/&#39 (the ; is omitted, but it is the `&` which causes the 404 as it doesnt follow a `?`)

It might useful to have the caller inform the parser what type of text is being provided, such as html, xml, md, rst, which give clues to the parser when...

I found the WSL2 bash.exe in System32 folder kept getting in the way; easy workaround was to copy the msys `bash.exe` over the top of the one in System32.

I am seeing the same type of problem in GitHub Actions (msys, not cygwin) it seems, at least when running `--version` https://github.com/jayvdb/py2many/commit/ac6aca501cb35065273e7dc651dff559a9dfd13f `kotlin -version`: > Kotlin version 1.4.30-release-302 (JRE 1.8.0_292-b10)...

If I install kotlin 1.5.0, `kotlin -version`: > Kotlin version 1.5.0-release-749 (JRE 1.8.0_292-b10)