URLExtract
VCS/Git remote URLs not found
git@github.com:snowplow/snowplow-python-tracker.git is not found.
This URL appears on https://pypi.org/project/minimal-snowplow-tracker/
>>> import urlextract
>>> e = urlextract.urlextract_core.URLExtract()
>>> e.find_urls('git@github.com:snowplow/snowplow-python-tracker.git')
[]
A good list of sample VCS links can be found at https://github.com/coala/git-url-parse/blob/master/test/conftest.py
Also worth noting: pip's VCS branch syntax (@branch) produces no result when the URL has no path.
>>> e.find_urls(' git+https://git.foobar.com@branch ')
[]
>>> e.find_urls(' git+https://git.foobar.com/@foo ')
['git+https://git.foobar.com/@foo']
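One possible explanation, not confirmed against URLExtract's implementation: in generic URL syntax, anything before an @ in the authority part is userinfo, so without a path the text after @ is read as the host. The sketch below uses only the standard library's urllib.parse.urlsplit to show how the two inputs above are split; authority_parts is a hypothetical helper name used for illustration.

```python
from urllib.parse import urlsplit


def authority_parts(url):
    """Return (userinfo, host, path) as the standard library parses them.

    Hypothetical helper for illustration; it is not part of urlextract.
    """
    parts = urlsplit(url)
    return parts.username, parts.hostname, parts.path


for candidate in ("git+https://git.foobar.com@branch",
                  "git+https://git.foobar.com/@foo"):
    userinfo, host, path = authority_parts(candidate)
    print(f"{candidate}: userinfo={userinfo!r} host={host!r} path={path!r}")
```

With no path, "branch" ends up as the host and "git.foobar.com" as userinfo. If URLExtract validates the host against known TLDs, that might be why the first input returns nothing, but this is an assumption.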
Hi @jayvdb or @lipoja, I am interested in working on this issue.
So the expected result is that we return the Git URL in this case:
>>> e.find_urls('git@github.com:snowplow/snowplow-python-tracker.git')
['git@github.com:snowplow/snowplow-python-tracker.git']
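As a rough illustration of what matching such remotes could involve, here is a standalone sketch that recognizes SCP-style remotes (user@host:path) with a regular expression. It does not use or reflect urlextract's internals. The pattern, the name find_scp_like_remotes, and the github.com host in the example are assumptions made for illustration.

```python
import re

# Assumed pattern for SCP-style Git remotes such as "git@host:owner/repo.git".
SCP_LIKE_REMOTE = re.compile(
    r"(?<![\w@.-])"         # must not start in the middle of another token
    r"[\w.-]+@"             # user part, e.g. "git@"
    r"[A-Za-z0-9.-]+"       # host name
    r":(?!\d+(?:/|$))"      # colon separator that is not followed by a port
    r"[\w./~-]+"            # repository path
)


def find_scp_like_remotes(text):
    """Return every SCP-style remote found in text (hypothetical helper)."""
    return SCP_LIKE_REMOTE.findall(text)


print(find_scp_like_remotes(
    "clone git@github.com:snowplow/snowplow-python-tracker.git now"
))
```

Plain e-mail addresses are skipped because the pattern requires a colon followed by a path, and ssh:// URLs with an explicit port are left for ordinary URL handling.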
Re: https://github.com/coala/git-url-parse/blob/master/test/conftest.py, should we copy those URLs into test/unit/conftest.py?