pyWhat
pyWhat copied to clipboard
URL regex does not fully match every URL
For example:

I would like it to return both google.co and google.com. Sadly, it may be impossible considering the way regular expressions work. Thus, it would be amazing to match the longest string (pywhat google.com/help should return google.com/help). It is crucial for implementing URL subcategories properly(#51). Btw, URL regex is too long, I do not think that valid TLDs should be checked, so it may be shortened.
I do not think that valid TLDs should be checked
The reason we do this is because of false positives by the way :)
sad
Reopening this since pywhat does not match something like https://www.google.com fully.
Hey, @amadejpapez, do you have any ideas about this one?
Hey, @amadejpapez, do you have any ideas about this one?
Hm will check this later today and see if I get any.