Jan-Petter Gundersen

Results 9 issues of Jan-Petter Gundersen

There's an encoding issue in the HTML provider. `$embed->getProviders()['html']->getTitle()` Returns: `Tilbehør for brytere =` Expected: `Tilbehør for brytere =

[Misc::fix_protocol](https://github.com/simplepie/simplepie/blob/master/library/SimplePie/Misc.php#L194) is returning an invalidly encoded URL. The root issue is probably that [IRI::parse_iri](https://github.com/simplepie/simplepie/blob/master/library/SimplePie/IRI.php#L361) is being asked to parse a protocol-less URL, while it expects perfectly valid URLs only. Example: (links...

Hi, I love Beyondpad, but I really miss better browser support on phones (or even the app for Android). Is it possible to set a default / fallback page width (or...

enhancement

When parsing large robots.txt files, the process can take several minutes to finish, even with 100% CPU power dedicated. This is a problem in general, and is not related to...

enhancement

``` $t1gor = new t1gor\RobotsTxtParser\RobotsTxtParser(

bug

No errors/warnings should be generated when parsing, still I get these: ``` mb_internal_encoding(): Unknown encoding "OSF10020402" // valid, but not installed mb_internal_encoding(): Unknown encoding "UTF9" // invalid mb_internal_encoding(): Unknown encoding...

bug
enhancement
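One way to avoid the `mb_internal_encoding()` warnings quoted above is to check the encoding name against what mbstring reports as supported before applying it. The following is only a minimal sketch of that idea: `setInternalEncodingSafely` and its UTF-8 fallback are hypothetical and not part of the parser. It also assumes the mbstring extension is loaded, and it only recognises canonical names from `mb_list_encodings()`, so aliases fall back as well.

```php
<?php
// Hypothetical helper for illustration; not the parser's actual API.
function setInternalEncodingSafely(string $encoding, string $fallback = 'UTF-8'): string
{
    // Lower-case list of the canonical encoding names mbstring supports.
    $supported = array_map('strtolower', mb_list_encodings());

    // Use the requested encoding only if mbstring knows it; otherwise fall back.
    $chosen = in_array(strtolower($encoding), $supported, true) ? $encoding : $fallback;

    // This call is made only with a name taken from the supported list
    // (or with the fallback, which is assumed to be valid).
    mb_internal_encoding($chosen);

    return $chosen;
}

// "UTF9" is not a real encoding, so the fallback is applied without a warning.
$applied = setInternalEncodingSafely('UTF9');
```

Whether a silent fallback is acceptable, or whether the unknown encoding should be reported to the caller, is a design decision this sketch does not settle.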

**Feature request:** Limit the maximum number of bytes to parse. > A maximum file size may be enforced per crawler. Content which is after the maximum file size may be...

enhancement
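A byte limit like the one requested above could be applied before the content reaches the parser. This is a rough sketch under stated assumptions: `truncateToByteLimit` is a hypothetical helper, not an existing library function, and dropping the trailing partial line is one possible policy rather than something the request specifies.

```php
<?php
// Hypothetical helper for illustration; not an existing library function.
function truncateToByteLimit(string $content, int $maxBytes): string
{
    if ($maxBytes < 0) {
        throw new InvalidArgumentException('maxBytes must not be negative');
    }

    // strlen() and substr() operate on bytes, not characters.
    if (strlen($content) <= $maxBytes) {
        return $content;
    }

    $truncated = substr($content, 0, $maxBytes);

    // Assumption: keep only whole lines, so a rule is never cut in half.
    // Everything after the last newline inside the limit is discarded.
    $lastNewline = strrpos($truncated, "\n");

    return $lastNewline === false ? '' : substr($truncated, 0, $lastNewline + 1);
}

$robots = "User-agent: *\nDisallow: /private\nDisallow: /tmp\n";
$limited = truncateToByteLimit($robots, 20);
```

With a 20-byte limit, only the first complete line of `$robots` survives, because the cut falls in the middle of the second line.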

```
User-agent: *
Disallow: /
```
```
$this->assertTrue($parser->isDisallowed("&&1@|"));
$this->assertFalse($parser->isAllowed('+£€@@1¤'));
```
The two tests above fail: the parser reports both paths as allowed, although the robots.txt says otherwise... At least the second one...

bug
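Both test paths in the report above lack a leading slash. One possible reading (an assumption, not a confirmed diagnosis) is that such paths never match the `/` prefix. The sketch below normalises the path before doing a plain prefix match. `isPathDisallowed` is hypothetical, and it deliberately ignores `Allow` rules, wildcards, and percent-encoding.

```php
<?php
// Hypothetical helper for illustration; not the parser's actual API.
function isPathDisallowed(string $path, array $disallowPrefixes): bool
{
    // Assumption: a path without a leading slash is relative to the site root.
    if ($path === '' || $path[0] !== '/') {
        $path = '/' . $path;
    }

    foreach ($disallowPrefixes as $prefix) {
        // An empty Disallow value blocks nothing, so skip it.
        if ($prefix !== '' && strncmp($path, $prefix, strlen($prefix)) === 0) {
            return true;
        }
    }

    return false;
}

// With "Disallow: /", every normalised path starts with the blocked prefix.
$first = isPathDisallowed('&&1@|', ['/']);
$second = isPathDisallowed('+£€@@1¤', ['/']);
```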

Thanks for this great addon. One of my automations failed a few days ago, and I ended up using way more power at the cheapest hour than intended. My target...

enhancement