plugins icon indicating copy to clipboard operation
plugins copied to clipboard

[import_feed] Path/URL bugs in import_feed plugin

Open aigarius opened this issue 4 years ago • 3 comments

  1. Redirections are created before the page import happens, so the redirection map is always empty and no redirects are written into the config file

  2. Folder structure of the posts is not re-created, so everything is dumped in the same folder (requiring redirections)

  3. Redirections construct a wrong destination URL if the SITE_URL from the feed contains path elements (the path part gets included twice)

IMHO, when importing a RSS with SITE_URL set to "http://example.com/blog/" and posts like "http://example.com/blog/2011/04/03/testing-this-engine/" the output file should be "posts/2011/04/03/testing-this-engine/indext.html" with no redirect required. With SITE_URL set to "http://example.com/" and BASE_URL set to "http://example.com/blog/"

If slugifying each path element of the target name produces a different outcome, only then a redirect should be created from "2011/04/03/testing-utf-8-in-title-ā/" to "2011/04/03/testing-utf-8-in-title-a/index.html" and file be created as "posts/2011/04/03/testing-utf-8-in-title-a/index.html"

aigarius avatar Jun 27 '21 13:06 aigarius

For reference I was trying to import this feed

aigarius avatar Jun 27 '21 13:06 aigarius

Yeah, that's really buggy :-(

ralsina avatar Jun 27 '21 15:06 ralsina

Pushed some code that worked for my case, not 100% sure it will work in all use cases. But given the above feed and specifying "blog" as the new top folder option, I got very reasonable output in both files and URLs (no redirects required).

aigarius avatar Jun 27 '21 16:06 aigarius