LinguaCafe icon indicating copy to clipboard operation
LinguaCafe copied to clipboard

More sources for importing.

Open simjanos-dev opened this issue 1 year ago • 8 comments

Add support for imports from:

  • [x] plain text
  • [x] text file
  • [x] youtube
  • [X] jellyfin subtitle
  • [x] subtitle files
  • [x] website
  • [ ] RSS feed
  • [ ] PDF files
  • [ ] mpv player
  • [ ] manga
  • [ ] OPDS #171

If you have an other source you would like to use, please make a request here, and I'll add to the list if it's possible.

simjanos-dev avatar Jan 10 '24 16:01 simjanos-dev

Maybe RSS feed ?

TLaborde avatar Jan 15 '24 11:01 TLaborde

I've looked into it a while ago, then forgot about it. Apparently things pushed onto an RSS feed are also copyrighted.

If I can have no legal problems from it, I would be happy to add it. If someone knows more about this, I would love some information.

simjanos-dev avatar Jan 15 '24 11:01 simjanos-dev

From what I can read here andthere, if you are consuming the content of the RSS feed for yourself, and not republishing it, it's fine, it's how all RSS feed reader application (self hosted or SaaS) works after all. If you plan to allow instances where users can share sources however, it could be tricky (is user A has a feed requiring authentication, user B shouldn't access it). If you don't plan to allow shared sources for multi user setup then it should be fine (i'm not a not a lawyer).

TLaborde avatar Jan 15 '24 14:01 TLaborde

I think I you are right, I can add it. Do you mean only the data from the RSS feed, or importing the webpage it points to?

Edit: I've found some webpage parser tools, so it should be doable to add the content of the RSS links as well.

simjanos-dev avatar Jan 16 '24 00:01 simjanos-dev

I have found a library, and it works. I've tested it on 3 languages with a few articles, it seems to work very well. I will add full RSS support, with option to auto import the content of the links as well, not just headlines. I will also add a separate option, to import a single website with the same method.

These are the supported languages:


  ar              Arabic
  be              Belarusian
  bg              Bulgarian
  da              Danish
  de              German
  el              Greek
  en              English
  es              Spanish
  et              Estonian
  fa              Persian
  fi              Finnish
  fr              French
  he              Hebrew
  hi              Hindi
  hr              Croatian
  hu              Hungarian
  id              Indonesian
  it              Italian
  ja              Japanese
  ko              Korean
  lt              Lithuanian
  mk              Macedonian
  nb              Norwegian (Bokmål)
  nl              Dutch
  no              Norwegian
  pl              Polish
  pt              Portuguese
  ro              Romanian
  ru              Russian
  sl              Slovenian
  sr              Serbian
  sv              Swedish
  sw              Swahili
  th              Thai
  tr              Turkish
  uk              Ukrainian
  vi              Vietnamese
  zh              Chinese

simjanos-dev avatar Jan 18 '24 21:01 simjanos-dev

Join

Mpay57 avatar Mar 01 '24 16:03 Mpay57

@Mpay57 Sorry, I don't understand what you mean.

simjanos-dev avatar Mar 01 '24 17:03 simjanos-dev

I'm not sure if this would be beyond the scope of this issue, but adding support for Markdown imports would be awesome.

Not as a regular text import though. I'm thinking more along the lines of rendering the markdown or at least a subset of Markdown features like headings, bold, italics, block quotes, and lists.

cblanken avatar Aug 19 '24 02:08 cblanken