wikiextractor

section arg not recognized

Open tgalery opened this issue 3 years ago • 1 comment

Passing the -s flag to the extractor is rejected as invalid, and it does not appear in the usage message. Any pointers?

usage: WikiExtractor.py [-h] [-o OUTPUT] [-b n[KMG]] [-c] [--json] [--html] [-l] [-ns ns1,ns2]
                        [--templates TEMPLATES] [--no-templates] [--html-safe HTML_SAFE]
                        [--processes PROCESSES] [-q] [--debug] [-a] [-v]
                        input
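
For readers wondering why -s is rejected outright: the usage format suggests the script builds its command line with Python's argparse, which refuses any flag that was never declared. The sketch below is a hypothetical stand-in parser that declares only a few of the flags listed above; it is not the actual WikiExtractor code, and the helper names build_parser and accepts are made up for illustration.

```python
import argparse


def build_parser():
    # Hypothetical parser: declares only a handful of the flags shown in the
    # usage message above. It is NOT the real WikiExtractor parser.
    parser = argparse.ArgumentParser(prog="WikiExtractor.py")
    parser.add_argument("input")
    parser.add_argument("-o", "--output")
    parser.add_argument("--json", action="store_true")
    parser.add_argument("-q", action="store_true")
    return parser


def accepts(argv):
    """Return True if argv parses cleanly, False if argparse rejects it."""
    try:
        build_parser().parse_args(argv)
        return True
    except SystemExit:
        # argparse prints the usage line and exits on undeclared flags.
        return False


print(accepts(["--json", "dump.xml"]))  # declared flag
print(accepts(["-s", "dump.xml"]))      # undeclared flag
```

Because no -s option is declared, the second call fails in the same way as the report above: argparse prints the usage line and exits.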

tgalery May 03 '21 21:05

My impression is that preserving sections is now the default behavior, so the flag is no longer needed. keepSections is set to True as a class attribute of Extractor, and there does not appear to be any place in the code that provides an option to change it.

class Extractor():
  """
  An extraction task on a article.
  """
  ##
  # Whether to preserve links in output
  keepLinks = False

  ##
  # Whether to preserve section titles
  keepSections = True
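
Since keepSections is a plain class attribute, one possible workaround is to override it from Python before any Extractor is created. The sketch below uses a hypothetical stand-in class that mirrors only the two attributes quoted above; the render method and the render_section helper are invented for illustration. Whether the same override has the intended effect in the real library, especially when it runs multiple processes, is an assumption I have not verified.

```python
class Extractor:
    """Hypothetical stand-in mirroring the two class attributes quoted above."""

    keepLinks = False
    keepSections = True

    def render(self, title, body):
        # Illustrative only: emit the section title when keepSections is on.
        if self.keepSections:
            return f"{title}\n{body}"
        return body


def render_section(title, body):
    # Each call creates a fresh instance, which reads the class-level default.
    return Extractor().render(title, body)


with_titles = render_section("History", "Some text.")

# Override the class attribute; every instance created afterwards sees it.
Extractor.keepSections = False
without_titles = render_section("History", "Some text.")

print(repr(with_titles))
print(repr(without_titles))
```

The override works here because instances look up keepSections on the class when they have no attribute of their own, so no constructor argument or command-line flag is involved.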

bryanjohns Nov 06 '21 00:11