wikiextractor
wikiextractor copied to clipboard
section arg not regonized
Passing a -s
flag to the extractor seems to be invalid. Any pointers ?
usage: WikiExtractor.py [-h] [-o OUTPUT] [-b n[KMG]] [-c] [--json] [--html] [-l] [-ns ns1,ns2]
[--templates TEMPLATES] [--no-templates] [--html-safe HTML_SAFE]
[--processes PROCESSES] [-q] [--debug] [-a] [-v]
input
My impression is that this functionality (preserving sections) is now the default behavior, so the flag is not needed. The keepSections
flag is set as True
when the Extractor
class is initialized and there does not appear to be any place in the code that provides an option to alter it.
class Extractor():
"""
An extraction task on a article.
"""
##
# Whether to preserve links in output
keepLinks = False
##
# Whether to preserve section titles
keepSections = True