CERMINE
no bibtex output in v1.13
Hi, when I download the latest version (1.13) from the repository, there seems to be no bibtex output option:
Unknown output types: [bibtex]
Usage: ContentExtractor -path <path> [optional parameters]
Tool for extracting metadata and content from PDF files.
Arguments:
-path <path> path to a directory containing PDF files
-outputs <list> (optional) comma-separated list of extraction
output(s); possible values: "jats" (document
metadata and content in NLM JATS format), "text"
(raw document text), "zones" (text zones with
their labels), "trueviz" (geometric structure in
TrueViz format), "images" (images from the
document); default: "jats,images"
-exts <list> (optional) comma-separated list of extensions of the
resulting files; the list has to have the same
length as output list; default: "cermxml,images"
-override override already existing files
-timeout <seconds> (optional) approximate maximum allowed processing
time for a PDF file in seconds; by default, no
timeout is used; the value is approximate because in
some cases, the program might be allowed to slightly
exceeded this time, say by a second or two
-configuration <path> (optional) path to configuration properties file
see https://github.com/CeON/CERMINE
for description of available configuration properties
Is that intentional?
@eds-slim At the moment we do not export bibliographic references from PDFs in BibTeX. We might add this in the future, though.
@eds-slim The current official release, 1.13, does not contain this feature, but you will find it in the current snapshot: http://maven.ceon.pl/artifactory/webapp/#/artifacts/browse/simple/General/kdd-snapshots/pl/edu/icm/cermine/cermine-impl/1.14-SNAPSHOT I cannot say at the moment when the next release including the bibtex output feature will happen.
The bibtex option also works if you build the code yourself and run with the following parameters: -path your_path_to_pdfs -outputs bibtex
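For anyone scripting this, here is a minimal sketch of how that invocation could be assembled from Python. The parameters `-path` and `-outputs bibtex` come from the comment above; the main class name `pl.edu.icm.cermine.ContentExtractor` is taken from the project README as I remember it, and the jar file name is a hypothetical placeholder for whatever your own build or the snapshot download produces.

```python
import subprocess

# Assumption: main class name as given in the CERMINE README.
MAIN_CLASS = "pl.edu.icm.cermine.ContentExtractor"


def build_cermine_command(jar_path, pdf_dir, outputs):
    """Return the argv list for running ContentExtractor on a directory of PDFs.

    jar_path -- path to a CERMINE jar with dependencies (hypothetical name below)
    pdf_dir  -- directory containing the PDF files (passed as -path)
    outputs  -- list of output types, joined with commas for -outputs
    """
    if not outputs:
        raise ValueError("at least one output type is required")
    return [
        "java", "-cp", jar_path, MAIN_CLASS,
        "-path", pdf_dir,
        "-outputs", ",".join(outputs),
    ]


if __name__ == "__main__":
    # Hypothetical jar name and PDF directory; adjust both to your setup.
    cmd = build_cermine_command(
        "cermine-impl-1.14-SNAPSHOT-jar-with-dependencies.jar",
        "your_path_to_pdfs",
        ["bibtex"],
    )
    print(" ".join(cmd))
    subprocess.run(cmd, check=True)
```

With the 1.13 release jar the same command should fail with the "Unknown output types: [bibtex]" message quoted above, so it only makes sense against a snapshot or self-built jar.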