alvisnlp
alvisnlp copied to clipboard
ALvisNLP corpus processing engine
Bumps [org.postgresql:postgresql](https://github.com/pgjdbc/pgjdbc) from 42.3.8 to 42.3.9. Changelog Sourced from org.postgresql:postgresql's changelog. Changelog Notable changes since version 42.0.0, read the complete History of Changes. The format is based on Keep a...
## Omit parameter alias name if it is the same ```xml ``` could be written as: ```xml ``` ## Shortcut for corpus *SetFeature* pattern ```xml $ bar ``` could be...
Several modules and functions use [`Iterators`](https://github.com/Bibliome/bibliome-java-utils/blob/master/src/main/java/fr/inra/maiage/bibliome/util/Iterators.java), [`Filter`](https://github.com/Bibliome/bibliome-java-utils/blob/master/src/main/java/fr/inra/maiage/bibliome/util/filters/Filter.java), and [`Mapper`](https://github.com/Bibliome/bibliome-java-utils/blob/master/src/main/java/fr/inra/maiage/bibliome/util/mappers/Mapper.java). These classes will be deprecated and should replaced with the standard Java `Stream` API. Notable methods: - [`Evaluator#evaluateElements()`](https://github.com/Bibliome/alvisnlp/blob/4edc54237bc0312903c89df3ebceef5ec1315199/alvisnlp-core/src/main/java/fr/inra/maiage/bibliome/alvisnlp/core/corpus/expressions/Evaluator.java#L68) - [`Corpus#documentIterator()`](https://github.com/Bibliome/alvisnlp/blob/4edc54237bc0312903c89df3ebceef5ec1315199/alvisnlp-core/src/main/java/fr/inra/maiage/bibliome/alvisnlp/core/corpus/Corpus.java#L77) -...
On constate un retard de 2h dans les logs d'AlvisNLP. Ce traitement lancé via une image singularity à 10h21 produit les logs suivants ``` mba@tiers-esv-dev:/work_projet/bibliome/nginx-apis/tiers-esv-api/runs/tmp32set9eb$ cat alvisnlp.log [2023-07-13 08:21:19.944][alvisnlp] build...
Le module Ab3P ne passe pas sur plusieurs textes avec la version alvisnlp 0.10.0 avec [le plan suivant](https://forgemia.inra.fr/omnicrobe/text-mining-workflow/-/blob/dev/plans/entities.plan). Un batch pour reproduire est disponible ici https://forgemia.inra.fr/omnicrobe/text-mining-workflow/-/blob/dev/corpora/pubmed/batches-test/2765/2765.xml ``` [2023-03-28 09:47:29.290][entities.abbrev] Failed...
See: https://github.com/stanfordnlp/stanza/issues/244 Add `stanza.download(LANG)` in order to download resources. https://stanfordnlp.github.io/stanza/download_models.html Maybe check either these resources already exist. Stanza resources by default are placed in `$HOME/stanza_resources`, regardless of Conda environment.
Annotators (online automatic annotation services) now can be registered on PubAnnotation: http://pubannotation.org/annotators The API is documented here: http://www.pubannotation.org/docs/annotation-server-api/
Several modules use [`FileLines`](https://github.com/Bibliome/bibliome-java-utils/blob/master/src/main/java/fr/inra/maiage/bibliome/util/filelines/FileLines.java) to read *CSV* and *TSV* files. This class will be deprecated. Usage of `FileLines` should be replaced with Apache Commons CSV classes. List of modules: -...
``` ▲ dev/alvis_test/alvisnlp ▶ bash --version GNU bash, version 3.2.57(1)-release (x86_64-apple-darwin18) Copyright (C) 2007 Free Software Foundation, Inc. ▲ dev/alvis_test/alvisnlp ▶ brew info coreutils master :: 27d :: ⬡ coreutils:...