Christian Boulanger

Results 136 comments of Christian Boulanger

Hi, I am trying to train a finder model with self-annotated material that i converted from the EXCite/Cermine format (.csv) to .ttx. Unfortunately, I am getting an error: ``` anystyle...

I am posting this here because I have the exact same use case: trying to parse references from footnotes. My training material consists of 40 documents with annotated references in...

@inukshuk Because the material is copyrighted, I cannot publicly share the documents but do you use [Gitter](https://gitter.im/) by any chance?- I could send you the link there.

It might be good to catch the exception and output the name of the document and (if possible) line number in which the error is encountered.

BTW, I am happy to add .ttx as an export format to the online annotation tool https://cboulanger.github.io/excite-docker/

Ok, thanks, that was useful. Turns out that a last line ``` blank | ``` will crash the document parser for some reason. I'll make sure that the converter script...

I am training a finder model, using 38 .ttx containing references in footnotes from files that I converted from a different format (happy to send them offline, they are from...

After having looked closer at the source code, I can see that AnyStyle currently trains the finder model by labelling whole lines, which I fear defeats the goal of parsing...

@inukshuk You are right - it makes more sense to offload the task of finding multiple references in one line to the parser which does reference segmentation. So the first...

@inukshuk Yay! The `--solo` option was the key! ``` $ anystyle -F test/finder.mod -f csl --verbose find --solo test/10.1515_zfrs-1980-0104.pdf test Analyzing 10.1515_zfrs-1980-0104.pdf ... 47 references found. ``` Cool. The results...