CLAVIN
CLAVIN copied to clipboard
CLAVIN (Cartographic Location And Vicinity INdexer) is an open source software package for document geoparsing and georesolution that employs context-based geographic entity resolution.
After having read the [issue 22](https://github.com/Novetta/CLAVIN/issues/22) I was trying to change the training files for using the italian language. I took the files for the italian language from the OpenNLP...
I've been looking at the source code a bit. It looks like it should be easy to make this run against elastic search instead of running an embedded lucene. The...
I am using CLAVIN through maven. I've the latest lucene index. ``` public static void main(String args[]) throws Exception { GeoParser parser = GeoParserFactory.getDefault("CLAVIN/IndexDirectory"); File inputFile = new File("classifiers/input"); String...
Because it is hard to ship the lucene DB due to its size and memory constraints, we are deploying clavin as a web service. https://github.com/edwardcapriolo/clavin-aas I just wanted to get...
I haven't done rigorous testing yet, but has anyone else noticed v2.0.0 being about half the speed of v1.1? I updated our [CLIFF tool to use it](https://github.com/c4fcm/CLIFF/releases/tag/v2.0.0), but it is...
Executing `mvn test` will incur this issue: ``` bash testHeuristicsOntario(com.berico.clavin.resolver.LocationResolverHeuristicsTest): \ LocationResolver chose the wrong "Hamilton" expected: but was: ``` Suggest separating the notion of algorithm performance from unit testing...
Figure out how we want to handle location names that appear inside the name of an organization entity (e.g., "University of Oxford," "California Pizza Kitchen").
We need to do a better job of extracting & resolving abbreviated location names. We can start with a big list of abbreviations for all countries, US states, provinces of...