JedAIToolkit
An open-source, highly scalable Java toolkit for Entity Resolution.
Constructor parameters are not assigned to the class properties
The PrintToFile.toCSV() method should output the original entity URLs, in a format that is easier to import into a database, e.g. three columns: cluster_id, dataset, entity_url
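To make the requested layout concrete, here is a minimal sketch of a writer that emits one row per entity with the three columns named above. It is not the toolkit's code: the `ClusterCsvSketch` class, the `Member` holder and the `toCsv` helper are hypothetical names introduced only for illustration, and the quoting rule (wrap the URL in double quotes, double any embedded quote) is an assumption about what a database importer would accept.

```java
import java.util.Arrays;
import java.util.List;

class ClusterCsvSketch {

    // Hypothetical holder: one entity and the cluster it was assigned to.
    static final class Member {
        final int clusterId;
        final int dataset;
        final String entityUrl;

        Member(int clusterId, int dataset, String entityUrl) {
            this.clusterId = clusterId;
            this.dataset = dataset;
            this.entityUrl = entityUrl;
        }
    }

    // Assumed quoting rule: wrap the field in double quotes and double embedded quotes.
    static String quote(String field) {
        return "\"" + field.replace("\"", "\"\"") + "\"";
    }

    // Builds the CSV text: a header line, then one line per cluster member.
    static String toCsv(List<Member> members) {
        StringBuilder sb = new StringBuilder("cluster_id,dataset,entity_url\n");
        for (Member m : members) {
            sb.append(m.clusterId).append(',')
              .append(m.dataset).append(',')
              .append(quote(m.entityUrl)).append('\n');
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Two entities from different datasets that ended up in the same cluster.
        List<Member> members = Arrays.asList(
                new Member(0, 1, "http://example.org/a"),
                new Member(0, 2, "http://example.org/b"));
        System.out.print(toCsv(members));
    }
}
```

One row per entity, rather than one row per matched pair, keeps the file in a shape that can be loaded into a single table and grouped by cluster_id.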
StandardBlocking.getTokens() throws a NullPointerException when its input parameter is null. Null values should be prevented from being added to the EntityProfile when reading from a database.
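The two guards suggested here could look roughly like the sketch below. It does not reproduce the toolkit's classes: `NullSafeTokensSketch`, `tokens` and `nonNullAttributes` are hypothetical stand-ins, the database row is modelled as a plain `Map`, and splitting on whitespace is an assumption made for the example rather than the toolkit's actual tokenization rule.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;

class NullSafeTokensSketch {

    // Hypothetical tokenizer: returns an empty set instead of throwing on null input.
    static Set<String> tokens(String value) {
        if (value == null) {
            return Collections.emptySet();
        }
        String normalized = value.trim().toLowerCase();
        if (normalized.isEmpty()) {
            return Collections.emptySet();
        }
        // Assumption: tokens are separated by whitespace.
        return new LinkedHashSet<>(Arrays.asList(normalized.split("\\s+")));
    }

    // Hypothetical loader step: copy a database row, skipping columns whose value is null,
    // so that null values never reach the entity profile in the first place.
    static Map<String, String> nonNullAttributes(Map<String, String> row) {
        Map<String, String> attributes = new LinkedHashMap<>();
        for (Map.Entry<String, String> column : row.entrySet()) {
            if (column.getValue() != null) {
                attributes.put(column.getKey(), column.getValue());
            }
        }
        return attributes;
    }

    public static void main(String[] args) {
        Map<String, String> row = new LinkedHashMap<>();
        row.put("name", "John  Smith");
        row.put("phone", null); // a NULL column read from the database

        for (String value : nonNullAttributes(row).values()) {
            System.out.println(tokens(value));
        }
        System.out.println(tokens(null).isEmpty());
    }
}
```

Filtering at load time addresses the root cause, while the guard in the tokenizer keeps the blocking step robust against profiles created by other readers.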
The second URL of every row is mapped to 0. As a result, all of the records in the first column of the CSV file are considered duplicates (via transitive closure).
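The effect described above can be reproduced with a small union-find structure, which is one common way to compute a transitive closure over matched pairs. This is an illustration only, not the toolkit's implementation: the `TransitiveClosureSketch` class and its methods are hypothetical, and the sketch assumes that all entity ids share a single id space.

```java
class TransitiveClosureSketch {

    private final int[] parent;

    TransitiveClosureSketch(int size) {
        parent = new int[size];
        for (int i = 0; i < size; i++) {
            parent[i] = i; // every entity starts in its own cluster
        }
    }

    // Returns the representative of the cluster containing x (with path halving).
    int find(int x) {
        while (parent[x] != x) {
            parent[x] = parent[parent[x]];
            x = parent[x];
        }
        return x;
    }

    // Records that a and b are duplicates by merging their clusters.
    void union(int a, int b) {
        parent[find(a)] = find(b);
    }

    boolean sameCluster(int a, int b) {
        return find(a) == find(b);
    }

    public static void main(String[] args) {
        // Ids 1, 2 and 3 stand for first-column records.
        // Id 0 is what every second-column URL was wrongly mapped to.
        TransitiveClosureSketch clusters = new TransitiveClosureSketch(4);
        clusters.union(1, 0);
        clusters.union(2, 0);
        clusters.union(3, 0);

        // 1 and 3 were never paired directly, yet they land in the same cluster.
        System.out.println(clusters.sameCluster(1, 3)); // prints true
    }
}
```

Because every pair shares the same second id, each union attaches another first-column record to the cluster of entity 0, so the closure collapses the whole column into one cluster.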