Representation-Learning-for-Information-Extraction icon indicating copy to clipboard operation
Representation-Learning-for-Information-Extraction copied to clipboard

Candidate Selection

Open bradfox2 opened this issue 3 years ago • 6 comments

Have you experimented with altering the candidate selection process?

I am interested in what occurs when the candidate selection process is simplified or removed entirely so that every possible candidate is evaluated.

bradfox2 avatar May 12 '21 03:05 bradfox2

I didn't try that as removing it would mean a lot of meaningless negative candidates. For eg: for a date, a floating-point number or random text doesn't make sense. So I didn't try that.

Praneet9 avatar May 12 '21 06:05 Praneet9

@Praneet9 I have trained this model on structured documents dataset and during the training, validation loss and accuracy were 0.0006 and .98421 respectively, but when I am testing it on the new documents the result is very poor. The trained model is not able to predict those keys for which there are multiple candidates.
I am attaching the snapshot of the documents for amounts information, in which, many keys have same candidate so model is not able to detect the keys. amounts we have to extract all the key value present in the snapshot. Just want to confirm one thing, is there are any condition that a single candidate text can't be part of multiple keys? please can you suggest how we can solve this issue?

Neelesh1121 avatar Oct 28 '21 11:10 Neelesh1121

@Neelesh1121 Can you please explain what you mean by

we have to extract all the key value present in the snapshot.

There's no rule like that. In the above case, what are you trying to extract for the amount key?

Praneet9 avatar Nov 02 '21 06:11 Praneet9