awesome-neural-models-for-semantic-match

Recommendation (unsupervised/low-resource text alignment)

Open pltrdy opened this issue 5 years ago • 1 comments

Hey guys,

Good work both on MatchZoo and this list!
I would be interested in some quick advice or pointers on something related: I'd like to match related parts of texts.

More formally, I have a document made of several sections (each with multiple sentences), and I'd like to map it to a similar text (a transcription, in fact) that is a bit longer and somewhat noisy, but talks about the same thing (lots of similarities) and in the same order. I wrote a dynamic programming algorithm that maximizes the cosine similarity between sentence embeddings. The results aren't too bad, but I'd like to experiment with other approaches.
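For readers who want something concrete to discuss, here is a minimal sketch of what such an order-preserving alignment could look like. It is not the code from this post: the function name `monotonic_align`, the toy 2-d vectors, and the constraint that each transcript sentence either stays in the current section or moves to the next one are all assumptions made for illustration. Real sentence embeddings would come from whatever encoder is in use.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def monotonic_align(section_vecs, sentence_vecs):
    """Assign every transcript sentence to one section, in order.

    Hypothetical helper: section indices are non-decreasing, start at 0,
    end at the last section, and advance by at most one per sentence.
    Returns (assignment, total_similarity).
    """
    m, n = len(section_vecs), len(sentence_vecs)
    if m == 0 or n < m:
        raise ValueError("need at least one section and at least as many sentences as sections")

    sim = [[cosine(s, t) for t in sentence_vecs] for s in section_vecs]
    neg_inf = float("-inf")
    best = [[neg_inf] * n for _ in range(m)]  # best[i][j]: best score with sentence j in section i
    back = [[0] * n for _ in range(m)]        # back[i][j]: section of sentence j - 1 on that path
    best[0][0] = sim[0][0]

    for j in range(1, n):
        # Section i is only reachable by sentence j if i <= j.
        for i in range(min(j, m - 1) + 1):
            stay = best[i][j - 1]
            advance = best[i - 1][j - 1] if i > 0 else neg_inf
            if advance > stay:
                best[i][j], back[i][j] = advance + sim[i][j], i - 1
            else:
                best[i][j], back[i][j] = stay + sim[i][j], i

    # Backtrack from the last sentence in the last section.
    assignment = [0] * n
    i = m - 1
    for j in range(n - 1, -1, -1):
        assignment[j] = i
        i = back[i][j]
    return assignment, best[m - 1][n - 1]


if __name__ == "__main__":
    sections = [[1.0, 0.0], [0.0, 1.0]]                          # toy section embeddings
    transcript = [[1.0, 0.0], [0.9, 0.1], [0.1, 0.9], [0.0, 1.0]]  # toy sentence embeddings
    print(monotonic_align(sections, transcript))
```

The stay-or-advance constraint forces every section to receive at least one sentence; dropping it, or adding a "skip" state for noisy transcript sentences, would be a natural variation.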

Any ideas?

Thanks a lot for any clues or references that seem relevant. We could discuss this on gitter.im as well.

Paul


I don't have much gold data (i.e. suitable segments to use as training pairs), which is why I mention unsupervised/low-resource approaches.

pltrdy avatar Dec 20 '18 11:12 pltrdy

@faneshion can you answer?

bwanglzu avatar Dec 20 '18 11:12 bwanglzu