lingo
lingo copied to clipboard
package lingo provides the data structures and algorithms required for natural language processing
I don't see any included models or tagged text corpuses with this library. I [found a note](https://github.com/chewxy/lingo/blob/master/treebank/treebank.go#L106) to use [English Web Treebank](https://catalog.ldc.upenn.edu/LDC2012T13)(?) ([free version](https://github.com/UniversalDependencies/UD_English-EWT)), but I'm wondering what assets I...
Trying to train a treebank against `ud-treebanks-v2.3/UD_English-EWT/en_ewt-ud-dev.conllu` and I've noticed that all rows that have a head field with the value `_` panic. Any ideas on how to deal with...
You are creating a binary called `dep` that overwrites the `dep` binary from the dependency management tool. Independent from who claimed that name first, it causes some unexpected behavior :smile:
Move to https://github.com/go-nlp, which ideally will combine efforts of @james-bowman (TF-IDF, LDA, LSA etc) and @ynqa (Word Embeddings), possibly some help from @sajari guys (@dhowden, @mish15, @tbillington). # TODO #...
`"hello there, world"` and `"hello there , world"` yields different parses. Should look into lexer
It's very obvious when you see this: ```go func pipeline(name string, f io.Reader) (*lingo.Dependency, error) { l := lexer.New(name, f) p := pos.New(pos.WithModel(posModel), pos.WithCluster(clusters), pos.WithStemmer(stemmer{}), pos.WithLemmatizer(fixer{})) d := dep.New(depModel) //...
``` ../../const.go:58: constant 1000000000000 overflows int ../../const.go:59: constant 1000000000000000 overflows int ``` ## Tasks ## * [ ] Create a version of `const.go` for ARM that doesn't have those two...
https://arxiv.org/pdf/1603.06042 This has been on my radar for a while now. But haven't had the time to implement it Tasks: * [ ] Add beam search instead of greedy decoding...