lingo icon indicating copy to clipboard operation
lingo copied to clipboard

package lingo provides the data structures and algorithms required for natural language processing

Results 16 lingo issues
Sort by recently updated
recently updated
newest added

I don't see any included models or tagged text corpuses with this library. I [found a note](https://github.com/chewxy/lingo/blob/master/treebank/treebank.go#L106) to use [English Web Treebank](https://catalog.ldc.upenn.edu/LDC2012T13)(?) ([free version](https://github.com/UniversalDependencies/UD_English-EWT)), but I'm wondering what assets I...

Trying to train a treebank against `ud-treebanks-v2.3/UD_English-EWT/en_ewt-ud-dev.conllu` and I've noticed that all rows that have a head field with the value `_` panic. Any ideas on how to deal with...

You are creating a binary called `dep` that overwrites the `dep` binary from the dependency management tool. Independent from who claimed that name first, it causes some unexpected behavior :smile:

Move to https://github.com/go-nlp, which ideally will combine efforts of @james-bowman (TF-IDF, LDA, LSA etc) and @ynqa (Word Embeddings), possibly some help from @sajari guys (@dhowden, @mish15, @tbillington). # TODO #...

`"hello there, world"` and `"hello there , world"` yields different parses. Should look into lexer

It's very obvious when you see this: ```go func pipeline(name string, f io.Reader) (*lingo.Dependency, error) { l := lexer.New(name, f) p := pos.New(pos.WithModel(posModel), pos.WithCluster(clusters), pos.WithStemmer(stemmer{}), pos.WithLemmatizer(fixer{})) d := dep.New(depModel) //...

``` ../../const.go:58: constant 1000000000000 overflows int ../../const.go:59: constant 1000000000000000 overflows int ``` ## Tasks ## * [ ] Create a version of `const.go` for ARM that doesn't have those two...

easy
Non-Standard Builds

https://arxiv.org/pdf/1603.06042 This has been on my radar for a while now. But haven't had the time to implement it Tasks: * [ ] Add beam search instead of greedy decoding...

new feature