link-grammar
The CMU Link Grammar natural language parser
`dictionary_lookup_list()` and `dictionary_lookup_wild()` are part of the API, and words that they return, if subscripted, have `SUBSCRIPT_MARK` in them. However, this symbol is currently not part of the API. Similarly,...
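Until the symbol is exported, a caller that wants to display such words has to hard-code the mark. The following is only a minimal sketch of that workaround. It assumes the internal mark is the control character `'\3'` (an assumption about the library's private headers, not something the public API guarantees), and `display_subscript()` is a hypothetical helper name, not a library function.

```c
#include <stddef.h>
#include <string.h>

/* Assumption: the library's internal subscript mark is '\3'.
 * It is not exported by the public headers, so it is redefined locally. */
#define ASSUMED_SUBSCRIPT_MARK '\3'

/* Hypothetical helper: copy `word` into `out` (truncating if needed) and
 * replace the first subscript mark with a printable '.'.
 * Returns 1 if a mark was found and replaced, 0 otherwise. */
static int display_subscript(const char *word, char *out, size_t outsz)
{
    if (outsz == 0) return 0;

    strncpy(out, word, outsz - 1);
    out[outsz - 1] = '\0';

    char *mark = strchr(out, ASSUMED_SUBSCRIPT_MARK);
    if (mark == NULL) return 0;

    *mark = '.';
    return 1;
}
```

If the mark's value ever changes inside the library, a local redefinition like this breaks silently, which is the argument for making the symbol part of the API.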
I wondered why this dict has only 5 sentence examples and none of them can be fully parsed. The project in this URL has recently been transferred to https://bitbucketarchive.softwareheritage.org/projects/ng/ngocminh/lienkate.html, from...
Idioms
In PR #1194 I said: > There are some more things to discuss on idioms, to be done here or in a new issue. Currently, a word with underbars is...
There is a bug here regarding the link to `LEFT-WALL`: ``` text test.v: [71186]0.000= @E- Qa- dWV- B*d- VC+ VC+ : : @E- & : Qa- & : dWV- &...
From 5.7.0:
```text
linkparser> !!test.v
Token "test.v" matches:
test.v 18101 disjuncts
```
Now (248a6133d8793528f6376cd26baa3b9733001bf6):
```text
linkparser> !!test.v
Token "test.v" matches:
test.v 93621 disjuncts
```
This also seems to cause a...
Cost cutoff
# Background I'm trying to fix the SAT-parser disjunct cost calculations to get accurate linkage disjunct and sentence costs, and mostly succeeded with that. (See issue #188 on the ignoring...
This is the current setup of panic mode:
```
parse_options_set_disjunct_cost(opts, 4.0f);
parse_options_set_min_null_count(opts, 1);
parse_options_set_max_null_count(opts, 100);
parse_options_set_max_parse_time(opts, 60);
parse_options_set_islands_ok(opts, false);
parse_options_set_short_length(opts, 12);
parse_options_set_all_short_connectors(opts, true);
parse_options_set_linkage_limit(opts, 100);
parse_options_set_spell_guess(opts, 0);
```
Proposed fixes:...
In an unrelated search I encountered [page 358 of "Intelligent Information and Database Systems: 8th Asian Conference ..., Part 2"](https://books.google.co.il/books?id=DtCzCwAAQBAJ&pg=PA358&lpg=PA358&dq=%22link-grammar%22+graph&source=bl&ots=aJRAMVvpoX&sig=ACfU3U0DuHIY7CZPdi18Gkvi2HKcmhdJzQ&hl=en&sa=X&ved=2ahUKEwipi9fc8u_lAhWfRxUIHZOmAV04ChDoATADegQIBxAB#v=onepage&q=%22link-grammar%22%20graph&f=false). This conference was in 2016, but according to their benchmark...
After banging my head on this for a couple days, I finally figured out what was causing link-grammar to segfault. Funny enough I found out 'why' by stumbling on an...
The capitalized "Pound" appearing in `words/currency` prevents the `` regex from running, thus breaking the parse of "Most admiring commentators have read the poem more nearly as Pound read it...