AutoNER
AutoNER copied to clipboard
Learning Named Entity Tagger from Domain-Specific Dictionary
Hi, I'm just curious about the decoding process of Fuzzy CRF. The paper said "For inference, we apply the Viterbi algorithm to maximize the score." But the original Viterbi algorithm...
Hey Prof. Shang, The link to the `bio_embedding.txt` is broken. Do you have a new place to host that file? Or could you explain the format of the embedding file?...
Thanks for the work. When I dig deep into the code, I found that in the result of the function 'read_noisy_corpus', all the start token is marked as 'O' but...
Is is hard for us to download bio_embedding.txt because its memory is nearly 10GB. We hope to construct one by ourselves, so what is the baisc format of bio_embedding.txt. Can...
could anyone help me with this error?
I got this error after running the AutoNER without DEV_SET and TEST_SET: ``` Traceback (most recent call last): File "preprocess_partial_ner/encode_folder.py", line 281, in testa_dataset = encode_dataset(args.input_testa, w_map, c_map, cl_map, tl_map)...
when running the train script, it does not create a test.pk file, is there a solution to this?
Hi, the original paper says > In our AutoNER model, these “unknown” positions have undefined boundary and type losses, be- cause (1) they make the boundary labels unclear; and (2)...
Added language model optional parameters to the training script. If LMs are given, it will add contextualized representation as part of the input feature. It also needs to have model_word_ada...
Hello, the link to bio_embedding.txt does not open, where can I get this file?