Does pynlp keep the original tag type "O" which is the non-entity part? #13

hexingren · 2018-04-27T14:14:53Z

Hello,

Does pynlp keep the original "O" tag, which marks the non-entity parts of a sentence?

For example,
sentence = "Nora Jani, a single person, Matt Jani and Susan Jani, husband and wife"

Expected result:
[('Nora Jani', 'PERSON'), ('a single person', 'O'), ('Matt Jani', 'PERSON'), ('and', 'O'), ('Susan Jani', 'PERSON'), ('husband and wife', 'O')]

Thanks.

sina-al · 2018-04-27T14:48:00Z

Yes, try this:

from pynlp import StanfordCoreNLP

nlp = StanfordCoreNLP(annotators='tokenize, ssplit, pos, ner')

document = nlp("Nora Jani, a single person, Matt Jani and Susan Jani, husband and wife")

for sentence in document:
    for token in sentence:
        print(token, token.ner)

This will give you token-level named entity recognition.

If you want entities that span multiple tokens, use entitymentions:

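nlp = StanfordCoreNLP(annotators='entitymentions')

# Re-annotate the text with the new pipeline; the earlier document was built without entitymentions
document = nlp("Nora Jani, a single person, Matt Jani and Susan Jani, husband and wife")

for entity in document.entities:
    print(entity)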
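
Entity mentions cover only the entities, while the expected result in the question also keeps the "O" stretches between them. One way to get both is to merge adjacent tokens that share a tag. The following is a minimal sketch, not part of pynlp: `group_by_tag` is a hypothetical helper, and the sample tags are hand-written for illustration rather than real CoreNLP output.

```python
from itertools import groupby


def group_by_tag(tagged_tokens):
    """Merge runs of adjacent (word, tag) pairs that share a tag into (text, tag) spans."""
    spans = []
    for tag, run in groupby(tagged_tokens, key=lambda pair: pair[1]):
        text = " ".join(word for word, _ in run)
        spans.append((text, tag))
    return spans


# Hand-written sample tags for illustration only, not actual CoreNLP output.
sample = [
    ("Nora", "PERSON"), ("Jani", "PERSON"),
    ("a", "O"), ("single", "O"), ("person", "O"),
    ("Matt", "PERSON"), ("Jani", "PERSON"),
]

print(group_by_tag(sample))
```

With pynlp, the input pairs could presumably be collected from the token loop, for example as (str(token), token.ner) for each token, assuming str(token) yields the word. Two caveats: punctuation tokens are normally tagged "O" and would be joined into the neighbouring "O" span unless filtered out first, and two different entities of the same type that sit directly next to each other would be merged into one span.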

sina-al · 2018-04-27T14:55:37Z

I will try to write up some docs soon.

hexingren · 2018-04-27T15:55:57Z

The first block of code runs into the error from #12 again if I include 'tokenize, ssplit, pos'. The code that works for now is:

from pynlp import StanfordCoreNLP

nlp = StanfordCoreNLP(annotators='ner', options={"ner.useSUTime": False})
# The code below throws CoreNLPServerError: Status code: [500]
# nlp = StanfordCoreNLP(annotators='tokenize, ssplit, pos, ner', options={"ner.useSUTime": False})

document = nlp("Nora Jani, a single person, Matt Jani and Susan Jani, husband and wife")

for sentence in document:
    for token in sentence:
        print(token, token.ner)

This looks like a problem on the CoreNLP server side. Thanks!

sina-al added the documentation label Apr 27, 2018
