- spaCy
- AllenNLP
- Google Cloud Natural Language
- Azure Entity Service
- Flair
- Stanford CoreNLP
- LightNER
- DBpedia Spotlight
- OpenCalais
OpeNER seemed promising because it breaks the pipeline into components and is open source. Unfortunately, the project does not appear to have been updated in a couple of years, and its entity recognizer does not work.
- Does it cost money?
- Is it open source?
- What kinds of entities does it identify?
- Does it require downloading model data? If so, how large is the data?
- Are there different prebuilt models?
- Does it have a CLI?
- Can it be integrated into another program?
  - If so, from which programming languages?
- What kind of output do you get?
- How fast is it?
- Does it provide cross-referencing of entities?
  - Between matches in the output?
  - To Wikipedia, or another knowledge base?
- Qualitatively...
  - Did it miss entities? A few? A lot?
  - Did it misclassify entities?
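
The two qualitative questions above can be made concrete by comparing a tool's output against a small hand-labeled sample. The following is a minimal sketch, not any tool's API: the `Entity` type, the `compare` helper, the example sentence, and the labels are all hypothetical, and it assumes entities are matched by exact character span.

```python
from typing import NamedTuple


class Entity(NamedTuple):
    start: int   # character offset where the mention begins
    end: int     # character offset just past the mention
    label: str   # entity type (illustrative labels only)


def compare(gold: list[Entity], predicted: list[Entity]) -> dict[str, list[Entity]]:
    """Bucket entities into correct, misclassified, missed, and spurious."""
    predicted_by_span = {(e.start, e.end): e for e in predicted}
    gold_spans = {(e.start, e.end) for e in gold}

    report = {"correct": [], "misclassified": [], "missed": [], "spurious": []}
    for g in gold:
        p = predicted_by_span.get((g.start, g.end))
        if p is None:
            report["missed"].append(g)          # the tool found nothing at this span
        elif p.label == g.label:
            report["correct"].append(g)
        else:
            report["misclassified"].append(p)   # right span, wrong type
    # Predictions whose span matches no hand-labeled entity
    report["spurious"] = [e for e in predicted if (e.start, e.end) not in gold_spans]
    return report


text = "Ada Lovelace visited City Hall in Chicago."
gold = [Entity(0, 12, "PERSON"), Entity(21, 30, "FAC"), Entity(34, 41, "GPE")]
predicted = [Entity(0, 12, "PERSON"), Entity(34, 41, "ORG")]  # made-up tool output

report = compare(gold, predicted)
for bucket, entities in report.items():
    print(bucket, [(text[e.start:e.end], e.label) for e in entities])
```

Exact-span matching is the strictest choice; a tool that tags only "Hall" would count as both a miss and a spurious hit here, so a looser overlap rule may be more useful for a quick qualitative pass.
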
- What do these tools do functionally?
  - Find entities
- What do these tools do technically?
- How do these tools differ under the hood?
- How does entity extraction relate to other tools?
  - NER is a sequence labeling task. There are other sequence labelers, some of which are used in the news industry, such as many of the products DataMade has built and deployed.
  - There are other approaches to information extraction.
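
To illustrate what "sequence labeling" means here: a labeler assigns one tag to every token, and entities are recovered by grouping adjacent tags. The sketch below assumes the common BIO scheme (`B-` begins an entity, `I-` continues it, `O` is outside any entity). The `bio_to_spans` helper, the tokens, and the tags are hypothetical and hand-written, not the output of any tool listed above.

```python
def bio_to_spans(tokens: list[str], tags: list[str]) -> list[tuple[str, str]]:
    """Collapse per-token BIO tags into (entity text, label) pairs."""
    spans = []
    current_tokens: list[str] = []
    current_label = None
    for token, tag in zip(tokens, tags):
        prefix, _, label = tag.partition("-")
        continues = prefix == "I" and label == current_label
        if not continues and current_tokens:
            # The open entity ends here, so emit it
            spans.append((" ".join(current_tokens), current_label))
            current_tokens, current_label = [], None
        if prefix == "B" or (prefix == "I" and not continues):
            # A stray "I-" tag with no matching open entity is treated as a start
            current_tokens, current_label = [token], label
        elif continues:
            current_tokens.append(token)
    if current_tokens:
        spans.append((" ".join(current_tokens), current_label))
    return spans


tokens = ["Ada", "Lovelace", "visited", "City", "Hall", "in", "Chicago", "."]
tags = ["B-PER", "I-PER", "O", "B-LOC", "I-LOC", "O", "B-LOC", "O"]
spans = bio_to_spans(tokens, tags)
print(spans)
```

The same decoding step applies to any per-token labeler; only the tag set changes from task to task.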