Intellexer Named Entity Recognizer

The named entity recognition task involves identifying proper names in text and classifying them into a set of predefined categories of interest. Most commercially available software packages detect proper names that refer to people, places and companies.

Intellexer Named Entity Recognizer identifies not only personal names, names of organizations and geographical locations, but also positions/occupations, nationalities, dates, ages, durations and names of events. Its results can be of great value to information-driven industries of all kinds, especially banks, finance companies, publishers and governments.

To provide the highest-quality results, Intellexer Named Entity Recognizer combines several algorithms:

  • A statistical model based on a Hidden Markov Model and trained on a part-of-speech annotated corpus of business transaction articles, news articles and web pages (more than 500 thousand token-tag pairs);
  • Machine learning algorithms that automatically generate named entity recognition patterns, using a set of semantic dictionaries and a tagged corpus as training data;
  • Expert rules that improve the results of the statistical algorithms. Intellexer Named Entity Recognizer contains more than 500 rules created manually by our linguists.
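
As a rough illustration of how a statistical tagger and expert rules can be combined, the following Python sketch decodes a tag sequence with the Viterbi algorithm over a toy Hidden Markov Model and then applies a single hand-written rule on top of the statistical output. It is not Intellexer's implementation: the tag set, the probabilities, the rule and the function names viterbi and apply_rules are all hypothetical and chosen only for the example.

```python
import math
import re

# Hypothetical toy model: every tag, word and probability below is invented
# for illustration and is not taken from Intellexer.
TAGS = ["PER", "ORG", "O"]
START = {"PER": 0.3, "ORG": 0.2, "O": 0.5}
TRANS = {
    "PER": {"PER": 0.5, "ORG": 0.1, "O": 0.4},
    "ORG": {"PER": 0.1, "ORG": 0.5, "O": 0.4},
    "O":   {"PER": 0.2, "ORG": 0.2, "O": 0.6},
}
EMIT = {
    "PER": {"john": 0.5, "smith": 0.4},
    "ORG": {"acme": 0.6, "corp": 0.3},
    "O":   {"joined": 0.4, "in": 0.3, "2010": 0.1},
}
UNSEEN = 1e-6  # small probability for words a tag has never emitted


def viterbi(tokens):
    """Return the most probable tag sequence under the toy HMM."""
    if not tokens:
        return []
    words = [t.lower() for t in tokens]
    # Each column maps tag -> (best log-probability, previous tag on that path).
    best = [{
        t: (math.log(START[t]) + math.log(EMIT[t].get(words[0], UNSEEN)), None)
        for t in TAGS
    }]
    for word in words[1:]:
        column = {}
        for t in TAGS:
            prev, score = max(
                ((p, best[-1][p][0] + math.log(TRANS[p][t])) for p in TAGS),
                key=lambda pair: pair[1],
            )
            column[t] = (score + math.log(EMIT[t].get(word, UNSEEN)), prev)
        best.append(column)
    # Backtrack from the best final tag to recover the whole path.
    tag = max(TAGS, key=lambda t: best[-1][t][0])
    path = [tag]
    for column in reversed(best[1:]):
        tag = column[tag][1]
        path.append(tag)
    return path[::-1]


def apply_rules(tokens, tags):
    """Hypothetical expert rule: a four-digit year is always tagged DATE."""
    return [
        "DATE" if re.fullmatch(r"(19|20)\d{2}", token) else tag
        for token, tag in zip(tokens, tags)
    ]


if __name__ == "__main__":
    tokens = "John Smith joined Acme Corp in 2010".split()
    statistical = viterbi(tokens)
    print(statistical)                       # ['PER', 'PER', 'O', 'ORG', 'ORG', 'O', 'O']
    print(apply_rules(tokens, statistical))  # last tag becomes 'DATE'
```

In this toy setup the statistical model has no DATE tag at all, so the year can only be recovered by the rule layer. That is the general motivation for layering manually written rules over a statistical tagger.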

To evaluate the effectiveness of Intellexer Named Entity Recognizer, we created a dataset of news articles from different domains (science, sport, business, information technology, etc.). The experiments showed that our named entity recognition technique achieves 88-97% accuracy and can be successfully applied in various document management and analytical systems:

  • Information security solutions that trace suspicious activity in corporate mail, social networks and other communication channels by detecting mentions of entities of interest and occurrences of relations between them;
  • Marketing and PR solutions for analyzing media events and entities;
  • Social media solutions for monitoring the behavior of entities of interest.

