Method, System and Apparatus for Automatic Keyword Extraction

PDF Version Also Available for Download.


Patent relating to a method, system and apparatus for automatic keyword extraction.

Physical Description

34 p. : ill.

Creation Information

Csomai, Andras & Mihalcea, Rada, 1974- January 1, 2013.


This patent is part of the collection entitled: UNT Scholarly Works and was provided by the UNT College of Engineering to the UNT Digital Library, a digital repository hosted by the UNT Libraries. It has been viewed 470 times. More information about this patent can be viewed below.


People and organizations associated with either the creation of this patent or its content.



Provided By

UNT College of Engineering

The UNT College of Engineering strives to educate and train engineers and technologists who have the vision to recognize and solve the problems of society. The college comprises six degree-granting departments of instruction and research.

Contact Us


Descriptive information to help identify this patent. Follow the links below to find similar items on the Digital Library.

Degree Information


Patent relating to a method, system and apparatus for automatic keyword extraction.

Physical Description

34 p. : ill.


Abstract: The present invention provides a method and a system for automatic keyword extraction based on supervised or unsupervised machine learning techniques. Novel linguistically-motivated machine learning features are introduced, including discourse comprehension features based on construction integration theory, numeric features making use of syntactic part-of-speech patterns, and probabilistic features based on analysis of online encyclopedia annotations. The improved keyword extraction methods are combined with word sense disambiguation into a system for automatically generating annotations to enrich text with links to encyclopedic knowledge.

Prior Publication Data: US 2010/0145678 A1, June 10, 2010.

Related U.S. Application Data: Provisional application number 61/112,182, filed on November 6, 2008.



Item Type


Unique identifying numbers for this patent in the Digital Library or other systems.


This patent is part of the following collection of related materials.

UNT Scholarly Works

Materials from the UNT community's research, creative, and scholarly activities and UNT's Open Access Repository. Access to some items in this collection may be restricted.

What responsibilities do I have when using this patent?


Dates and time periods associated with this patent.

Submitted Date

  • November 6, 2009

Accepted Date

  • January 1, 2013

Creation Date

  • January 1, 2013

Added to The UNT Digital Library

  • July 7, 2014, 8:20 a.m.

Description Last Updated

  • Oct. 31, 2023, 10:28 a.m.

Usage Statistics

When was this patent last used?

Yesterday: 0
Past 30 days: 2
Total Uses: 470


Geographical information about where this patent originated or about its content.

Map Information

  • map marker Place Name coordinates. (May be approximate.)
  • Repositioning map may be required for optimal printing.

Mapped Locations

Interact With This Patent

Here are some suggestions for what to do next.

Start Viewing

PDF Version Also Available for Download.

International Image Interoperability Framework

IIF Logo

We support the IIIF Presentation API

Csomai, Andras & Mihalcea, Rada, 1974-. Method, System and Apparatus for Automatic Keyword Extraction, patent, January 1, 2013; [Washington, D.C.]. ( accessed March 4, 2024), University of North Texas Libraries, UNT Digital Library,; crediting UNT College of Engineering.

Back to Top of Screen