| Description: | This paper introduces the use of Wikipedia as a resource for automatic keyword extraction and word sense disambiguation, and shows how this online encyclopedia can be used to achieve state-of-the-art results on both these tasks. The paper also shows how the two methods can be combined into a system able to automatically enrich a text with links to encyclopedic knowledge. Given an input document, the system identifies the important concepts in the text and automatically links these concepts to the corresponding Wikipedia pages. Evaluations of the system show that the automatic annotations are reliable and hardly distinguishable from manual annotations. |
|---|---|
| Creator(s): | |
| Creation Date: | November 2007 |
| Partner(s): |
UNT College of Engineering
|
| Collection(s): |
UNT Scholarly Works
|
| Usage: |
Total Uses: 660
Past 30 days: 15
Yesterday: 0
|
| Creator (Author): |
Mihalcea, Rada
University of North Texas |
|
|---|---|---|
| Creator (Author): |
Csomai, Andras
University of North Texas |
|
| Publisher Info: |
Publisher Name: Association for Computing Machinery (ACM)
Place of Publication: [New York, New York]
|
|
| Original Creation Date: | November 2007 | |
| Description: | This paper introduces the use of Wikipedia as a resource for automatic keyword extraction and word sense disambiguation, and shows how this online encyclopedia can be used to achieve state-of-the-art results on both these tasks. The paper also shows how the two methods can be combined into a system able to automatically enrich a text with links to encyclopedic knowledge. Given an input document, the system identifies the important concepts in the text and automatically links these concepts to the corresponding Wikipedia pages. Evaluations of the system show that the automatic annotations are reliable and hardly distinguishable from manual annotations. |
|
| Degree: |
Department:
Computer Science and Engineering
|
|
| Physical Description: |
9 p. |
|
| Language(s): | ||
| Subject(s): |
|
|
| Keyword(s): | keyword extraction | word sense disambiguation | Wikipedia | semantic annotation | |
| Source: | Association for Computing Machinery (ACM) Conference on Information and Knowledge Management (CIKM), 2007, Lisbon, Portugal | |
| Contributor(s): |
|
|
| Partner: |
UNT College of Engineering
|
|
| Collection: |
UNT Scholarly Works
|
|
| Identifier: |
|
|
| Resource Type: | Paper | |
| Format: | Text | |
| Rights: |
Access:
Public
|
|