Improving the Search on the Internet by Using WordNet and Lexical Operators Metadata

Metadata describes a digital item, providing (if known) such information as creator, publisher, contents, size, relationship to other resources, and more. Metadata may also contain "preservation" components that help us to maintain the integrity of digital files over time.

Title

  • Main Title: Improving the Search on the Internet by Using WordNet and Lexical Operators

Creator

  • Author: Moldovan, Dan I.
    Creator Type: Personal
    Creator Info: Southern Methodist University
  • Author: Mihalcea, Rada, 1974-
    Creator Type: Personal
    Creator Info: University of North Texas; Southern Methodist University

Publisher

  • Name: Institute of Electrical and Electronics Engineers
    Place of Publication: [New York, New York]

Date

  • Creation: 1999-07-21

Language

  • English

Description

  • Content Description: This article discusses improving Internet search by using WordNet and lexical operators.
  • Physical Description: 18 p.

Subject

  • Keyword: information retrieval
  • Keyword: natural language processing
  • Keyword: search engines
  • Keyword: word sense disambiguation

Source

  • Journal: IEEE Internet Computing, 2000, New York: Institute of Electrical and Electronics Engineers

Citation

  • Publication Title: IEEE Internet Computing
  • Volume: 14
  • Issue: 1
  • Peer Reviewed: True

Collection

  • Name: UNT Scholarly Works
    Code: UNTSW

Institution

  • Name: UNT College of Engineering
    Code: UNTCOE

Rights

  • Rights Access: public

Resource Type

  • Article

Format

  • Text

Identifier

  • Archival Resource Key: ark:/67531/metadc83306

Degree

  • Academic Department: Computer Science and Engineering

Note

  • Display Note: Abstract: This paper presents a natural language interface system to an Internet search engine that provides the following improvements: (1) accepts natural language (English) questions, (2) expands the query, based on a word sense disambiguation method, and (3) uses a new lexical operator to post-process the documents retrieved for extracting only the part of a document that is relevant to a query. The system was tested on 100 queries, of which 50 were adopted from the TIPSTER topics collection, provided at the 6th Text Retrieval Conference (TREC-6), and 50 were selected from among the queries submitted by users to an existing Web search engine. The results obtained demonstrate a substantial increase in both the precision and the percentage of queries answered correctly, while the amount of text presented to the user is reduced in comparison with the current Internet search engine technology.
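
The query expansion and post-processing steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `SYNONYMS` table is a hypothetical, hand-written stand-in for WordNet, the word sense disambiguation step is skipped entirely, and the function names `expand_query` and `relevant_paragraphs` are assumptions made for illustration. The paragraph filter only approximates the idea of returning the part of a document that is relevant to a query.

```python
import re

# Hypothetical synonym table standing in for WordNet synsets (assumption:
# the correct sense of each keyword has already been chosen).
SYNONYMS = {
    "car": ["automobile", "auto"],
    "price": ["cost"],
}


def expand_query(keywords):
    """Return one group of alternatives (keyword plus synonyms) per keyword."""
    return [[kw] + SYNONYMS.get(kw, []) for kw in keywords]


def relevant_paragraphs(document, groups):
    """Keep only paragraphs containing at least one term from every group."""
    kept = []
    # Paragraphs are assumed to be separated by blank lines.
    for para in re.split(r"\n\s*\n", document):
        words = set(re.findall(r"[a-z]+", para.lower()))
        if all(any(term in words for term in group) for group in groups):
            kept.append(para.strip())
    return kept


document = (
    "The automobile was sold at a low cost.\n\n"
    "Cars are fun.\n\n"
    "The price of bread rose."
)
groups = expand_query(["car", "price"])
matches = relevant_paragraphs(document, groups)
```

Here only the first paragraph is kept, because it is the only one that matches both expanded keyword groups. Matching is on exact word forms, so "Cars" does not match "car"; a fuller version would need stemming or morphological analysis.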