Latent Semantic Analysis: Five methodological recommendations


Article discussing latent semantic analysis and five methodological recommendations.

Creation Date: 2012
UNT College of Business
UNT Scholarly Works
Total Uses: 60
Past 30 days: 1
Yesterday: 0
Creator (Author):
Evangelopoulos, Nicholas

University of North Texas

Creator (Author):
Zhang, Xiaoni

Northern Kentucky University

Creator (Author):
Prybutok, Victor R.

University of North Texas

Publisher Info:
Publisher Name: Palgrave Macmillan (Firm)
Place of Publication: [New York, New York]
  • Creation: 2012

Article discussing latent semantic analysis and five methodological recommendations.


Abstract: The recent influx in generation, storage, and availability of textual data presents researchers with the challenge of developing suitable methods for their analysis. Latent Semantic Analysis (LSA), a member of a family of methodological approaches that offers an opportunity to address this gap by describing the semantic content in textual data as a set of vectors, was pioneered by researchers in psychology, information retrieval, and bibliometrics. LSA involves a matrix operation called singular value decomposition, an extension of principal component analysis. LSA generates latent semantic dimensions that are either interpreted, if the researcher's primary interest lies with the understanding of the thematic structure in the textual data, or used for purposes of clustering, categorization, and predictive modeling, if the interest lies with the conversion of raw text into numerical data, as a precursor to subsequent analysis. This paper reviews five methodological issues that need to be addressed by the researcher who will embark on LSA. We examine the dilemmas, present the choices, and discuss the considerations under which good methodological decisions are made. We illustrate these issues with the help of four small studies, involving the analysis of abstracts for papers published in the European Journal of Information Systems.


This is the author manuscript version of an article published in the European Journal of Information Systems. Copyright Palgrave Macmillan (Firm).

Physical Description:

45 p.

Keyword(s): text mining | analysis of textual data | singular value decomposition | clustering | factor analysis
Source: European Journal of Information Systems, 2012, New York: Palgrave Macmillan (Firm), pp. 70-86
UNT College of Business
UNT Scholarly Works
  • DOI: 10.1057/ejis.2010.61 |
  • ARK: ark:/67531/metadc288006
Resource Type: Article
Format: Text
Access: Public
Publication Title: European Journal of Information Systems
Volume: 21
Page Start: 70
Page End: 86
Peer Reviewed: Yes