Willett, P. (2006) The Porter stemming algorithm: then and now. Program: Electronic Library and Information Systems, 40 (3). pp. 219-223. ISSN 0033-0337Full text available as:
Purpose: In 1980, Porter presented a simple algorithm for stemming English language words. This paper summarises the main features of the algorithm, and highlights its role not just in modern information retrieval research, but also in a range of related subject domains.
Design: Review of literature and research involving use of the Porter algorithm.
Findings: The algorithm has been widely adopted and extended so that it has become the standard approach to word conflation for information retrieval in a wide range of languages.
Value: The 1980 paper in Program by Porter describing his algorithm has been highly cited. This paper provides a context for the original paper as well as an overview of its subsequent use.
|Copyright, Publisher and Additional Information:||This is an author produced version of a paper published in Program.|
|Keywords:||conflation, information retrieval, Porter stemming algorithm, stemming algorithm, suffix, word variant|
|Institution:||The University of Sheffield|
|Academic Units:||The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield)|
|Depositing User:||Repository Assistant|
|Date Deposited:||02 Aug 2006|
|Last Modified:||04 Jun 2014 12:58|