Willett, P. (2006) The Porter stemming algorithm: then and now. Program: Electronic Library and Information Systems, 40 (3). pp. 219-223. ISSN 0033-0337
Abstract
Purpose: In 1980, Porter presented a simple algorithm for stemming English language words. This paper summarises the main features of the algorithm, and highlights its role not just in modern information retrieval research, but also in a range of related subject domains.
Design: Review of literature and research involving use of the Porter algorithm.
Findings: The algorithm has been widely adopted and extended so that it has become the standard approach to word conflation for information retrieval in a wide range of languages.
Value: The 1980 paper in Program by Porter describing his algorithm has been highly cited. This paper provides a context for the original paper as well as an overview of its subsequent use.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | This is an author produced version of a paper published in Program. |
Keywords: | conflation, information retrieval, Porter stemming algorithm, stemming algorithm, suffix, word variant |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield) |
Depositing User: | Repository Assistant |
Date Deposited: | 02 Aug 2006 |
Last Modified: | 04 Jun 2014 12:58 |
Published Version: | http://dx.doi.org/10.1108/00330330610681295 |
Status: | Published |
Publisher: | Emerald |
Refereed: | Yes |
Identification Number: | 10.1108/00330330610681295 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:1434 |