Willett, P. (2006) The Porter stemming algorithm: then and now. Program: Electronic Library and Information Systems, 40 (3). pp. 219-223. ISSN 0033-0337
Full text available as:|
Text
willettp9_PorterStemmingReview.pdf Download (134Kb) |
Abstract
Purpose: In 1980, Porter presented a simple algorithm for stemming English language words. This paper summarises the main features of the algorithm, and highlights its role not just in modern information retrieval research, but also in a range of related subject domains.
Design: Review of literature and research involving use of the Porter algorithm.
Findings: The algorithm has been widely adopted and extended so that it has become the standard approach to word conflation for information retrieval in a wide range of languages.
Value: The 1980 paper in Program by Porter describing his algorithm has been highly cited. This paper provides a context for the original paper as well as an overview of its subsequent use.
| Item Type: | Article |
|---|---|
| Copyright, Publisher and Additional Information: | This is an author produced version of a paper published in Program. |
| Keywords: | conflation, information retrieval, Porter stemming algorithm, stemming algorithm, suffix, word variant |
| Academic Units: | The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield) |
| Depositing User: | Repository Assistant |
| Date Deposited: | 02 Aug 2006 |
| Last Modified: | 08 Feb 2013 16:49 |
| Published Version: | http://dx.doi.org/10.1108/00330330610681295 |
| Status: | Published |
| Publisher: | Emerald |
| Refereed: | Yes |
| Identification Number: | 10.1108/00330330610681295 |
| Related URLs: | |
| URI: | http://eprints.whiterose.ac.uk/id/eprint/1434 |
Actions (login required)
![]() |
View Item |





