The Porter stemming algorithm: then and now

Willett, P. (2006) The Porter stemming algorithm: then and now. Program: Electronic Library and Information Systems, 40 (3). pp. 219-223. ISSN 0033-0337

Full text available as: willettp9_PorterStemmingReview.pdf (134Kb)

Abstract

Purpose: In 1980, Porter presented a simple algorithm for stemming English-language words. This paper summarises the main features of the algorithm, and highlights its role not just in modern information retrieval research, but also in a range of related subject domains.

Design: Review of literature and research involving use of the Porter algorithm.

Findings: The algorithm has been widely adopted and extended so that it has become the standard approach to word conflation for information retrieval in a wide range of languages.

Value: The 1980 paper in Program by Porter describing his algorithm has been highly cited. This paper provides a context for the original paper as well as an overview of its subsequent use.

Item Type: Article
Copyright, Publisher and Additional Information: This is an author produced version of a paper published in Program.
Keywords: conflation, information retrieval, Porter stemming algorithm, stemming algorithm, suffix, word variant
Institution: The University of Sheffield
Academic Units: The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield)
Depositing User: Repository Assistant
Date Deposited: 02 Aug 2006
Last Modified: 04 Jun 2014 12:58
Published Version: http://dx.doi.org/10.1108/00330330610681295
Status: Published
Publisher: Emerald
Refereed: Yes
Identification Number: 10.1108/00330330610681295
URI: http://eprints.whiterose.ac.uk/id/eprint/1434
