Nawab, R.M.A., Stevenson, M. and Clough, P. (2011) External Plagiarism Detection using Information Retrieval and Sequence Alignment - Notebook for PAN at CLEF 2011. In: Petras, V., Forner, P. and Clough, P., (eds.) Proceedings of the 5th International Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse. CLEF 2011 Conference on Multilingual and Multimodal Information Access Evaluation, 19-22 Sep 2011, Amsterdam, Netherlands.
Abstract
This paper describes the University of Sheffield entry to the 3rd International Competition on Plagiarism Detection, which addressed the monolingual external plagiarism detection task. A three-stage framework was used: preprocessing and indexing, candidate document selection (using an Information Retrieval-based approach) and detailed analysis (using the Running Karp-Rabin Greedy String Tiling algorithm). In the formal evaluation, the submitted system obtained an overall performance of 0.0804, with a precision of 0.2780, a recall of 0.0885 and a granularity of 2.18.
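The reported figures can be related to one another with a short calculation. The sketch below assumes that the "overall performance" is the plagdet measure used in the PAN competitions, that is, the F1 score of precision and recall divided by log2(1 + granularity). The abstract does not state the formula, and the function name `plagdet` is illustrative only.

```python
import math


def plagdet(precision: float, recall: float, granularity: float) -> float:
    """Assumed PAN overall score: F1 discounted by log2(1 + granularity)."""
    if precision + recall == 0:
        return 0.0
    # Harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    # Granularity above 1 (one case reported as several detections) lowers the score.
    return f1 / math.log2(1 + granularity)


# Values taken from the abstract.
score = plagdet(precision=0.2780, recall=0.0885, granularity=2.18)
print(round(score, 4))
```

Under this assumption the three component values reproduce the reported overall score of about 0.0804, which suggests the low recall and the granularity above 2 both weigh on the final figure.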
Metadata
Authors/Creators: | Nawab, R.M.A.; Stevenson, M.; Clough, P. |
---|---|
Copyright, Publisher and Additional Information: | © 2011 CLEF. This is an author produced version of a paper subsequently published in Proceedings of the 5th International Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse. Uploaded in accordance with the publisher's self-archiving policy. |
Keywords: | External plagiarism detection; Information retrieval; Greedy string tiling. |
Dates: | Published: 2011 |
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield); The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 14 Apr 2014 14:36 |
Last Modified: | 19 Dec 2022 13:26 |
Published Version: | http://www.informatik.uni-trier.de/~ley/db/conf/cl... |
Status: | Published |
Refereed: | Yes |