Nawab, R.M.A., Stevenson, M. and Clough, P. (2011) External Plagiarism Detection using Information Retrieval and Sequence Alignment - Notebook for PAN at CLEF 2011. In: Petras, V., Forner, P. and Clough, P., (eds.) Proceedings of the 5th International Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse. CLEF 2011 Conference on Multilingual and Multimodal Information Access Evaluation, 19-22 Sep 2011, Amsterdam, Netherlands.
Abstract
This paper describes the University of Sheffield entry for the 3rd International Competition on Plagiarism Detection which attempted the monolingual external plagiarism detection task. A three stage framework was used: preprocessing and indexing, candidate document selection (using an Information Retrieval based approach) and detailed analysis (using the Running Karp-Rabin Greedy String Tiling algorithm). The submitted system obtained an overall performance of 0.0804, precision of 0.2780, recall of 0.0885 and granularity of 2.18 in the formal evaluation.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Editors: |
|
Copyright, Publisher and Additional Information: | © 2011 CLEF. This is an author produced version of a paper subsequently published in Proceedings of the 5th International Workshop on Uncovering Plagiarism, Authorship, and Social Software Misuse. Uploaded in accordance with the publisher's self-archiving policy. |
Keywords: | External plagiarism detection; Information retrieval; Greedy string tiling. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 14 Apr 2014 14:36 |
Last Modified: | 19 Dec 2022 13:26 |
Published Version: | http://www.informatik.uni-trier.de/~ley/db/conf/cl... |
Status: | Published |
Refereed: | Yes |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:78502 |