Abstract
In this paper, we present a plagiarized source retrieval system called CopyCaptor using global word frequency and local feedback to generate an effective query for finding plagiarized source documents from the given suspicious document on PAN'13 source retrieval task. The system achieved 3rd place in competition with 0.33 F1 score, 0.50 precision and 0.33 recall on the test which find appropriate source documents of 58 suspicious documents from approx. 1 billion web pages.
Original language | English |
---|---|
Title of host publication | CEUR Workshop Proceedings |
Publisher | CEUR-WS |
Volume | 1179 |
Publication status | Published - 2013 |
Event | 2013 Working Notes for CLEF Conference, CLEF 2013 - Valencia, Spain Duration: 2013 Sept 23 → 2013 Sept 26 |
Other
Other | 2013 Working Notes for CLEF Conference, CLEF 2013 |
---|---|
Country/Territory | Spain |
City | Valencia |
Period | 13/9/23 → 13/9/26 |
ASJC Scopus subject areas
- General Computer Science