Non-relevant document reduction in anti-plagiarism using asymmetric similarity and AVL tree index

Abstract
Anti-plagiarism applications have been developed using various approaches. Many methods compare one document to others, regardless of their relevance. This paper proposes a method to reduce non-relevant documents (those having no similar topic with query document) by using asymmetric similarity. Whole documents are collected in one corpus. Each document is preprocessed using winnowing algorithm. The feature from winnowing is then indexed using AVL Tree algorithm to fasten document comparing process. The result shows that reducing non-relevant document shortens almost 10 times of the processing time compared to non-reduced process. Meanwhile, both processes show the same accuracy of 89.78% to give suspected documents.

Link:
https://ieeexplore.ieee.org/abstract/document/6869547/

@inproceedings{oktoveri2014non,
  title={Non-relevant document reduction in anti-plagiarism using asymmetric similarity and AVL tree index},
  author={Oktoveri, Adeva and Wibowo, Agung Toto and Barmawi, Ari Moesriami},
  booktitle={Intelligent and Advanced Systems (ICIAS), 2014 5th International Conference on},
  pages={1--5},
  year={2014},
  organization={IEEE}
}

Leave a Reply

Your email address will not be published. Required fields are marked *