Web crawler utilization for resource search on Indonesian anti-plagiarism detection: Pemanfaatan web crawler untuk pencarian referensi pada deteksi anti-plagiarisme dokumen Bahasa Indonesia
A. Wibowo, A. Arifianto, Adeva Oktoveri, Arif M Barmawi
{"title":"Web crawler utilization for resource search on Indonesian anti-plagiarism detection: Pemanfaatan web crawler untuk pencarian referensi pada deteksi anti-plagiarisme dokumen Bahasa Indonesia","authors":"A. Wibowo, A. Arifianto, Adeva Oktoveri, Arif M Barmawi","doi":"10.1109/CYBERNETICSCOM.2013.6865793","DOIUrl":null,"url":null,"abstract":"Matching one document with other documents is one of anti-plagiarism tasks. Matching can be performed both intra and extra-corpal. This paper will discuss extra-corpal matching utilize the web crawlers as reference search. The role of web-crawler described in extra-corpal anti-plagiarism architecture. Matching of plagiarism indication will use Modified Histogram Intersection based on N-Gram of term. Similarity value utilizing modified normalized histogram intersection that devoted to matching extra corpal. Based on our experiment the best accuracy is given in 0.4 and 0.5 threshold value that give 94% accuracy.","PeriodicalId":351051,"journal":{"name":"2013 IEEE International Conference on Computational Intelligence and Cybernetics (CYBERNETICSCOM)","volume":"126 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE International Conference on Computational Intelligence and Cybernetics (CYBERNETICSCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CYBERNETICSCOM.2013.6865793","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Matching one document with other documents is one of anti-plagiarism tasks. Matching can be performed both intra and extra-corpal. This paper will discuss extra-corpal matching utilize the web crawlers as reference search. The role of web-crawler described in extra-corpal anti-plagiarism architecture. Matching of plagiarism indication will use Modified Histogram Intersection based on N-Gram of term. Similarity value utilizing modified normalized histogram intersection that devoted to matching extra corpal. Based on our experiment the best accuracy is given in 0.4 and 0.5 threshold value that give 94% accuracy.