Plagiarism Detection System for the Kurdish Language

Full Text (PDF, 840KB), PP.64-71

Views: 0 Downloads: 0

Author(s)

Karzan Wakil 1,* Muhammad Ghafoor 1 Mehyeddin Abdulrahman 1 Shvan Tariq 1

1. University of Human Development-Iraq

* Corresponding author.

DOI: https://doi.org/10.5815/ijitcs.2017.12.08

Received: 6 Sep. 2017 / Revised: 17 Sep. 2017 / Accepted: 22 Sep. 2017 / Published: 8 Dec. 2017

Index Terms

Plagiarism Detection, Plagiarism Detection System, N-Gram, Kurdish Language, Theft

Abstract

One of the serious issues is plagiarism, especially in the education field. Detecting the plagiarism became a challenging task, particularly in natural language texts. In the past years, some plagiarism detection tools have been developed for diverse natural languages, mainly English. Language-independent tools exist as well but are considered as too restrictive as they usually do not consider specific language features. The problem is there is no plagiarism Detection system for the Kurdish language.  In this paper, we introduce a new system for plagiarism detection for Kurdish Language, based on n-gram algorithm, our system can detect the word, phrases, and paragraphs. Moreover, our system effectiveness for detect plagiarist texts in localhost and online especially in Google search engine. This system is more useful for the academic organizations such as schools, institutes, and universities for finding copied texts from another document.

Cite This Paper

Karzan Wakil, Muhammad Ghafoor, Mehyeddin Abdulrahman, Shvan Tariq, "Plagiarism Detection System for the Kurdish Language", International Journal of Information Technology and Computer Science(IJITCS), Vol.9, No.12, pp.64-71, 2017. DOI:10.5815/ijitcs.2017.12.08

Reference

[1]plagiarism.com, "glatt plagiarism services," 2017.

[2]UKessays, "A Survey Of Plagiarism Detection Methods Information Technology Essay," 2015.

[3]Plagiarism.org, "What is Plagiarism?" 2015.

[4]A. Jadalla and A. Elnagar, "A plagiarism detection system for Arabic text-based documents," in Pacific-Asia Workshop on Intelligence and Security Informatics, 2012, pp. 145-153: Springer.

[5]C. Lyon, R. Barrett, and J. Malcolm, Plagiarism is easy, but also easy to detect. Ann Arbor, MI: Scholarly Publishing Office, University of Michigan Library, 2006.

[6]R. Lukashenko, V. Graudina, and J. Grundspenkis, "Computer-based plagiarism detection methods and tools: an overview," in Proceedings of the 2007 international conference on Computer systems and technologies, 2007, p. 40: ACM.

[7]R. Ibrahim, S. Saeed, and K. Wakil, "Plagiarism Detection Techniques for Arabic Script Languages: A Literature Review," Kurdistan Journal of Applied Research, vol. 2, no. 3, 2017.

[8]N. Meuschke and B. Gipp, "State-of-the-art in detecting academic plagiarism," International Journal for Educational Integrity, vol. 9, no. 1, 2013.

[9]A. Riad, F. Farahat, A. Asem, and M. Zaher, "Studying different methods for plagiarism detection," International Journal of Computer Science, vol. 2, no. 5, pp. 147-154, 2013.

[10]J. Ferrero, L. Besacier, D. Schwab, and F. Agnes, "Deep Investigation of Cross-Language Plagiarism Detection Methods," arXiv preprint arXiv:1705.08828, 2017.

[11]A. S. Hussein, "Arabic document similarity analysis using n-grams and singular value decomposition," in Research Challenges in Information Science (RCIS), 2015 IEEE 9th International Conference on, 2015, pp. 445-455: IEEE.

[12]W. Adouane and S. Dobnik, "Identification of Languages in Algerian Arabic Multilingual Documents," WANLP 2017 (co-located with EACL 2017), p. 1, 2017.

[13]A. A. Raza, A. Athar, and S. Nadeem, "N-Gram Based Authorship Attribution in Urdu Poetry," in Proceedings of the Conference on Language & Technology, 2009, pp. 88-93.

[14]M. E. B. Menai, "Detection of plagiarism in Arabic documents," International journal of information technology and computer science (IJITCS), vol. 4, no. 10, p. 80, 2012.

[15]M. Hussein, H. M. Mousa, and R. M. Sallam, "Arabic Text Categorization Using Mixed Words," 2016.

[16]S. M. Alzahrani, N. Salim, and A. Abraham, "Understanding plagiarism linguistic patterns, textual features, and detection methods," IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), vol. 42, no. 2, pp. 133-149, 2012.

[17]I. Bensalem, P. Rosso, and S. Chikhi, "Intrinsic Plagiarism Detection using N-gram Classes," in EMNLP, 2014, pp. 1459-1464.