Zeyad Ahmed

Work place: Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt

E-mail: zeyadfahem1@gmail.com

Website:

Research Interests: Computer Vision, Natural Language Processing

Biography

Zeyad Ahmed is a Software Engineer. Zeyad earned his bachelor’s degree (2022) in Computer Science, majoring in Artificial Intelligence, from the Faculty of Computer and Information Sciences at Ain Shams University, Cairo, Egypt. His research interests include Natural Language Processing, Computer Vision, and AI Knowledge Retention Systems.

Author Articles
Introducing Arabic-SQuADv2.0 for Effective Arabic Machine Reading Comprehension

By Zeyad Ahmed Mariam Zeyada Youssef Amin Donia Gamal Hanan Hindy

DOI: https://doi.org/10.5815/ijeme.2023.05.03, Pub. Date: 8 Oct. 2023

Machine Reading Comprehension (MRC), known as the ability of computers to read and understand unstructured text and then answer questions, is still an open research field. MRC is considered one of the most research-demanding sub-tasks in Natural Language Processing (NLP) and Natural Language Understanding (NLU). MRC introduces multiple research challenges. One of these challenges is that the models should be trained to answer all questions and abstain from answering when the answer is not covered in the given context. Another challenge lies in dataset availability. These challenges are amplified for non-Latin-based languages; Arabic as an example. Currently, available Arabic MCR datasets are either small-sized high-quality collections or large-sized low-quality datasets. Additionally, they do not include unanswerable questions. This lack of resources depicts the model as incapable of real-world deployments. To tackle these challenges, this paper proposes a novel large-size high-quality Arabic MRC dataset that includes unanswerable questions, named “Arabic-SQuAD v2.0'”. The dataset consists of 96051 triplets {question, context, answer} in an attempt to help enrich the field of Arabic-MRC. Furthermore, a Machine Learning (ML)-based model is introduced that is capable of effectively solving Arabic MRC-with-unanswerable questions. The results of the proposed model are satisfactory and comparable with Latin-based language models. Furthermore, the results show a significant improvement of the current state-of-the-art Arabic MRC. To be exact, the model scores 71.49 F1-score and 65.12 Exact Match (EM). This proposed dataset and implementation pave the way to further Arabic MRC; aiming to reach a state when MRC models could mimic human text reasoning.

[...] Read more.
Other Articles