International Journal of Intelligent Systems and Applications (IJISA)

IJISA Vol. 16, No. 5, Oct. 2024

Cover page and Table of Contents: PDF (size: 237KB)

Table Of Contents

REGULAR PAPERS

Real-time Deep Learning Based Mobile Application for Detecting Edible Fungi: Mushapp

By Remzi Gurfidan Zekeriya AKCAY

DOI: https://doi.org/10.5815/ijisa.2024.05.01, Pub. Date: 8 Oct. 2024

Mushroom consumption and wild mushroom gathering are increasing in our country and in the world. Mushroom poisoning has an important place in food poisoning cases. Mushroom poisoning accounts for approximately 7% of poisoning cases in adults. Mushroom collection and consumption is common in many regions of our country. In this study, a deep learning based mobile application was developed to reduce the incidence of mushroom poisoning by taking a photo of a mushroom and determining the type and toxicity of the mushroom from the photo. This mobile application is called MushAPP. In the first phase of the study, 5150 mushroom images of 20 mushroom species were used to create the dataset. The dataset was then pre-processed and converted into a format that can be used by the deep learning algorithm. The mobile application side of the project was developed in Android Studio IDE environment. An artificial intelligence model was integrated into the designed mobile application. In the application, the type and toxicity status of the mushroom viewed from the mobile device camera are determined and presented to the user. The research findings were analyzed and it was determined that the accuracy rate of the application in detecting the mushroom species was 99.8%.

[...] Read more.
Data-driven Approximation of Cumulative Distribution Function Using Particle Swarm Optimization based Finite Mixtures of Logistic Distribution

By Rajasekharreddy Poreddy Gopi E. S.

DOI: https://doi.org/10.5815/ijisa.2024.05.02, Pub. Date: 8 Oct. 2024

This paper proposes a data-driven approximation of the Cumulative Distribution Function using the Finite Mixtures of the Cumulative Distribution Function of Logistic distribution. Since it is not possible to solve the logistic mixture model using the Maximum likelihood method, the mixture model is modeled to approximate the empirical cumulative distribution function using the computational intelligence algorithms. The Probability Density Function is obtained by differentiating the estimate of the Cumulative Distribution Function. The proposed technique estimates the Cumulative Distribution Function of different benchmark distributions. Also, the performance of the proposed technique is compared with the state-of-the-art kernel density estimator and the Gaussian Mixture Model. Experimental results on κ−μ distribution show that the proposed technique performs equally well in estimating the probability density function. In contrast, the proposed technique outperforms in estimating the cumulative distribution function. Also, it is evident from the experimental results that the proposed technique outperforms the state-of-the-art Gaussian Mixture model and kernel density estimation techniques with less training data.

[...] Read more.
Performance Evaluation of Deep Learning Architectures for Blood Pressure Estimation Using Photoplethysmography

By Mohammed Attya

DOI: https://doi.org/10.5815/ijisa.2024.05.03, Pub. Date: 8 Oct. 2024

High blood pressure (BP) monitoring Blood pressure (BP) is one of the common cardiovascular diseases and therefore the early high blood pressure (hypertension) detection, management, and prevention are mandatory. One promising method of continuous, non-invasive blood pressure estimation is photoplethysmography (PPG). In this study, a novel method was proposed to introduce the AlexNet framework into the time-frequency domain for classification of BP levels based on PPG signals. The study was conducted using the publicly available Figshare dataset which offers PPG signals, and the blood pressure labels against them. Data balancing techniques were used to alleviate class imbalances. Preprocessing and Feature Extraction of PPG Signals. The PPG signals were preprocessed with noise filtering and signals were then transformed from 1D-time to image to facilitate robust feature extraction. The proposed classification model, based on AlexNet showed the best result, with 98.89% accuracy, recall, and precision, and 99.44% specificity. This model outperformed alternative models (VGG16, DenseNet, ResNet50, GoogleNet) for classifying BP levels into the JNC 7 report standard categories normotension, prehypertension and hypertension. This study has two primary contributions. Initially, it demonstrates the efficacy of AlexNet model to extract meaningful features from PPG signals by its hierarchical convolutional and max-pooling layers thereby enabling accurate classification of BP levels. This study underscores the potential of deep learning and PPG signals for developing a highly accurate and truly non-invasive BP monitoring system. In the second aspect, the study offers a systematic assessment and comparison of the proposed over other well-known deep-learning networks, presenting the effectiveness of the AlexNet-based one. These results are of critical importance in the development of novel non-invasive BP monitoring modalities and optimization of cardiovascular health managements and personalized health cares.

[...] Read more.
Deep Learning for Robust Facial Expression Recognition: A Resilient Defense Against Adversarial Attacks

By Tinuk Agustin Moch. Hari Purwidiantoro Mochammad Luthfi Rahmadi

DOI: https://doi.org/10.5815/ijisa.2024.05.04, Pub. Date: 8 Oct. 2024

Adversarial attacks can be extremely dangerous, particularly in scenarios where the precision of facial expression identification is of utmost importance. Hiring adversarial training methods proves effective in mitigating these threats. Although effective, this technique requires large computing resources. This study aims to strengthen deep learning model resilience against adversarial attacks while optimizing performance and resource efficiency. Our proposed method uses adversarial training techniques to create adversarial examples, which are permanently stored as a separate dataset. This strategy helps the model learn and enhances its resilience to adversarial attacks. This study also evaluates models by subjecting them to adversarial attacks, such as the One Pixel Attack and the Fast Gradient Sign Method, to identify any potential vulnerabilities. Moreover, we use two different model architectures to see how well they are protected against adversarial attacks. It compared their performances to determine the best model for making systems more resistant while still maintaining good performance. The findings show that the combination of the proposed adversarial training technique and an efficient model architecture outcome in increased resistance to adversarial attacks. This also improves the reliability of the model and saves more resources for computation. This is evidenced by the high accuracy results achieved at 98.81% accuracy on the CK+ datasets. The adversarial training technique proposed in this study offers an efficient alternative to overcome the limitations of computational resources. This fortifies the model against adversarial attacks, resulting in significant increases in model resilience without loss of performance.

[...] Read more.
Reduction of Misclassifications in Wildfire Detection: A Weighted Ensemble Deep Learning Approach

By Lawrence Owusu Robert B Eshun Leila Hashemi-Beni Ali AlQahtani Masud R Rashel AKM K. Islam

DOI: https://doi.org/10.5815/ijisa.2024.05.05, Pub. Date: 8 Oct. 2024

Governments worldwide are increasingly prioritizing early wildfire detection to safeguard lives, property, and the environment. Although CNN-based models have demonstrated exceptional performance in various computer vision applications, the evolving nature of wildfire images poses significant challenges for a single CNN-based model in wildfire detection. In this study, we addressed this issue by integrating and weighting the differential learning capabilities of three individual transfer learning models: InceptionV3, ResNet50, and VGG16. Experimental results show that the ensemble deep learning models significantly outperformed all single classifiers across all performance metrics. Both the ensemble and weighted ensemble deep learning models achieved 99.7% accuracy, 99.5% precision, 100% recall, 99.8% F1-score, 0.5%false positive rate, 0.0% false negative rate and 0.3% error rate. Additionally, these models reduced the error rate by 98%, 91%, and 40% compared to the error rates of ResNet50, InceptionV3, and VGG16 respectively. A false negative rate of 0% indicates that our proposed ensemble deep learning models identified and predicted all the wildfire instances present in the test set correctly without a single misclassification. This positions our proposed ensemble deep learning models as superior choices for reducing misclassifications in wildfire detection.

[...] Read more.
Identification of Customer Through Voice Biometric System in Call Centres

By Amjad Hassan Khan M. K. P. S. Aithal

DOI: https://doi.org/10.5815/ijisa.2024.05.06, Pub. Date: 8 Oct. 2024

In recent times, there has been a growing emphasis on adjusting communication strategies to foster strong customer relationships. This shift is driven by intensified competition, market maturation, and swift advancements in business technology. Consequently, companies have established call centers to efficiently handle customer support and fulfil customer inquiries. A pivotal aspect of enhancing service quality within these call centers involves accurately identifying customers during their interactions. The primary objective of this study is to introduce a methodology for identifying customers within call centers by analysing their voice characteristics. Voice authentication (VA) has gained prominence in critical security operations, including banking transactions and conversations within call centers. The susceptibility of automatic speaker verification systems (ASVs) to deceptive spoofing attacks has prompted the development of countermeasures (CMs). These countermeasures are designed to differentiate between authentic and fabricated speech. ASVs and CMs collectively constitute contemporary VA systems, positioned as robust access control mechanisms. To achieve this goal, various customer identification systems within call centers have been examined, along with an analysis of audio signal attributes. Ultimately, the manuscript presents a novel approach to customer identification through voice biometrics. Notably, this method excels in recognizing customers even when provided with limited voice data. Empirical findings demonstrate that the suggested speaker identity confirmation method outperforms alternative techniques utilizing different algorithms, exhibiting a higher recognition rate. The present research work is based on two important perspectives of the call centres: a. call center agents experience and b. customer experience. The data collected separately from customers and agents for understanding the effective usage of voice biometric system in call centres. The data represented and satisfies the effectiveness of voice biometric system from both the perspectives. From the data it is also cleared that, the implementation of voice biometric system in call centres still have long way to go but will be a major technological change for the industries worldwide.

[...] Read more.