Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Computer Engineering

Classification

Institution
Publication Year
Publication
Publication Type

Articles 1 - 30 of 146

Full-Text Articles in Engineering

Machine Learning Approaches In Comparative Studies For Alzheimer’S Diagnosis Using 2d Mri Slices, Zhen Zhao, Joon Huang Chuah, Chee-Onn Chow, Kaijian Xia, Yee Kai Tee, Yan Chai Hum, Khin Wee Lai Feb 2024

Machine Learning Approaches In Comparative Studies For Alzheimer’S Diagnosis Using 2d Mri Slices, Zhen Zhao, Joon Huang Chuah, Chee-Onn Chow, Kaijian Xia, Yee Kai Tee, Yan Chai Hum, Khin Wee Lai

Turkish Journal of Electrical Engineering and Computer Sciences

Alzheimer’s disease (AD) is an illness that involves a gradual and irreversible degeneration of the brain. It is crucial to establish a precise diagnosis of AD early on in order to enable prompt therapies and prevent further deterioration. Researchers are currently focusing increasing attention on investigating the potential of machine learning techniques to simplify the automated diagnosis of AD using neuroimaging. The present study involved a comparison of models for the detection of AD through the utilization of 2D image slices obtained from magnetic resonance imaging brain scans. Five models, namely ResNet, ConvNeXt, CaiT, Swin Transformer, and CVT, were implemented …


Deep Feature Extraction, Dimensionality Reduction, And Classification Of Medical Images Using Combined Deep Learning Architectures, Autoencoder, And Multiple Machine Learning Models, Ahmet Hi̇dayet Ki̇raz, Fatime Oumar Djibrillah, Mehmet Emi̇n Yüksel Oct 2023

Deep Feature Extraction, Dimensionality Reduction, And Classification Of Medical Images Using Combined Deep Learning Architectures, Autoencoder, And Multiple Machine Learning Models, Ahmet Hi̇dayet Ki̇raz, Fatime Oumar Djibrillah, Mehmet Emi̇n Yüksel

Turkish Journal of Electrical Engineering and Computer Sciences

Accurate analysis and classification of medical images are essential factors in clinical decision-making and patient care. A novel comparative approach for medical image classification is proposed in this study. This new approach involves several steps: deep feature extraction, which extracts the informative features from medical images; concatenation, which concatenates the extracted deep features to form a robust feature vector; dimensionality reduction with autoencoder, which reduces the dimensionality of the feature vector by transforming it into a different feature space with a lower dimension; and finally, these features obtained from all these steps were fed into multiple machine learning classifiers (SVM, …


Cognitive Digital Modelling For Hyperspectral Image Classification Using Transfer Learning Model, Mohammad Shabaz, Mukesh Soni Oct 2023

Cognitive Digital Modelling For Hyperspectral Image Classification Using Transfer Learning Model, Mohammad Shabaz, Mukesh Soni

Turkish Journal of Electrical Engineering and Computer Sciences

Deep convolutional neural networks can fully use the intrinsic relationship between features and improve the separability of hyperspectral images, which has received extensive in recent years. However, the need for a large number of labelled samples to train deep network models limits the application of such methods. The idea of transfer learning is introduced into remote sensing image classification to reduce the need for the number of labelled samples. In particular, the situation in which each class in the target picture only has one labelled sample is investigated. In the target domain, the number of training samples is enlarged by …


Classification Of Chronic Pain Using Fmri Data: Unveiling Brain Activity Patterns For Diagnosis, Rejula V, Anitha J, Belfin Robinson Oct 2023

Classification Of Chronic Pain Using Fmri Data: Unveiling Brain Activity Patterns For Diagnosis, Rejula V, Anitha J, Belfin Robinson

Turkish Journal of Electrical Engineering and Computer Sciences

Millions of people throughout the world suffer from the complicated and crippling condition of chronic pain. It can be brought on by several underlying disorders or injuries and is defined by chronic pain that lasts for a period exceeding three months. To better understand the brain processes behind pain and create prediction models for pain-related outcomes, machine learning is a potent technology that may be applied in Functional magnetic resonance imaging (fMRI) chronic pain research. Data (fMRI and T1-weighted images) from 76 participants has been included (30 chronic pain and 46 healthy controls). The raw data were preprocessed using fMRIprep …


Stepwise Dynamic Nearest Neighbor (Sdnn): A New Algorithm For Classification, Deni̇z Karabaş, Derya Bi̇rant, Peli̇n Yildirim Taşer Sep 2023

Stepwise Dynamic Nearest Neighbor (Sdnn): A New Algorithm For Classification, Deni̇z Karabaş, Derya Bi̇rant, Peli̇n Yildirim Taşer

Turkish Journal of Electrical Engineering and Computer Sciences

Although the standard k-nearest neighbor (KNN) algorithm has been used widely for classification in many different fields, it suffers from various limitations that abate its classification ability, such as being influenced by the distribution of instances, ignoring distances between the test instance and its neighbors during classification, and building a single/weak learner. This paper proposes a novel algorithm, called stepwise dynamic nearest neighbor (SDNN), which can effectively handle these problems. Instead of using a fixed parameter k like KNN, it uses a dynamic neighborhood strategy according to the data distribution and implements a new voting mechanism, called stepwise voting. Experimental …


A Machine Learning Approach For Dyslexia Detection Using Turkish Audio Records, Tuğberk Taş, Muhammed Abdullah Bülbül, Abas Haşi̇moğlu, Yavuz Meral, Yasi̇n Çalişkan, Gunay Budagova, Mücahi̇d Kutlu Sep 2023

A Machine Learning Approach For Dyslexia Detection Using Turkish Audio Records, Tuğberk Taş, Muhammed Abdullah Bülbül, Abas Haşi̇moğlu, Yavuz Meral, Yasi̇n Çalişkan, Gunay Budagova, Mücahi̇d Kutlu

Turkish Journal of Electrical Engineering and Computer Sciences

Dyslexia is a learning disorder, characterized by impairment in the ability to read, spell, and decode letters. It is vital to detect dyslexia in earlier stages to reduce its effects. However, diagnosing dyslexia is a time-consuming and costly process. In this paper, we propose a machine-learning model that predicts whether a Turkish-speaking child has dyslexia using his/her audio records. Therefore, our model can be easily used by smart phones and work as a warning system such that children who are likely to be dyslexic according to our model can seek an examination by experts. In order to train and evaluate, …


Compatibility Of Clique Clustering Algorithm With Dimensionality Reduction, Ug ̆Ur Madran, Duygu Soyog ̆Lu Sep 2023

Compatibility Of Clique Clustering Algorithm With Dimensionality Reduction, Ug ̆Ur Madran, Duygu Soyog ̆Lu

Applied Mathematics & Information Sciences

In our previous work, we introduced a clustering algorithm based on clique formation. Cliques, the obtained clusters, are constructed by choosing the most dense complete subgraphs by using similarity values between instances. The clique algorithm successfully reduces the number of instances in a data set without substantially changing the accuracy rate. In this current work, we focused on reducing the number of features. For this purpose, the effect of the clique clustering algorithm on dimensionality reduction has been analyzed. We propose a novel algorithm for support vector machine classification by combining these two techniques and applying different strategies by differentiating …


Risk Assessment Approaches In Banking Sector –A Survey, Mona Sharaf, Shimaa Mohamed Ouf, Amira M. Idrees Ami Jul 2023

Risk Assessment Approaches In Banking Sector –A Survey, Mona Sharaf, Shimaa Mohamed Ouf, Amira M. Idrees Ami

Future Computing and Informatics Journal

Prediction analysis is a method that makes predictions based on the data currently available. Bank loans come with a lot of risks to both the bank and the borrowers. One of the most exciting and important areas of research is data mining, which aims to extract information from vast amounts of accumulated data sets. The loan process is one of the key processes for the banking industry, and this paper examines various prior studies that used data mining techniques to extract all served entities and attributes necessary for analytical purposes, categorize these attributes, and forecast the future of their business …


A Practical Framework For Early Detection Of Diabetes Using Ensemble Machine Learning Models, Qusay Saihood, Emrullah Sonuç Jul 2023

A Practical Framework For Early Detection Of Diabetes Using Ensemble Machine Learning Models, Qusay Saihood, Emrullah Sonuç

Turkish Journal of Electrical Engineering and Computer Sciences

The diagnosis of diabetes, a prevalent global health condition, is crucial for preventing severe complications. In recent years, there has been a growing effort to develop intelligent diagnostic systems for diabetes utilizing machine learning (ML) algorithms. Despite these efforts, achieving high accuracy rates using such systems remains a significant challenge. Recent advancements in ensemble ML methods offer promising opportunities for early detection of diabetes, as they are known to be faster and more cost-effective than traditional approaches. Therefore, this study proposes a practical framework for diagnosing diabetes that involves three stages. The data preprocessing stage encompasses several crucial tasks, including …


Assessment Of E-Senses Performance Through Machine Learning Models For Colombian Herbal Teas Classification, Jeniffer Katerine Carrillo, Cristhian Manuel Durán, Juan Martin Cáceres, Carlos Alberto Cuastumal, Jordana Ferreira, José Ramos, Brian Bahder, Martin Oates, Antonio Ruiz Jun 2023

Assessment Of E-Senses Performance Through Machine Learning Models For Colombian Herbal Teas Classification, Jeniffer Katerine Carrillo, Cristhian Manuel Durán, Juan Martin Cáceres, Carlos Alberto Cuastumal, Jordana Ferreira, José Ramos, Brian Bahder, Martin Oates, Antonio Ruiz

CCE Faculty Articles

This paper describes different E-Senses systems, such as Electronic Nose, Electronic Tongue, and Electronic Eyes, which were used to build several machine learning models and assess their performance in classifying a variety of Colombian herbal tea brands such as Albahaca, Frutos Verdes, Jaibel, Toronjil, and Toute. To do this, a set of Colombian herbal tea samples were previously acquired from the instruments and processed through multivariate data analysis techniques (principal component analysis and linear discriminant analysis) to feed the support vector machine, K-nearest neighbors, decision trees, naive Bayes, and random forests algorithms. The results of the E-Senses were validated using …


User Classification Based On Mouse Dynamic Authentication Using K-Nearest Neighbor, Didih Rizki Chandranegara, Anzilludin Ashari, Zamah Sari, Hardianto Wibowo, Wildan Suharso Apr 2023

User Classification Based On Mouse Dynamic Authentication Using K-Nearest Neighbor, Didih Rizki Chandranegara, Anzilludin Ashari, Zamah Sari, Hardianto Wibowo, Wildan Suharso

Makara Journal of Technology

Mouse dynamics authentication is a method for identifying a person by analyzing the unique pattern or rhythm of their mouse movement. Owing to its distinctive properties, such mouse movements can be used as the basis for security. The development of technology is followed by the urge to keep private data safe from hackers. Therefore, increasing the accuracy of user classification and reducing the false acceptance rate (FAR) are necessary to improve data security. In this study, we propose to combine the K-nearest neighbor method and simple random sampling and obtain a sample from a dataset to improve the classification of …


Domain Specific Analysis Of Privacy Practices And Concerns In The Mobile Application Market, Fahimeh Ebrahimi Meymand Apr 2023

Domain Specific Analysis Of Privacy Practices And Concerns In The Mobile Application Market, Fahimeh Ebrahimi Meymand

LSU Doctoral Dissertations

Mobile applications (apps) constantly demand access to sensitive user information in exchange for more personalized services. These-mostly unjustified-data collection tactics have raised major privacy concerns among mobile app users. Existing research on mobile app privacy aims to identify these concerns, expose apps with malicious data collection practices, assess the quality of apps' privacy policies, and propose automated solutions for privacy leak detection and prevention. However, existing solutions are generic, frequently missing the contextual characteristics of different application domains. To address these limitations, in this dissertation, we study privacy in the app store at a domain level. Our objective is to …


A Novel Insect And Pest Identification Model Based On A Weighted Multipath Convolutional Neural Network And Generative Adversarial Network, Vinita Abhishek Gupta, M.V. Padmavati, Ravi R. Saxena, Raunak Kumar Tamrakar Jan 2023

A Novel Insect And Pest Identification Model Based On A Weighted Multipath Convolutional Neural Network And Generative Adversarial Network, Vinita Abhishek Gupta, M.V. Padmavati, Ravi R. Saxena, Raunak Kumar Tamrakar

Karbala International Journal of Modern Science

Timely identification of insects and their management play a significant role in sustainable agriculture development. The proposed hybrid model integrates a weighted multipath convolutional neural network and generative adversarial network to identify insects efficiently. To address the shortcomings of single-path networks, this novel model takes input from numerous iterations of the same image to learn more specific features. To avoid redundancy produced due to multipath, weights have been assigned to each path. For Xie2 dataset, the model shows 3.75%, 2.74%, 1.54%, 1.76%, 1.76%, 2.74 %, and 2.14% performance improvement from AlexNet, ResNet50, ResNet101, GoogleNet, VGG-16, VGG-19, and simple CNN respectively. …


An Evaluation Of The Eeg Alpha-To-Theta And Theta-To-Alpha Band Ratios As Indexes Of Mental Workload, Bujar Raufi, Luca Longo Jan 2023

An Evaluation Of The Eeg Alpha-To-Theta And Theta-To-Alpha Band Ratios As Indexes Of Mental Workload, Bujar Raufi, Luca Longo

Articles

Many research works indicate that EEG bands, specifically the alpha and theta bands, have been potentially helpful cognitive load indicators. However, minimal research exists to validate this claim. This study aims to assess and analyze the impact of the alpha-to-theta and the theta-to-alpha band ratios on supporting the creation of models capable of discriminating self-reported perceptions of mental workload. A dataset of raw EEG data was utilized in which 48 subjects performed a resting activity and an induced task demanding exercise in the form of a multitasking SIMKAP test. Band ratios were devised from frontal and parietal electrode clusters. Building …


Deep Learning-Based Classification Of Chaotic Systems Over Phase Portraits, Sezgi̇n Kaçar, Süleyman Uzun, Burak Aricioğlu Jan 2023

Deep Learning-Based Classification Of Chaotic Systems Over Phase Portraits, Sezgi̇n Kaçar, Süleyman Uzun, Burak Aricioğlu

Turkish Journal of Electrical Engineering and Computer Sciences

This study performed a deep learning-based classification of chaotic systems over their phase portraits. To the best of the authors' knowledge, such classification studies over phase portraits have not been conducted in the literature. To that end, a dataset consisting of the phase portraits of the most known two chaotic systems, namely Lorenz and Chen, is generated for different values of the parameters, initial conditions, step size, and time length. Then, a classification with high accuracy is carried out employing transfer learning methods. The transfer learning methods used in the study are SqueezeNet, VGG-19, AlexNet, ResNet50, ResNet101, DenseNet201, ShuffleNet, and …


Early Diagnosis Of Pancreatic Cancer By Machine Learning Methods Using Urine Biomarker Combinations, İrem Acer, Firat Orhan Bulucu, Semra İçer, Fatma Lati̇foğlu Jan 2023

Early Diagnosis Of Pancreatic Cancer By Machine Learning Methods Using Urine Biomarker Combinations, İrem Acer, Firat Orhan Bulucu, Semra İçer, Fatma Lati̇foğlu

Turkish Journal of Electrical Engineering and Computer Sciences

The most common type of pancreatic cancer is pancreatic ductal adenocarcinoma (PDAC), which accounts for the vast majority of pancreatic cancers. The five-year survival rate for PDAC due to late diagnosis is 9%. Early diagnosed PDAC patients survive longer than patients diagnosed at a more advanced stage. Biomarkers can play an essential role in the early detection of PDAC to assist the health professional. Machine learning and deep learning methods are used with biomarkers obtained in recent studies for diagnostic purposes. In order to increase the survival rates of PDAC patients, early diagnosis of the disease with a noninvasive test …


Schizo-Net: A Novel Schizophrenia Diagnosis Framework Using Late Fusion Multimodal Deep Learning On Electroencephalogram-Based Brain Connectivity Indices, Nitin Grover, Aviral Chharia, Rahul Upadhyay, Luca Longo Jan 2023

Schizo-Net: A Novel Schizophrenia Diagnosis Framework Using Late Fusion Multimodal Deep Learning On Electroencephalogram-Based Brain Connectivity Indices, Nitin Grover, Aviral Chharia, Rahul Upadhyay, Luca Longo

Articles

Schizophrenia (SCZ) is a serious mental condition that causes hallucinations, delusions, and disordered thinking. Traditionally, SCZ diagnosis involves the subject’s interview by a skilled psychiatrist. The process needs time and is bound to human errors and bias. Recently, brain connectivity indices have been used in a few pattern recognition methods to discriminate neuro-psychiatric patients from healthy subjects. The study presents Schizo-Net , a novel, highly accurate, and reliable SCZ diagnosis model based on a late multimodal fusion of estimated brain connectivity indices from EEG activity. First, the raw EEG activity is pre-processed exhaustively to remove unwanted artifacts. Next, six brain …


Tutorial: Neuro-Symbolic Ai For Mental Healthcare, Kaushik Roy, Usha Lokala, Manas Gaur, Amit Sheth Oct 2022

Tutorial: Neuro-Symbolic Ai For Mental Healthcare, Kaushik Roy, Usha Lokala, Manas Gaur, Amit Sheth

Publications

Artificial Intelligence (AI) systems for mental healthcare (MHCare) have been ever-growing after realizing the importance of early interventions for patients with chronic mental health (MH) conditions. Social media (SocMedia) emerged as the go-to platform for supporting patients seeking MHCare. The creation of peer-support groups without social stigma has resulted in patients transitioning from clinical settings to SocMedia supported interactions for quick help. Researchers started exploring SocMedia content in search of cues that showcase correlation or causation between different MH conditions to design better interventional strategies. User-level Classification-based AI systems were designed to leverage diverse SocMedia data from various MH conditions, …


Analysis Of Patch And Sample Size Effects For 2d-3d Cnn Models Using Multiplatform Dataset: Hyperspectral Image Classification Of Rosis And Jilin-1 Gp01 Imagery, Taşkin Kavzoğlu, Eli̇f Özlem Yilmaz Sep 2022

Analysis Of Patch And Sample Size Effects For 2d-3d Cnn Models Using Multiplatform Dataset: Hyperspectral Image Classification Of Rosis And Jilin-1 Gp01 Imagery, Taşkin Kavzoğlu, Eli̇f Özlem Yilmaz

Turkish Journal of Electrical Engineering and Computer Sciences

Modern hyperspectral sensors provide a huge volume of data at spectral and spatial domains with high redundancy, which requires robust methods for analysis. In this study, 2D and 3D CNN models were applied to hyperspectral image datasets (ROSIS and Jilin-1 GP01) using varying patch and sample sizes to determine their combined impacts on the performance of deep learning models. Differences in classification performances in relation to particle and sample sizes were statistically analysed using McNemar?s test. According to the findings, raising the patch and sample size enhances the performance of the 2D/3D CNN model and produces more accurate results in …


Ml-Based Online Traffic Classification For Sdns, Mohammed Nsaif, Gergely Kovasznai, Mohammed Abboosh, Ali Malik, Ruairí De Fréin May 2022

Ml-Based Online Traffic Classification For Sdns, Mohammed Nsaif, Gergely Kovasznai, Mohammed Abboosh, Ali Malik, Ruairí De Fréin

Articles

Traffic classification is a crucial aspect for Software-Defined Networking functionalities. This paper is a part of an on-going project aiming at optimizing power consumption in the environment of software-defined datacenter networks. We have developed a novel routing strategy that can blindly balance between the power consumption and the quality of service for the incoming traffic flows. In this paper, we demonstrate how to classify the network traffic flows so that the quality of service of each flow-class can be guaranteed efficiently. This is achieved by creating a dataset that encompasses different types of network traffic such as video, VoIP, game …


Classification And Phenological Staging Of Crops From In Situ Image Sequences By Deep Learning, Uluğ Bayazit, Deni̇z Turgay Altilar, Ni̇lgün Güler Bayazit May 2022

Classification And Phenological Staging Of Crops From In Situ Image Sequences By Deep Learning, Uluğ Bayazit, Deni̇z Turgay Altilar, Ni̇lgün Güler Bayazit

Turkish Journal of Electrical Engineering and Computer Sciences

Accurate knowledge of crop type information is not only valuable for verifying the declaration of farmers to obtain subsidy or insurance for the grown crop, but also for generating crop type maps that serve a variety of purposes in land monitoring and policy. On the other hand, accurate knowledge of crop phenological stage can help farm personnel apply fertilization and irrigation regimes on a timely basis. Although deep learning based networks have been applied in the past to classify the type and predict the phenological stage of crops from in situ images of fields, more advanced deep learning based networks, …


A Machine Learning Framework For Identifying Molecular Biomarkers From Transcriptomic Cancer Data, Md Abdullah Al Mamun Mar 2022

A Machine Learning Framework For Identifying Molecular Biomarkers From Transcriptomic Cancer Data, Md Abdullah Al Mamun

FIU Electronic Theses and Dissertations

Cancer is a complex molecular process due to abnormal changes in the genome, such as mutation and copy number variation, and epigenetic aberrations such as dysregulations of long non-coding RNA (lncRNA). These abnormal changes are reflected in transcriptome by turning oncogenes on and tumor suppressor genes off, which are considered cancer biomarkers.

However, transcriptomic data is high dimensional, and finding the best subset of genes (features) related to causing cancer is computationally challenging and expensive. Thus, developing a feature selection framework to discover molecular biomarkers for cancer is critical.

Traditional approaches for biomarker discovery calculate the fold change for each …


The Analysis And Optimization Of Cnn Hyperparameters With Fuzzy Tree Modelfor Image Classification, Kübra Uyar, Şaki̇r Taşdemi̇r, İlker Ali̇ Özkan Mar 2022

The Analysis And Optimization Of Cnn Hyperparameters With Fuzzy Tree Modelfor Image Classification, Kübra Uyar, Şaki̇r Taşdemi̇r, İlker Ali̇ Özkan

Turkish Journal of Electrical Engineering and Computer Sciences

The meaningful performance of convolutional neural network (CNN) has enabled the solution of various state-of-the-art problems. Although CNNs achieve satisfactory results in computer-vision problems, they still have some difficulties. As the designed CNN models are deepened to achieve much better accuracy, computational cost and complexity increase. It is significant to train CNNs with suitable topology and training hyperparameters that include initial learning rate, minibatch size, epoch number, filter size, number of filters, etc. because the initialization of hyperparameters affects classification results. On the other hand, it is not possible to make a definite inference for the hyperparameter initialization and there …


Exploring The Concept Of The Digital Educator During Covid-19, Fernando Jimenez, Gracia Sanchez, Jose Palma, Luis Miralles-Pechuán, Juan A. Botia Jan 2022

Exploring The Concept Of The Digital Educator During Covid-19, Fernando Jimenez, Gracia Sanchez, Jose Palma, Luis Miralles-Pechuán, Juan A. Botia

Articles

T In many machine learning classification problems, datasets are usually of high dimensionality and therefore require efficient and effective methods for identifying the relative importance of their attributes, eliminating the redundant and irrelevant ones. Due to the huge size of the search space of the possible solutions, the attribute subset evaluation feature selection methods are not very suitable, so in these scenarios feature ranking methods are used. Most of the feature ranking methods described in the literature are univariate methods, which do not detect interactions between factors. In this paper, we propose two new multivariate feature ranking methods based on …


Optimized Cancer Detection On Various Magnified Histopathological Colon Imagesbased On Dwt Features And Fcm Clustering, Tina Babu, Tripty Singh, Deepa Gupta, Shahin Hameed Jan 2022

Optimized Cancer Detection On Various Magnified Histopathological Colon Imagesbased On Dwt Features And Fcm Clustering, Tina Babu, Tripty Singh, Deepa Gupta, Shahin Hameed

Turkish Journal of Electrical Engineering and Computer Sciences

Due to the morphological characteristics and other biological aspects in histopathological images, the computerized diagnosis of colon cancer in histopathology images has gained popularity. The images acquired using the histopathology microscope may differ for greater visibility by magnifications. This causes a change in morphological traits leading to intra and inter-observer variability. An automatic colon cancer diagnosis system for various magnification is therefore crucial. This work proposes a magnification independent segmentation approach based on the connected component area and double density dual tree DWT (discrete wavelet transform) coefficients are derived from the segmented region. The derived features are reduced further shortened …


Computer Vision Based Classification Of Fruits And Vegetables For Self-Checkout At Supermarkets, Khurram Hameed Jan 2022

Computer Vision Based Classification Of Fruits And Vegetables For Self-Checkout At Supermarkets, Khurram Hameed

Theses: Doctorates and Masters

The field of machine learning, and, in particular, methods to improve the capability of machines to perform a wider variety of generalised tasks are among the most rapidly growing research areas in today’s world. The current applications of machine learning and artificial intelligence can be divided into many significant fields namely computer vision, data sciences, real time analytics and Natural Language Processing (NLP). All these applications are being used to help computer based systems to operate more usefully in everyday contexts. Computer vision research is currently active in a wide range of areas such as the development of autonomous vehicles, …


A Statistical-Mining Techniques’ Collaboration For Minimizing Dimensionality In Ovarian Cancer Data, Mohamed Attia, Maha Farghaly, Mohamed Hamada, Amira M. Idrees Ami Nov 2021

A Statistical-Mining Techniques’ Collaboration For Minimizing Dimensionality In Ovarian Cancer Data, Mohamed Attia, Maha Farghaly, Mohamed Hamada, Amira M. Idrees Ami

Future Computing and Informatics Journal

A feature is a single measurable criterion to an observation of a process. While knowledge discovery techniques successfully contribute in many fields, however, the extensive required data processing could hinder the performance of these techniques. One of the main issues in processing data is the dimensionality of the data. Therefore, focusing on reducing the data dimensionality through eliminating the insignificant attributes could be considered one of the successful steps for raising the applied techniques’ performance. On the other hand, focusing on the applied field, ovarian cancer patients continuously suffer from the extensive analysis requirements for detecting the disease as well …


Cybert: Cybersecurity Claim Classification By Fine-Tuning The Bert Language Model, Kimia Ameri, Michael Hempel, Hamid Sharif, Juan Lopez Jr., Kalyan Perumalla Nov 2021

Cybert: Cybersecurity Claim Classification By Fine-Tuning The Bert Language Model, Kimia Ameri, Michael Hempel, Hamid Sharif, Juan Lopez Jr., Kalyan Perumalla

Department of Electrical and Computer Engineering: Faculty Publications

We introduce CyBERT, a cybersecurity feature claims classifier based on bidirectional encoder representations from transformers and a key component in our semi-automated cybersecurity vetting for industrial control systems (ICS). To train CyBERT, we created a corpus of labeled sequences from ICS device documentation collected across a wide range of vendors and devices. This corpus provides the foundation for fine-tuning BERT’s language model, including a prediction-guided relabeling process. We propose an approach to obtain optimal hyperparameters, including the learning rate, the number of dense layers, and their configuration, to increase the accuracy of our classifier. Fine-tuning all hyperparameters of the resulting …


Review Of Data Mining Techniques For Detecting Churners In The Telecommunication Industry, Mahmoud Ewieda, Mohamed Ismail Roushdy, Essam Shaaban Jul 2021

Review Of Data Mining Techniques For Detecting Churners In The Telecommunication Industry, Mahmoud Ewieda, Mohamed Ismail Roushdy, Essam Shaaban

Future Computing and Informatics Journal

The telecommunication sector has been developed rapidly and with large amounts of data obtained as a result of increasing in the number of subscribers, modern techniques, data-based applications, and services. As well as better awareness of customer requirements and excellent quality that meets their satisfaction. This satisfaction raises rivalry between firms to maintain the quality of their services and upgrade them. These data can be helpfully extracted for analysis and used for predicting churners. Researchers around the world have conducted important research to understand the uses of Data mining (DM) that can be used to predict customers' churn. This …


Choice Of Feature Space For Classification Of Network Ip-Traffic By Machine Learning Methods, Avazjon Marakhimov, Ulugbek Ohundadaev Jun 2021

Choice Of Feature Space For Classification Of Network Ip-Traffic By Machine Learning Methods, Avazjon Marakhimov, Ulugbek Ohundadaev

Bulletin of National University of Uzbekistan: Mathematics and Natural Sciences

IP-protocol and transport layer protocols (TCP, UDP) have many different parameters and characteristics, which can be obtained both directly from packet headers and statistical observations of the flows. To solve the problem of classification of network traffc by methods of machine learning, it is necessary to determine a set of data (attributes), which it is reasonable to use for solving the classification problem.