Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Convolutional neural networks

Articles 1 - 30 of 114

Full-Text Articles in Physical Sciences and Mathematics

Anomaly Detection On Small Wind Turbine Blades Using Deep Learning Algorithms, Bridger Altice, Edwin Nazario, Mason Davis, Mohammad Shekaramiz, Todd K. Moon, Mohammad A. S. Masoum Feb 2024

Electrical and Computer Engineering Faculty Publications

Wind turbine blade maintenance is expensive, dangerous, time-consuming, and prone to misdiagnosis. A potential solution to aid preventative maintenance is using deep learning and drones for inspection and early fault detection. In this research, five base deep learning architectures are investigated for anomaly detection on wind turbine blades, including Xception, ResNet-50, AlexNet, and VGG-19, along with a custom convolutional neural network. For further analysis, transfer learning approaches were also proposed and developed, utilizing these architectures as the feature extraction layers. To investigate model performance, a new dataset containing 6000 RGB images was created, making use of indoor and …


Ground Target Recognition And Damage Assessment Of Patrol Missiles Based On Multi-Source Information Fusion, Yibo Xu, Qinghua Yu, Yanjuan Wang, Ce Guo, Shiru Feng, Huimin Lu Feb 2024

Journal of System Simulation

Abstract: For attacks by multiple patrol missiles on targets with high defense capacity, a mobile ground target detection and damage assessment method based on multi-source information fusion is proposed. Infrared images and RGB images are fused using IoU determination. A novel two-stage, tightly coupled damage assessment method based on YOLO-VGGNet is proposed for patrol missiles attacking mobile ground targets. This method makes full use of the deep semantic information extraction of CNNs while introducing infrared damage information, achieving online, real-time damage assessment of mobile ground targets. The …


Urban Flood Extent Segmentation And Evaluation From Real-World Surveillance Camera Images Using Deep Convolutional Neural Network, Yidi Wang, Yawen Shen, Behrouz Salahshour, Mecit Cetin, Khan Iftekharuddin, Navid Tahvildari, Guoping Huang, Devin K. Harris, Kwame Ampofo, Jonathan L. Goodall Jan 2024

Civil & Environmental Engineering Faculty Publications

This study explores the use of Deep Convolutional Neural Network (DCNN) for semantic segmentation of flood images. Imagery datasets of urban flooding were used to train two DCNN-based models, and camera images were used to test the application of the models with real-world data. Validation results show that both models extracted flood extent with a mean F1-score over 0.9. The factors that affected the performance included still water surface with specular reflection, wet road surface, and low illumination. In testing, reduced visibility during a storm and raindrops on surveillance cameras were major problems that affected the segmentation of flood extent. …


Infrared Imaging Segmentation Employing An Explainable Deep Neural Network, Xinfei Liao, Dan Wang, Zairan Li, Nilanjan Dey, Rs Simon, Fuqian Shi Oct 2023

Turkish Journal of Electrical Engineering and Computer Sciences

Explainable AI (XAI) improved by a deep neural network (DNN) of a residual neural network (ResNet) and long short-term memory networks (LSTMs), termed XAIRL, is proposed for segmenting foot infrared imaging datasets. First, an infrared sensor imaging dataset is acquired by a foot infrared sensor imaging device and preprocessed. The infrared sensor image features are then defined and extracted with XAIRL being applied to segment the dataset. This paper compares and discusses our results with XAIRL. Evaluation indices are applied to perform various measurements for foot infrared image segmentation including accuracy, precision, recall, F1 score, intersection over union (IoU), Dice …


Short-Term Vehicle Speed Prediction With Spatiotemporal Convolution Fused With Variational Modal Decomposition, Kai Zhang, Haipeng Lu, Ying Han, Lingyun Zhang, Yujie Ding Aug 2023

Journal of System Simulation

Abstract: Accurate short-term vehicle speed prediction helps to resolve city traffic congestion problems. To address the limitation that CNNs cannot process non-Euclidean geometric data, GCN and BiLSTM are combined to fully process the spatiotemporal characteristics of road network information, drawing on GCN's ability to integrate global features and BiLSTM's ability to extract temporal features. To reduce the interference of noise in the data, variational modal decomposition (VMD) is introduced, and a short-term vehicle speed prediction model based on VMD-GCN-BiLSTM (VGBLSTM) is proposed. Simulation results show that the prediction accuracy of the VGBLSTM model is …


Tree-Based Unidirectional Neural Networks For Low-Power Computer Vision, Abhinav Goel, Caleb Tung, Nick Eliopoulos, Amy Wang, Jamie C. Davis, George K. Thiruvathukal, Yung-Hsiang Lu Jun 2023

Computer Science: Faculty Publications and Other Works

This article describes the novel Tree-based Unidirectional Neural Network (TRUNK) architecture. This architecture improves computer vision efficiency by using a hierarchy of multiple shallow Convolutional Neural Networks (CNNs), instead of a single very deep CNN. We demonstrate this architecture’s versatility in performing different computer vision tasks efficiently on embedded devices. Across various computer vision tasks, the TRUNK architecture consumes 65% less energy and requires 50% less memory than representative low-power CNN architectures, e.g., MobileNet v2, when deployed on the NVIDIA Jetson Nano.


Automated Delineation Of Visual Area Boundaries And Eccentricities By A CNN Using Functional, Anatomical, And Diffusion-Weighted MRI Data, Noah C. Benson, Bogeng Song, Toshikazu Miyata, Hiromasa Takemura, Jonathan Winawer May 2023

MODVIS Workshop

Delineating visual field maps and iso-eccentricities from fMRI data is an important but time-consuming task for many neuroimaging studies on the human visual cortex because the traditional methods of doing so using retinotopic mapping experiments require substantial expertise as well as scanner, computer, and human time. Automated methods based on gray-matter anatomy or a combination of anatomy and functional mapping can reduce these requirements but are less accurate than experts. Convolutional Neural Networks (CNNs) are powerful tools for automated medical image segmentation. We hypothesize that CNNs can define visual area boundaries with high accuracy. We trained U-Net CNNs with ResNet18 …


Using Deep Learning Model To Identify Iron Chlorosis In Plants, Munir Majdalawieh, Shafaq Khan, Md. T. Islam May 2023

All Works

Iron deficiency in plants causes iron chlorosis, which frequently occurs in soils that are alkaline (pH greater than 7.0) and that contain lime. This deficiency turns affected plant leaves yellow, with brown edges in advanced stages. The goal of this research is to use a deep learning model to identify a nutrient deficiency in plant leaves and perform soil analysis to identify the cause of the deficiency. Two pre-trained deep learning models, Single Shot Detector (SSD) MobileNet v2 and EfficientDet D0, are used to complete this task via transfer learning. This research also contrasts the architecture and performance …


An Efficient Deep Learning Architecture For Turkish Lira Recognition And Counterfeit Detection, Burak İyi̇kesi̇ci̇, Ergun Erçelebi̇ May 2023

Turkish Journal of Electrical Engineering and Computer Sciences

Banknote counterfeiting is a common practice worldwide. Due to recent developments in technology, banknote imitation has become easier than before. Different kinds of algorithms have been developed in the literature for detecting counterfeit banknotes of different countries. The earlier algorithms utilized classical image processing techniques, whereas implementations of machine learning and deep learning algorithms appeared with developments in the artificial intelligence field and in computer hardware. In this study, a novel convolutional neural network-based deep learning algorithm has been developed that detects counterfeit Turkish Lira banknotes and their denominations using the banknote images …


An Advanced Deep Learning Models-Based Plant Disease Detection: A Review Of Recent Research, Muhammad Shoaib, Babar Shah, Shaker Ei-Sappagh, Akhtar Ali, Asad Ullah, Fayadh Alenezi, Tsanko Gechev, Tariq Hussain, Farman Ali Mar 2023

All Works

Plants play a crucial role in supplying food globally. Various environmental factors lead to plant diseases, which result in significant production losses. However, manual detection of plant diseases is a time-consuming and error-prone process. It can be an unreliable method of identifying and preventing the spread of plant diseases. Adopting advanced technologies such as Machine Learning (ML) and Deep Learning (DL) can help to overcome these challenges by enabling early identification of plant diseases. In this paper, the recent advancements in the use of ML and DL techniques for the identification of plant diseases are explored. The research focuses on …


Enhanced Convolutional Neural Network For Non-Small Cell Lung Cancer Classification, Yahya Tashtoush, Rasha Obeidat, Abdallah Al-Shorman, Omar Darwish, Mohammad A. Al-Ramahi, Dirar Darweesh Feb 2023

Computer Information Systems Faculty Publications

Lung cancer is a common type of cancer that causes death if not detected early enough. Doctors use computed tomography (CT) images to diagnose lung cancer. The accuracy of the diagnosis relies highly on the doctor's expertise. Recently, clinical decision support systems based on deep learning have provided valuable recommendations to doctors in their diagnoses. In this paper, we present several deep learning models to detect non-small cell lung cancer in CT images and differentiate its main subtypes, namely adenocarcinoma, large cell carcinoma, and squamous cell carcinoma. We adopted standard convolutional neural networks (CNN), visual geometry group-16 (VGG16), and VGG19. Besides, we …


Convolutional-Neural-Network-Based DES-Level Aerodynamic Flow Field Generation From URANS Data, John P. Romano, Oktay Baysal, Alec C. Brodeur Jan 2023

Mechanical & Aerospace Engineering Faculty Publications

The present paper culminates several investigations into the use of convolutional neural networks (CNNs) as a post-processing step to improve the accuracy of unsteady Reynolds-averaged Navier–Stokes (URANS) simulations for subsonic flows over airfoils at low angles of attack. Time-averaged detached eddy simulation (DES)-generated flow fields serve as the target data for creating and training CNN models. CNN post-processing generates flow-field data comparable to DES resolution, but after using only URANS-level resources and properly training CNN models. This document outlines the underlying theory and progress toward the goal of improving URANS simulations by looking at flow predictions for a class of …


Detecting Road Intersections From Satellite Images Using Convolutional Neural Networks, Fatmaelzahraa Eltaher, Luis Miralles-Pechuán, Jane Courtney, Susan Mckeever Jan 2023

Academic Posters Collection

The location of intersections is an important consideration for vulnerable road users such as People with Blindness or Visual Impairment (PBVI) or children. Route planning applications, however, do not give information about the location of intersections as this information is not available at scale. In this paper, we propose a deep learning framework to automatically detect the location of intersections from satellite images using convolutional neural networks. For this purpose, we labelled 7,342 Google Maps images from Washington, DC, USA to create a dataset. This dataset covers a region of 58.98 km$^{2}$ and has 7,548 intersections. We then applied a …


Integrating The Spatial Pyramid Pooling Into 3D Convolutional Neural Networks For Cerebral Microbleeds Detection, Andre Accioly Veira Jan 2023

CCE Theses and Dissertations

Cerebral microbleeds (CMB) are small foci of chronic blood products in brain tissues that are critical markers for cerebral amyloid angiopathy. CMB increases the risk of symptomatic intracerebral hemorrhage and ischemic stroke. CMB can also cause structural damage to brain tissues resulting in neurologic dysfunction, cognitive impairment, and dementia. Due to the paramagnetic properties of blood degradation products, CMB can be better visualized via susceptibility-weighted imaging (SWI) than magnetic resonance imaging (MRI). CMB identification and classification have been based mainly on human visual identification of SWI features via shape, size, and intensity information. However, manual interpretation can be biased. Visual screening …


Light Auditor: Power Measurement Can Tell Private Data Leakage Through IoT Covert Channels, Woosub Jung, Kailai Cui, Kenneth Koltermann, Junjie Wang, Chunsheng Xin, Gang Zhou Jan 2023

Electrical & Computer Engineering Faculty Publications

Despite many conveniences of using IoT devices, they have suffered from various attacks due to their weak security. Besides well-known botnet attacks, IoT devices are vulnerable to recent covert-channel attacks. However, no study to date has considered these IoT covert-channel attacks. Among these attacks, researchers have demonstrated exfiltrating users' private data by exploiting the smart bulb's capability of infrared emission.

In this paper, we propose a power-auditing-based system that defends against the data exfiltration attack on the smart bulb as a case study. We first implement this infrared-based attack in a lab environment. With a newly collected power consumption dataset, we pre-process …


Generation Of Phase Transitions Boundaries Via Convolutional Neural Networks, Christopher Alexis Ibarra Dec 2022

Open Access Theses & Dissertations

Accurate mapping of phase transition boundaries is crucial for accurately modeling the equation of state of materials. The phase transitions can be structural (solid-solid), driven by temperature or pressure, or a phase change like melting, which defines the solid-liquid melt line. There exist many computational methods for evaluating the phase diagram at a particular point in temperature (T) and pressure (P). Most of these methods involve evaluation of a single (P,T) point at a time. The present work partially automates the search for phase boundary lines utilizing a machine learning method based on convolutional neural networks and an efficient search …


How To Train Vision Transformer On Small-Scale Datasets?, Hanan Gani, Muzammal Naseer, Mohammad Yaqub Nov 2022

Computer Vision Faculty Publications

Vision Transformer (ViT), a radically different architecture than convolutional neural networks, offers multiple advantages including design simplicity, robustness and state-of-the-art performance on many vision tasks. However, in contrast to convolutional neural networks, Vision Transformer lacks inherent inductive biases. Therefore, successful training of such models is mainly attributed to pre-training on large-scale datasets such as ImageNet with 1.2M or JFT with 300M images. This hinders the direct adaptation of Vision Transformer for small-scale datasets. In this work, we show that self-supervised inductive biases can be learned directly from small-scale datasets and serve as an effective weight initialization scheme for fine-tuning. This …


Machine Learning For Aiding Blood Flow Velocity Estimation Based On Angiography, Swati Padhee, Mark Johnson, Hang Yi, Tanvi Banerjee, Zifeng Yang Oct 2022

Computer Science and Engineering Faculty Publications

Computational fluid dynamics (CFD) is widely employed to predict hemodynamic characteristics in arterial models, but it is not well suited to clinical applications due to the complexity of numerical simulations. Alternatively, this work proposes a framework to estimate hemodynamics in vessels based on angiography images using machine learning (ML) algorithms. First, the iodine contrast perfusion in blood was mimicked by a flow of dye diffusing into water in the experimentally validated CFD modeling. The generated projective images from simulations imitated the counterpart of light passing through the flow field as an analogy of X-ray imaging. Thus, the CFD simulation provides both the ground …


Medical Image Segmentation With Deep Convolutional Neural Networks, Chuanbo Wang Aug 2022

Theses and Dissertations

Medical imaging is the technique and process of creating visual representations of the body of a patient for clinical analysis and medical intervention. Healthcare professionals rely heavily on medical images and image documentation for proper diagnosis and treatment. However, manual interpretation and analysis of medical images are time-consuming, and inaccurate when the interpreter is not well-trained. Fully automatic segmentation of the region of interest from medical images has been researched for years to enhance the efficiency and accuracy of understanding such images. With the advance of deep learning, various neural network models have gained great success in semantic segmentation and …


Adversarial Pixel Restoration As A Pretext Task For Transferable Perturbations, Hashmat Shadab Malik, Shahina K. Kunhimon, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan Jul 2022

Computer Vision Faculty Publications

Transferable adversarial attacks optimize adversaries from a pretrained surrogate model and known label space to fool the unknown black-box models. Therefore, these attacks are restricted by the availability of an effective surrogate model. In this work, we relax this assumption and propose Adversarial Pixel Restoration as a self-supervised alternative to train an effective surrogate model from scratch under the condition of no labels and few data samples. Our training approach is based on a min-max objective which reduces overfitting via an adversarial objective and thus optimizes for a more generalizable surrogate model. Our proposed attack is complementary to our adversarial …


Deep Convolution Neural Networks For Image Classification, Arun D. Kulkarni Jul 2022

Computer Science Faculty Publications and Presentations

Deep learning is a highly active area of research in the machine learning community. Deep Convolutional Neural Networks (DCNNs) present a machine learning tool that enables the computer to learn from image samples and extract internal representations or properties underlying groupings or categories of the images. DCNNs have been used successfully for image classification, object recognition, image segmentation, and image retrieval tasks. DCNN models such as AlexNet, VGGNet, and GoogLeNet have been used to classify large datasets having millions of images into a thousand classes. In this paper, we present a brief review of DCNNs and results of our …


A New Automatic Bearing Fault Size Diagnosis Using Time-Frequency Images Of CWT And Deep Transfer Learning Methods, Yilmaz Kaya, Fatma Kuncan, Hüseyi̇n Meti̇n Ertunç Jul 2022

Turkish Journal of Electrical Engineering and Computer Sciences

Bearings are generally used as bearings or turning elements. Bearings are subjected to high loads and rapid speeds. Furthermore, metal-to-metal contact within the bearing makes it sensitive. In today's machines, bearing failures disrupt the operation of the system or completely stop the system. Bearing failures that can occur can cause enormous damage to the entire system. Therefore, it is necessary to anticipate bearing failures and to carry out a regular diagnostic examination. Various systems have been developed for fault diagnosis. In recent years, deep transfer learning (DTL) methods are often preferred in current bearing diagnosis models, as they provide time …


EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture For Mobile Vision Applications, Muhammad Maaz, Abdelrahman Shaker, Hisham Cholakkal, Salman Khan, Syed Waqas Zamir, Rao Anwer, Fahad Shahbaz Khan Jun 2022

Computer Vision Faculty Publications

In the pursuit of achieving ever-increasing accuracy, large and complex neural networks are usually developed. Such models demand high computational resources and therefore cannot be deployed on edge devices. It is of great interest to build resource-efficient general purpose networks due to their usefulness in several application areas. In this work, we strive to effectively combine the strengths of both CNN and Transformer models and propose a new efficient hybrid architecture EdgeNeXt. Specifically in EdgeNeXt, we introduce split depth-wise transpose attention (SDTA) encoder that splits input tensors into multiple channel groups and utilizes depth-wise convolution along with self-attention across channel …


Learning Enriched Features For Fast Image Restoration And Enhancement, Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao Apr 2022

Computer Vision Faculty Publications

Given a degraded input image, image restoration aims to recover the missing high-quality image content. Numerous applications demand effective image restoration, e.g., computational photography, surveillance, autonomous vehicles, and remote sensing. Significant advances in image restoration have been made in recent years, dominated by convolutional neural networks (CNNs). The widely-used CNN-based methods typically operate either on full-resolution or on progressively low-resolution representations. In the former case, spatial details are preserved but the contextual information cannot be precisely encoded. In the latter case, generated outputs are semantically reliable but spatially less accurate. This paper presents a new architecture with a holistic goal …


Malware Binary Image Classification Using Convolutional Neural Networks, John Kiger, Shen-Shyang Ho, Vahid Heydari Mar 2022

Faculty Scholarship for the College of Science & Mathematics

The persistent shortage of cybersecurity professionals combined with enterprise networks tasked with processing more data than ever before has led many cybersecurity experts to consider automating some of the most common and time-consuming security tasks using machine learning. One of these cybersecurity tasks where machine learning may prove advantageous is malware analysis and classification. To evade traditional detection techniques, malware developers are creating more complex malware. This is achieved through more advanced methods of code obfuscation and conducting more sophisticated attacks. This can make the manual process of analyzing malware an infinitely more complex task. Furthermore, the proliferation of malicious …


An Ensemble Approach For Patient Prognosis Of Head And Neck Tumor Using Multimodal Data, Numan Saeed, Roba Al Majzoub, Ikboljon Sobirov, Mohammad Yaqub Feb 2022

Computer Vision Faculty Publications

Accurate prognosis of a tumor can help doctors provide a proper course of treatment and, therefore, save the lives of many. Traditional machine learning algorithms have been eminently useful in crafting prognostic models in the last few decades. Recently, deep learning algorithms have shown significant improvement when developing diagnosis and prognosis solutions to different healthcare problems. However, most of these solutions rely solely on either imaging or clinical data. Utilizing patient tabular data such as demographics and patient medical history alongside imaging data in a multimodal approach to solve a prognosis task has started to gain more interest recently and …


A Survey Of Blind Modulation Classification Techniques For OFDM Signals, Anand Kumar, Sudhan Majhi, Guan Gui, Hsiao-Chun Wu, Chau Yuen Feb 2022

Faculty Publications

Blind modulation classification (MC) is an integral part of designing an adaptive or intelligent transceiver for future wireless communications. Blind MC has several applications in the adaptive and automated systems of sixth generation (6G) communications to improve spectral efficiency and power efficiency, and reduce latency. It will become an integral part of intelligent software-defined radios (SDR) for future communication. In this paper, we provide various MC techniques for orthogonal frequency division multiplexing (OFDM) signals in a systematic way. We focus on the most widely used statistical and machine learning (ML) models and emphasize their advantages and limitations. The statistical-based blind …


Transformers In Vision: A Survey, Salman Khan, Muzammal Naseer, Munawar Hayat, Syed Waqas Zamir, Fahad Shahbaz Khan, Mubarak Shah Jan 2022

Computer Vision Faculty Publications

Astounding results from Transformer models on natural language tasks have intrigued the vision community to study their application to computer vision problems. Among their salient benefits, Transformers enable modeling long dependencies between input sequence elements and support parallel processing of sequences as compared to recurrent networks, e.g., Long short-term memory (LSTM). Different from convolutional networks, Transformers require minimal inductive biases for their design and are naturally suited as set-functions. Furthermore, the straightforward design of Transformers allows processing multiple modalities (e.g., images, videos, text and speech) using similar processing blocks and demonstrates excellent scalability to very large capacity networks and huge …


Automatic Segmentation Of Head And Neck Tumor: How Powerful Transformers Are?, Ikboljon Sobirov, Otabek Nazarov, Hussain Alasmawi, Mohammad Yaqub Jan 2022

Computer Vision Faculty Publications

Cancer is one of the leading causes of death worldwide, and head and neck (H&N) cancer is amongst the most prevalent types. Positron emission tomography and computed tomography are used to detect and segment the tumor region. Clinically, tumor segmentation is extensively time-consuming and prone to error. Machine learning, and deep learning in particular, can help automate this process, yielding results as accurate as the results of a clinician. In this research study, we develop a vision transformer-based method to automatically delineate H&N tumors, and compare its results to leading convolutional neural network (CNN)-based models. We use multi-modal data …


CNN Based Sensor Fusion Method For Real-Time Autonomous Robotics Systems, Berat Yildiz, Aki̇f Durdu, Ahmet Kayabaşi, Mehmet Duramaz Jan 2022

Turkish Journal of Electrical Engineering and Computer Sciences

Autonomous robotic systems (ARS) serve in many areas of daily life. Sensors are critically important for these systems. The sensor data obtained from the environment should be as accurate and reliable as possible and correctly interpreted by the autonomous robot. Since sensors have advantages and disadvantages relative to each other, they should be used together to reduce errors. In this study, Convolutional Neural Network (CNN) based sensor fusion was applied to ARS to contribute to autonomous driving. In a real-time application, a camera and LIDAR sensor were tested with these networks. The novelty of this work is that the uniquely …