Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

2024

Deep learning

Discipline
Institution
Publication
Publication Type

Articles 31 - 60 of 66

Full-Text Articles in Entire DC Network

The Impact Of Artificial Intelligence And Machine Learning On Organizations Cybersecurity, Mustafa Abdulhussein Feb 2024

The Impact Of Artificial Intelligence And Machine Learning On Organizations Cybersecurity, Mustafa Abdulhussein

Doctoral Dissertations and Projects

As internet technology proliferate in volume and complexity, the ever-evolving landscape of malicious cyberattacks presents unprecedented security risks in cyberspace. Cybersecurity challenges have been further exacerbated by the continuous growth in the prevalence and sophistication of cyber-attacks. These threats have the capacity to disrupt business operations, erase critical data, and inflict reputational damage, constituting an existential threat to businesses, critical services, and infrastructure. The escalating threat is further compounded by the malicious use of artificial intelligence (AI) and machine learning (ML), which have increasingly become tools in the cybercriminal arsenal. In this dynamic landscape, the emergence of offensive AI introduces …


Anomaly Detection On Small Wind Turbine Blades Using Deep Learning Algorithms, Bridger Altice, Edwin Nazario, Mason Davis, Mohammad Shekaramiz, Todd K. Moon, Mohammad A. S. Masoum Feb 2024

Anomaly Detection On Small Wind Turbine Blades Using Deep Learning Algorithms, Bridger Altice, Edwin Nazario, Mason Davis, Mohammad Shekaramiz, Todd K. Moon, Mohammad A. S. Masoum

Electrical and Computer Engineering Faculty Publications

Wind turbine blade maintenance is expensive, dangerous, time-consuming, and prone to misdiagnosis. A potential solution to aid preventative maintenance is using deep learning and drones for inspection and early fault detection. In this research, five base deep learning architectures are investigated for anomaly detection on wind turbine blades, including Xception, Resnet-50, AlexNet, and VGG-19, along with a custom convolutional neural network. For further analysis, transfer learning approaches were also proposed and developed, utilizing these architectures as the feature extraction layers. In order to investigate model performance, a new dataset containing 6000 RGB images was created, making use of indoor and …


Motion Magnification-Inspired Feature Manipulation For Deepfake Detection, Aydamir Mirzayev, Hamdi Di̇bekli̇oğlu Feb 2024

Motion Magnification-Inspired Feature Manipulation For Deepfake Detection, Aydamir Mirzayev, Hamdi Di̇bekli̇oğlu

Turkish Journal of Electrical Engineering and Computer Sciences

Recent advances in deep learning, increased availability of large-scale datasets, and improvement of accelerated graphics processing units facilitated creation of an unprecedented amount of synthetically generated media content with impressive visual quality. Although such technology is used predominantly for entertainment, there is widespread practice of using deepfake technology for malevolent ends. This potential for malicious use necessitates the creation of detection methods capable of reliably distinguishing manipulated video content. In this work we aim to create a learning-based detection method for synthetically generated videos. To this end, we attempt to detect spatiotemporal inconsistencies by leveraging a learning-based magnification-inspired feature manipulation …


Automated Identification Of Vehicles In Very High-Resolution Uav Orthomosaics Using Yolov7 Deep Learning Model, Esra Yildirim, Umut Güneş Seferci̇k, Taşkın Kavzoğlu Feb 2024

Automated Identification Of Vehicles In Very High-Resolution Uav Orthomosaics Using Yolov7 Deep Learning Model, Esra Yildirim, Umut Güneş Seferci̇k, Taşkın Kavzoğlu

Turkish Journal of Electrical Engineering and Computer Sciences

The utilization of remote sensing products for vehicle detection through deep learning has gained immense popularity, especially due to the advancement of unmanned aerial vehicles (UAVs). UAVs offer millimeter-level spatial resolution at low flight altitudes, which surpasses traditional airborne platforms. Detecting vehicles from very high-resolution UAV data is crucial in numerous applications, including parking lot and highway management, traffic monitoring, search and rescue missions, and military operations. Obtaining UAV data at desired periods allows the detection and tracking of target objects even several times during a day. Despite challenges such as diverse vehicle characteristics, traffic congestion, and hardware limitations, the …


Blood Cell Image Segmentation And Classification: A Systematic Review, Muhammad Shahzad, Farman Ali, Syed Hamad Shirazi, Assad Rasheed, Awais Ahmad, Babar Shah, Daehan Kwak Feb 2024

Blood Cell Image Segmentation And Classification: A Systematic Review, Muhammad Shahzad, Farman Ali, Syed Hamad Shirazi, Assad Rasheed, Awais Ahmad, Babar Shah, Daehan Kwak

All Works

Background Blood diseases such as leukemia, anemia, lymphoma, and thalassemia are hematological disorders that relate to abnormalities in the morphology and concentration of blood elements, specifically white blood cells (WBC) and red blood cells (RBC). Accurate and efficient diagnosis of these conditions significantly depends on the expertise of hematologists and pathologists. To assist the pathologist in the diagnostic process, there has been growing interest in utilizing computer-aided diagnostic (CAD) techniques, particularly those using medical image processing and machine learning algorithms. Previous surveys in this domain have been narrowly focused, often only addressing specific areas like segmentation or classification but lacking …


Dataset Of Arabic Spam And Ham Tweets, Sanaa Kaddoura, Safaa Henno Feb 2024

Dataset Of Arabic Spam And Ham Tweets, Sanaa Kaddoura, Safaa Henno

All Works

This data article provides a dataset of 132421 posts and their corresponding information collected from Twitter social media. The data has two classes, ham or spam, where ham indicates non-spam clean tweets. The main target of this dataset is to study a way to classify whether a post is a spam or not automatically. The data is in Arabic language only, which makes the data essential to the researchers in Arabic natural language processing (NLP) due to the lack of resources in this language. The data is made publicly available to allow researchers to use it as a benchmark for …


Conic Challenge: Pushing The Frontiers Of Nuclear Detection, Segmentation, Classification And Counting, Simon Graham, Quoc Dang Vu, Mostafa Jahanifar, Martin Weigert, Uwe Schmidt, Wenhua Zhang, Jun Zhang, Sen Yang, Jinxi Xiang, Xiyue Wang, Josef Lorenz Rumberger, Elias Baumann, Peter Hirsch, Lihao Liu, Chenyang Hong, Angelica I. Aviles-Rivero, Ayushi Jain, Heeyoung Ahn, Yiyu Hong, Hussam Azzuni, Min Xu Feb 2024

Conic Challenge: Pushing The Frontiers Of Nuclear Detection, Segmentation, Classification And Counting, Simon Graham, Quoc Dang Vu, Mostafa Jahanifar, Martin Weigert, Uwe Schmidt, Wenhua Zhang, Jun Zhang, Sen Yang, Jinxi Xiang, Xiyue Wang, Josef Lorenz Rumberger, Elias Baumann, Peter Hirsch, Lihao Liu, Chenyang Hong, Angelica I. Aviles-Rivero, Ayushi Jain, Heeyoung Ahn, Yiyu Hong, Hussam Azzuni, Min Xu

Computer Vision Faculty Publications

Nuclear detection, segmentation and morphometric profiling are essential in helping us further understand the relationship between histology and patient outcome. To drive innovation in this area, we setup a community-wide challenge using the largest available dataset of its kind to assess nuclear segmentation and cellular composition. Our challenge, named CoNIC, stimulated the development of reproducible algorithms for cellular recognition with real-time result inspection on public leaderboards. We conducted an extensive post-challenge analysis based on the top-performing models using 1,658 whole-slide images of colon tissue. With around 700 million detected nuclei per model, associated features were used for dysplasia grading and …


Catnet: Cross-Modal Fusion For Audio-Visual Speech Recognition, Xingmei Wang, Jianchen Mi, Boquan Li, Yixu Zhao, Jiaxiang Meng Feb 2024

Catnet: Cross-Modal Fusion For Audio-Visual Speech Recognition, Xingmei Wang, Jianchen Mi, Boquan Li, Yixu Zhao, Jiaxiang Meng

Research Collection School Of Computing and Information Systems

Automatic speech recognition (ASR) is a typical pattern recognition technology that converts human speeches into texts. With the aid of advanced deep learning models, the performance of speech recognition is significantly improved. Especially, the emerging Audio–Visual Speech Recognition (AVSR) methods achieve satisfactory performance by combining audio-modal and visual-modal information. However, various complex environments, especially noises, limit the effectiveness of existing methods. In response to the noisy problem, in this paper, we propose a novel cross-modal audio–visual speech recognition model, named CATNet. First, we devise a cross-modal bidirectional fusion model to analyze the close relationship between audio and visual modalities. Second, …


Therapeutic Potential Of Snake Venom: Toxin Distribution And Opportunities In Deep Learning For Novel Drug Discovery, Anas Bedraoui, Montamas Suntravat, Salim El Mejjad, Salwa Enezari, Naoual Oukkache, Elda E. Sanchez, Jacob Galan, Rachid El Fatimy, Tariq Daouda Feb 2024

Therapeutic Potential Of Snake Venom: Toxin Distribution And Opportunities In Deep Learning For Novel Drug Discovery, Anas Bedraoui, Montamas Suntravat, Salim El Mejjad, Salwa Enezari, Naoual Oukkache, Elda E. Sanchez, Jacob Galan, Rachid El Fatimy, Tariq Daouda

School of Medicine Publications and Presentations

Snake venom is a rich source of bioactive molecules that hold great promise for therapeutic applications. These molecules can be broadly classified into enzymes and non-enzymes, each showcasing unique medicinal properties. Noteworthy compounds such as Bradykinin Potentiating Peptides (BPP) and Three-Finger Toxins (3FTx) are showing therapeutic potential in areas like cardiovascular diseases (CVDs) and pain-relief. Meanwhile, components like snake venom metalloproteinases (SVMP), L-amino acid oxidases (LAAO), and Phospholipase A2s (PLA2) are paving new ways in oncology treatments. The full medicinal scope of these toxins is still emerging. In this review, we discuss drugs derived from snake venoms that address …


The Application Of Computer Vision Combining With Deep Leaning Techniques For Rapid Discrimination Of Adulterated Star Anise Powder, Chen Jinxing Jan 2024

The Application Of Computer Vision Combining With Deep Leaning Techniques For Rapid Discrimination Of Adulterated Star Anise Powder, Chen Jinxing

Food and Machinery

Objective: This study aims to design a novel approach, utilizing computer vision combining with deep learning, for rapid determination the adulteration in star anise powder. Methods: Collected the original images of star anise powder with varying adulteration ratios. Employing preprocessing and data enhancement techniques, an image dataset was curated. Subsequently, a SqueezeNet model was constructed and compared with five machine learning models, including Support Vector Machine (SVM), K-Nearest Neighbor Learning (KNN), Random Forest (RF), Gradient Boosting Tree (GBT), and Multilayer Perceptron (MLP). Results: The highest accuracy achieved by the five machine learning models was only 66.37%, while the accuracy of …


Fruit Variety And Freshness Recognition Method Based On Yolo-Ffd, Yan Zi, Chen Liangyan, Liu Weihua, Lai Huaqing, Ye Sheng Jan 2024

Fruit Variety And Freshness Recognition Method Based On Yolo-Ffd, Yan Zi, Chen Liangyan, Liu Weihua, Lai Huaqing, Ye Sheng

Food and Machinery

Objective: In order to improve the situation that existing fruit recognition and classification methods rely on manual operation and complex equipment. Methods: A lightweight model YOLO-FFD (YOLO with fruit and freshen detection) was proposed, which based on the YOLOv5 framework. Firstly, LightweightC3 was designed as the basic unit of the backbone feature extraction network based on the depth separable convolution and GELU activation function, which reduced the number of model parameters and computation, and speeds up the convergence of the model. Secondly, EnhancedC3, a large kernel depth separable convolution module, was used to improve the neck of the original model, …


Action Recognition Model Of Directed Attention Based On Cosine Similarity, Chen Li, Ming He, Chen Dong, Wei Li Jan 2024

Action Recognition Model Of Directed Attention Based On Cosine Similarity, Chen Li, Ming He, Chen Dong, Wei Li

Journal of System Simulation

Abstract: Aiming at the lack of directionality of traditional dot product attention, this paper proposes a directed attention model (DAM) based on cosine similarity. To effectively represent the direction relationship between the spatial and temporal features of video frames, the paper defines the relationship function in the attention mechanism using the cosine similarity theory, which can remove the absolute value of the relationship between features. To reduce the computational burden of the attention mechanism, the operation is decomposed from two dimensions of time and space. The computational complexity is further optimized by combining linear attention operation. The experiment is divided …


Deep Learning-Based Multimodality Classification Of Chronic Mild Traumatic Brain Injury Using Resting-State Functional Mri And Pet Imaging, Faezeh Vedaei, Najmeh Mashhadi, Mahdi Alizadeh, George Zabrecky, Daniel A. Monti, Md, Nancy Wintering, Emily Navarreto, Chloe Hriso, Andrew B. Newberg, Feroze B. Mohamed Jan 2024

Deep Learning-Based Multimodality Classification Of Chronic Mild Traumatic Brain Injury Using Resting-State Functional Mri And Pet Imaging, Faezeh Vedaei, Najmeh Mashhadi, Mahdi Alizadeh, George Zabrecky, Daniel A. Monti, Md, Nancy Wintering, Emily Navarreto, Chloe Hriso, Andrew B. Newberg, Feroze B. Mohamed

Department of Radiology Faculty Papers

Mild traumatic brain injury (mTBI) is a public health concern. The present study aimed to develop an automatic classifier to distinguish between patients with chronic mTBI (n = 83) and healthy controls (HCs) (n = 40). Resting-state functional MRI (rs-fMRI) and positron emission tomography (PET) imaging were acquired from the subjects. We proposed a novel deep-learning-based framework, including an autoencoder (AE), to extract high-level latent and rectified linear unit (ReLU) and sigmoid activation functions. Single and multimodality algorithms integrating multiple rs-fMRI metrics and PET data were developed. We hypothesized that combining different imaging modalities provides complementary information and …


Classification Of Colorectal Cancer Using Resnet And Efficientnet Models, Abhishek Ranjan, Priyanshu Srivastva, B Prabadevi, R Sivakumar, Rahul Soangra, Shamala K. Subramaniam Jan 2024

Classification Of Colorectal Cancer Using Resnet And Efficientnet Models, Abhishek Ranjan, Priyanshu Srivastva, B Prabadevi, R Sivakumar, Rahul Soangra, Shamala K. Subramaniam

Physical Therapy Faculty Articles and Research

Introduction:

Cancer is one of the most prevalent diseases from children to elderly adults. This will be deadly if not detected at an earlier stage of the cancerous cell formation, thereby increasing the mortality rate. One such cancer is colorectal cancer, caused due to abnormal growth in the rectum or colon. Early screening of colorectal cancer helps to identify these abnormal growth and can exterminate them before they turn into cancerous cells.

Aim:

Therefore, this study aims to develop a robust and efficient classification system for colorectal cancer through Convolutional Neural Networks (CNNs) on histological images.

Methods:

Despite challenges in …


Open System Neural Networks, Bradley Hatch Jan 2024

Open System Neural Networks, Bradley Hatch

Theses and Dissertations

Recent advances in self-supervised learning have made it possible to reuse information-rich models that have been generally pre-trained on massive amounts of data for other downstream tasks. But the pre-training process can be drastically different from the fine-tuning training process, which can lead to inefficient learning. We address this disconnect in training dynamics by structuring the learning process like an open system in thermodynamics. Open systems can achieve a steady state when low-entropy inputs are converted to high-entropy outputs. We modify the the model and the learning process to mimic this behavior, and attend more to elements of the input …


Crop Classification In South Korea For Multitemporal Planetscope Imagery Using Sfc-Densenet-Am, Seonkyeong Seong, Anjin Chang, Junsang Mo, Sangil Na, Hoyong Ahn, Jaehong Oh, Jaewan Choi Jan 2024

Crop Classification In South Korea For Multitemporal Planetscope Imagery Using Sfc-Densenet-Am, Seonkyeong Seong, Anjin Chang, Junsang Mo, Sangil Na, Hoyong Ahn, Jaehong Oh, Jaewan Choi

Agricultural and Environmental Sciences Faculty Research

In this manuscript, a new methodology based on a deep learning model using a Siamese network and attention module was proposed to classify crop cultivation areas, such as onion and garlic, from multitemporal PlanetScope images in South Korea. To consider the seasonal characteristics of crops in the model, training data were constructed from multitemporal satellite images. It was generated using PlanetScope satellite imagery from January and April, corresponding to the seasonal growth period of onion and garlic, in South Korea. Image patches were generated by considering the ratio of crops to minimize the influence of imbalanced data in the training …


Advancing Cell Segmentation Methods Through An Innovative Technique And Systematic Evaluation, Yuxing Wang Jan 2024

Advancing Cell Segmentation Methods Through An Innovative Technique And Systematic Evaluation, Yuxing Wang

Theses

In the biomedical field, downstream cell-level analysis is a crucial step for comprehensively characterizing cells' location, function, morphology, and phenotype, relying on the accurate identification of individual cells. Consequently, there is increasing interest in the high quality of identified cell boundaries for biologists or researchers. Currently, cell segmentation methods are categorized into two types: Image-based and RNA-based cell segmentation. However, these methods face significant challenges. For example, most image-based methods tend to identify cell nuclear boundaries rather than entire cell boundaries when only nuclear staining images are available, and their performance is affected by noise in images and training labels. …


Developing Ai-Powered Support For Improving Software Quality, Abdulaziz Hasan M. Alhefdhi Jan 2024

Developing Ai-Powered Support For Improving Software Quality, Abdulaziz Hasan M. Alhefdhi

University of Wollongong Thesis Collection 2017+

The modern scene of software development experiences an exponential growth in the number of software projects, applications and code-bases. As software increases substantially in both size and complexity, software engineers face significant challenges in developing and maintaining high-quality software applications. Therefore, support in the form of automated techniques and tools is much needed to accelerate development productivity and improve software quality.

The rise of Artificial Intelligence (AI) has the potential to bring such support and significantly transform the practices of software development. This thesis explores the use of AI in developing automated support for improving three aspects of software quality: …


Malware Detection With Artificial Intelligence: A Systematic Literature Review, Matthew G. Gaber, Mohiuddin Ahmed, Helge Janicke Jan 2024

Malware Detection With Artificial Intelligence: A Systematic Literature Review, Matthew G. Gaber, Mohiuddin Ahmed, Helge Janicke

Research outputs 2022 to 2026

In this survey, we review the key developments in the field of malware detection using AI and analyze core challenges. We systematically survey state-of-the-art methods across five critical aspects of building an accurate and robust AI-powered malware-detection model: malware sophistication, analysis techniques, malware repositories, feature selection, and machine learning vs. deep learning. The effectiveness of an AI model is dependent on the quality of the features it is trained with. In turn, the quality and authenticity of these features is dependent on the quality of the dataset and the suitability of the analysis tool. Static analysis is fast but is …


A Survey On Few-Shot Class-Incremental Learning, Songsong Tian, Lusi Li, Weijun Li, Hang Ran, Xin Ning, Prayag Tiwari Jan 2024

A Survey On Few-Shot Class-Incremental Learning, Songsong Tian, Lusi Li, Weijun Li, Hang Ran, Xin Ning, Prayag Tiwari

Computer Science Faculty Publications

Large deep learning models are impressive, but they struggle when real-time data is not available. Few-shot class-incremental learning (FSCIL) poses a significant challenge for deep neural networks to learn new tasks from just a few labeled samples without forgetting the previously learned ones. This setup can easily leads to catastrophic forgetting and overfitting problems, severely affecting model performance. Studying FSCIL helps overcome deep learning model limitations on data volume and acquisition time, while improving practicality and adaptability of machine learning models. This paper provides a comprehensive survey on FSCIL. Unlike previous surveys, we aim to synthesize few-shot learning and incremental …


Machine Learning As A Tool For Early Detection: A Focus On Late-Stage Colorectal Cancer Across Socioeconomic Spectrums, Hadiza Galadima, Rexford Anson-Dwamena, Ashley Johnson, Ghalib Bello, Georges Adunlin, James Blando Jan 2024

Machine Learning As A Tool For Early Detection: A Focus On Late-Stage Colorectal Cancer Across Socioeconomic Spectrums, Hadiza Galadima, Rexford Anson-Dwamena, Ashley Johnson, Ghalib Bello, Georges Adunlin, James Blando

Community & Environmental Health Faculty Publications

Purpose: To assess the efficacy of various machine learning (ML) algorithms in predicting late-stage colorectal cancer (CRC) diagnoses against the backdrop of socio-economic and regional healthcare disparities. Methods: An innovative theoretical framework was developed to integrate individual- and census tract-level social determinants of health (SDOH) with sociodemographic factors. A comparative analysis of the ML models was conducted using key performance metrics such as AUC-ROC to evaluate their predictive accuracy. Spatio-temporal analysis was used to identify disparities in late-stage CRC diagnosis probabilities. Results: Gradient boosting emerged as the superior model, with the top predictors for late-stage CRC diagnosis being anatomic site, …


Investigation Of Software Defined Radio And Deep Learning For Ground Penetrating Radar, Yan Zhang Jan 2024

Investigation Of Software Defined Radio And Deep Learning For Ground Penetrating Radar, Yan Zhang

Graduate College Dissertations and Theses

Ground Penetrating Radar (GPR) is a non-invasive geophysical method that uses radar pulses to image the subsurface. This technology is widely used to detect and map subsurface structures, utilities, and features without the need for physical excavation. Traditional GPR systems, which rely on fixed radio frequency electronics like Application-Specific Integrated Circuits (ASICs), have significant limitations in their flexibility and adaptability. Adjusting operational parameters such as waveform, frequency, and modulation schemes is challenging, which is crucial for tailoring performance to specific tasks or conditions. The considerable size and weight of these systems restrict their applicability in harsh or confined spaces where …


A Novel Collaborative Learning Model For Mixed Dentition And Fillings Segmentation In Panoramic Radiographs., Erin Ealba Bumann, Saeed Al-Qarni, Geetha Chandrashekar, Roya Sabzian, Brenda S Bohaty, Yugyung Lee Jan 2024

A Novel Collaborative Learning Model For Mixed Dentition And Fillings Segmentation In Panoramic Radiographs., Erin Ealba Bumann, Saeed Al-Qarni, Geetha Chandrashekar, Roya Sabzian, Brenda S Bohaty, Yugyung Lee

Manuscripts, Articles, Book Chapters and Other Papers

INTRODUCTION: It is critical for dentists to identify and differentiate primary and permanent teeth, fillings, dental restorations and areas with pathological findings when reviewing dental radiographs to ensure that an accurate diagnosis is made and the optimal treatment can be planned. Unfortunately, dental radiographs are sometimes read incorrectly due to human error or low-quality images. While secondary or group review can help catch errors, many dentists work in practice alone and/or do not have time to review all of their patients' radiographs with another dentist. Artificial intelligence may facilitate the accurate interpretation of radiographs. To help support the review of …


Machine Learning To Identify Structural Motifs In Asphaltenes, Arun K. Sharma, Selsela Arsala, James Brady, Madison Franke, Shelby Franke, Supreet Gandhok, Simon-Olivier Gingras, Ana Gomez, Katelyn Huie, Kayla Katz, Samantha Kozlo, Mateo Longoria, Levi Molnar, Nathaly Peña, Sarina Regis Jan 2024

Machine Learning To Identify Structural Motifs In Asphaltenes, Arun K. Sharma, Selsela Arsala, James Brady, Madison Franke, Shelby Franke, Supreet Gandhok, Simon-Olivier Gingras, Ana Gomez, Katelyn Huie, Kayla Katz, Samantha Kozlo, Mateo Longoria, Levi Molnar, Nathaly Peña, Sarina Regis

Biology and Chemistry Faculty Publications and Presentations

Asphaltenes are organic compounds that aggregate in crude oil with two dominant molecular architectures: archipelago and continental. Continental architectures possess a single uniform island structure composed of aromatic rings in contrast to archipelago architectures with aromatic cores interconnected through aliphatic chains. The structural composition of asphaltenes varies globally due to geographical differences, posing challenges in their classification due to a lack of uniformity. This study is the first known exploration of using image-based supervised machine learning, particularly the ResNet-50 neural network, for the binary classification of asphaltenes into continental and archipelago motifs. 255 continental and archipelago models underwent structural augmentations …


Multimodal Fusion For Audio-Image And Video Action Recognition, Muhammad B. Shaikh, Douglas Chai, Syed M. S. Islam, Naveed Akhtar Jan 2024

Multimodal Fusion For Audio-Image And Video Action Recognition, Muhammad B. Shaikh, Douglas Chai, Syed M. S. Islam, Naveed Akhtar

Research outputs 2022 to 2026

Multimodal Human Action Recognition (MHAR) is an important research topic in computer vision and event recognition fields. In this work, we address the problem of MHAR by developing a novel audio-image and video fusion-based deep learning framework that we call Multimodal Audio-Image and Video Action Recognizer (MAiVAR). We extract temporal information using image representations of audio signals and spatial information from video modality with the help of Convolutional Neutral Networks (CNN)-based feature extractors and fuse these features to recognize respective action classes. We apply a high-level weights assignment algorithm for improving audio-visual interaction and convergence. This proposed fusion-based framework utilizes …


Integration Of Infrared Thermography And Deep Learning For Real-Time In-Situ Defect Detection And Rapid Elimination Of Defect Propagation In Material Extrusion, Asef Ishraq Sadaf Jan 2024

Integration Of Infrared Thermography And Deep Learning For Real-Time In-Situ Defect Detection And Rapid Elimination Of Defect Propagation In Material Extrusion, Asef Ishraq Sadaf

Electronic Theses and Dissertations

This study presents a novel approach to overcoming process reliability challenges in Material Extrusion (ME), a prominent additive manufacturing (AM) technique. Despite ME's advantages in cost, versatility, and rapid prototyping, it faces significant barriers to commercial-scale production, primarily due to quality issues such as overextrusion and underextrusion, which compromise final part performance. Traditional manual monitoring methods severely lack the capability to efficiently detect these defects and highlight the necessity for an efficient and real-time monitoring solution. Considering these challenges, an innovative and field-deployable infrared thermography-based in-situ real-time defect detection and feedback control system is proposed in this thesis. A novel …


Physics-Informed Deep Learning With Kalman Filter Mixture For Traffic State Prediction, Niharika Deshpande, Hyoshin (John) Park Jan 2024

Physics-Informed Deep Learning With Kalman Filter Mixture For Traffic State Prediction, Niharika Deshpande, Hyoshin (John) Park

Engineering Management & Systems Engineering Faculty Publications

Accurate traffic forecasting is crucial for understanding and managing congestion for efficient transportation planning. However, conventional approaches often neglect epistemic uncertainty, which arises from incomplete knowledge across different spatiotemporal scales. This study addresses this challenge by introducing a novel methodology to establish dynamic spatiotemporal correlations that captures the unobserved heterogeneity in travel time through distinct peaks in probability density functions, guided by physics-based principles. We propose an innovative approach to modifying both prediction and correction steps of the Kalman Filter (KF) algorithm by leveraging established spatiotemporal correlations. Central to our approach is the development of a novel deep learning model …


Deep Learning One-Class Classification With Support Vector Methods, Hayden D. Hampton Jan 2024

Deep Learning One-Class Classification With Support Vector Methods, Hayden D. Hampton

Graduate Thesis and Dissertation 2023-2024

Through the specialized lens of one-class classification, anomalies–irregular observations that uncharacteristically diverge from normative data patterns–are comprehensively studied. This dissertation focuses on advancing boundary-based methods in one-class classification, a critical approach to anomaly detection. These methodologies delineate optimal decision boundaries, thereby facilitating a distinct separation between normal and anomalous observations. Encompassing traditional approaches such as One-Class Support Vector Machine and Support Vector Data Description, recent adaptations in deep learning offer a rich ground for innovation in anomaly detection. This dissertation proposes three novel deep learning methods for one-class classification, aiming to enhance the efficacy and accuracy of anomaly detection in …


Nonuniform Sampling-Based Breast Cancer Classification, Santiago Posso Jan 2024

Nonuniform Sampling-Based Breast Cancer Classification, Santiago Posso

Theses and Dissertations--Electrical and Computer Engineering

The emergence of deep learning models and their success in visual object recognition have fueled the medical imaging community's interest in integrating these algorithms to improve medical diagnosis. However, natural images, which have been the main focus of deep learning models and mammograms, exhibit fundamental differences. First, breast tissue abnormalities are often smaller than salient objects in natural images. Second, breast images have significantly higher resolutions but are generally heavily downsampled to fit these images to deep learning models. Models that handle high-resolution mammograms require many exams and complex architectures. Additionally, spatially resizing mammograms leads to losing discriminative details essential …


A Wavegan Approach For Mmwave-Based Fanet Topology Optimization, Enas Odat, Hakim Ghazzai, Ahmad Alsharoa Jan 2024

A Wavegan Approach For Mmwave-Based Fanet Topology Optimization, Enas Odat, Hakim Ghazzai, Ahmad Alsharoa

Electrical and Computer Engineering Faculty Research & Creative Works

The integration of dynamic Flying Ad hoc Networks (FANETs) and millimeter Wave (mmWave) technology can offer a promising solution for numerous data-intensive applications, as it enables the establishment of a robust flying infrastructure with significant data transmission capabilities. However, to enable effective mmWave communication within this dynamic network, it is essential to precisely align the steerable antennas mounted on Unmanned Aerial Vehicles (UAVs) with their corresponding peer units. Therefore, it is important to design a novel approach that can quickly determine an optimized alignment and network topology. In this paper, we propose a Generative Adversarial Network (GAN)-based approach, called WaveGAN, …