Open Access. Powered by Scholars. Published by Universities.®

Data Science Commons

Articles 121 - 150 of 1577

Full-Text Articles in Data Science

Quantification Of Landside Congestion In Ports: An Analysis Based On Gps Data, Kumushini Thennakoon, Namal Bandaranayake, Senevi Kiridena, Asela K. Kulatunga Jan 2024

Computer Science Faculty Publications

Hinterland transport is a critical segment in maritime cross-border logistics, which links the end-users of global supply chains to the maritime segment. Truck-based hinterland transport is known to cause congestion in and around ports. This study aimed to quantify the congestion caused by trucks at the Port of Colombo, which has not been a subject of a systematic study. To this end, the study makes use of GPS data. In addition to revealing heavy congestion within the port, the study also reveals significant variations in congestion during different times of the day with the duration of journeys peaking from 1200hrs …


Reducing Generalization Error In Multiclass Classification Through Factorized Cross Entropy Loss, Oleksandr Horban Jan 2024

CMC Senior Theses

This paper introduces Factorized Cross Entropy Loss, a novel approach to multiclass classification which modifies the standard cross entropy loss by decomposing its weight matrix W into two smaller matrices, U and V, where UV is a low rank approximation of W. Factorized Cross Entropy Loss reduces generalization error from the conventional O( sqrt(k / n) ) to O( sqrt(r / n) ), where k is the number of classes, n is the sample size, and r is the reduced inner dimension of U and V.
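The factorization described above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the thesis's implementation: the input dimension `d`, the helper names (`matvec`, `factorized_logits`, `cross_entropy`, `parameter_counts`), and all toy numbers are hypothetical. It computes class scores as U(Vx) instead of Wx and compares the number of weights in a full k x d matrix with the number in the two factors.

```python
import math

def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]

def factorized_logits(U, V, x):
    """Class scores z = U (V x), with U of shape k x r and V of shape r x d."""
    return matvec(U, matvec(V, x))

def cross_entropy(logits, label):
    """Softmax cross entropy for one example, using a stable log-sum-exp."""
    peak = max(logits)
    log_norm = peak + math.log(sum(math.exp(z - peak) for z in logits))
    return log_norm - logits[label]

def parameter_counts(k, d, r):
    """Number of weights in a full k x d matrix versus the factors U and V."""
    return k * d, r * (k + d)

# Toy example (hypothetical numbers): k = 3 classes, d = 4 features, r = 2.
U = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]              # k x r
V = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]      # r x d
x = [2.0, 1.0, 5.0, -3.0]

logits = factorized_logits(U, V, x)
loss = cross_entropy(logits, label=2)
```

With k = 1000, d = 512, and r = 16 (again hypothetical), `parameter_counts` shows the factors holding far fewer weights than the full matrix, which is the intuition behind replacing k with r in the bound.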


Towards Algorithmic Justice: Human Centered Approaches To Artificial Intelligence Design To Support Fairness And Mitigate Bias In The Financial Services Sector, Jihyun Kim Jan 2024

CMC Senior Theses

Artificial Intelligence (AI) has positively transformed the financial services sector but has also introduced AI biases against protected groups, amplifying existing prejudices against marginalized communities. The financial decisions made by biased algorithms could have life-changing ramifications in applications such as lending and credit scoring. Human Centered AI (HCAI) is an emerging concept in which AI systems seek to augment, not replace, human abilities while preserving human control to ensure transparency, equity, and privacy. The evolving field of HCAI shares common ground with, and can be enhanced by, Human Centered Design principles in that they both put humans, the user, at …


Xgboost Hyperberd Model Using Steam Platform, Yuh-Haur Chen Jan 2024

Data Science and Data Mining

This project investigates game pricing strategies in the Steam market using an XGBoost model, drawing motivation from Professor Xie's lecture, and presenting findings through a density plot that delineates two primary pricing strategies. A free-to-play approach, indicated by a significant hot spot, is adopted by developers focusing on post-purchase revenues through DLC, aesthetic purchases, and in-game transactions. This sailing strategy includes community-centric developers aiming to distribute their games for player engagement rather than profit.

The project illustrates the effectiveness of advanced modeling techniques in handling complex datasets, with significant predictive accuracy reflected by a reduced MSE from 0.3472 to 0.1397. …


Combating Cyberbullying On Social Media: A Machine Learning Approach With Text Analysis On Twitter, Amir Alipour Yengejeh Jan 2024

Data Science and Data Mining

The popularity of mobile electronic devices, social media, and networking websites has increased tremendously in recent years. Most people around the world engage daily in a variety of cyberspace activities. Although users benefit from these systems by exchanging ideas and information, socializing, and enjoying themselves, they may also face adverse behaviors such as toxicity, bullying, extremism, and cruelty. Recent statistics report that such behaviors have grown noticeably in cyberspace, to the point that they can threaten individuals and even whole communities. Thus, …


Machine Learning Approaches For Cyberbullying Detection, Roland Fiagbe Jan 2024

Data Science and Data Mining

Cyberbullying refers to the act of bullying using electronic means and the internet. In recent years, it has been identified as a major problem among young people and even adults. It can negatively impact one’s emotions and lead to adverse outcomes like depression, anxiety, harassment, and suicide, among others. This has led to the need to employ machine learning techniques to automatically detect and prevent cyberbullying on various social media platforms. In this study, we want to analyze the combination of some Natural Language Processing (NLP) algorithms (such as Bag-of-Words and TF-IDF) with some popular machine learning …


Diagnostic In Neuroimaging: A Comparative Study Of Deep Learning And Traditional Approaches, Amina Issoufou Anaroua Jan 2024

Data Science and Data Mining

In the realm of medical diagnostics, precise classification of brain tumors is pivotal. This study conducts a comprehensive comparative analysis of a Convolutional Neural Network (CNN) against traditional machine learning models, Logistic Regression (LR) and Support Vector Machines (SVM) on a dataset of MRI scans for multi-class brain tumor classification. The CNN, tailored for image recognition, is evaluated alongside LR and SVM, which have established benchmarks in classification tasks. The investigation reveals that the traditional models hold their ground in terms of precision and interpretability, with the SVM, in particular, achieving remarkable accuracy. However, the CNN distinguishes itself by demonstrating …


Optimizing Ai With Advanced Data Structuring: A Comparative Analysis Of K-Means And Gmm Clustering Techniques, Amir Alipour Yengejeh Jan 2024

Data Science and Data Mining

This study presents a detailed comparison of K-means and Gaussian Mixture Model (GMM) clustering algorithms, illustrating their unique capabilities and limitations across various synthetic datasets. By utilizing metrics such as the Adjusted Rand Index (ARI) and Normalized Mutual Information (NMI), the research provides nuanced insights into how these algorithms handle datasets with varying structures and complexities. For instance, while both K-means and GMM show robust performance on well-separated clusters, GMM demonstrates a distinct advantage in scenarios with overlapping clusters or unbalanced data distributions. Conversely, K-means excels in identifying clear, distinct groupings, highlighting its utility in simpler clustering contexts. This study …
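For readers unfamiliar with the Adjusted Rand Index mentioned above, the following is a minimal sketch of its standard pair-counting definition using only the Python standard library. It is not the study's code; the function name `adjusted_rand_index` and the toy label lists are assumptions made for illustration. The score is 1 when two labelings agree up to a renaming of clusters and is close to 0 (possibly negative) when agreement is no better than chance.

```python
from collections import Counter
from math import comb

def adjusted_rand_index(labels_true, labels_pred):
    """Adjusted Rand Index between two flat clusterings of the same items."""
    if len(labels_true) != len(labels_pred):
        raise ValueError("label lists must have the same length")
    total_pairs = comb(len(labels_true), 2)
    if total_pairs == 0:
        return 1.0  # zero or one item: the labelings trivially agree

    # Contingency counts: how many items share each (true, predicted) pair.
    pair_counts = Counter(zip(labels_true, labels_pred))
    sum_pairs = sum(comb(c, 2) for c in pair_counts.values())
    sum_true = sum(comb(c, 2) for c in Counter(labels_true).values())
    sum_pred = sum(comb(c, 2) for c in Counter(labels_pred).values())

    expected = sum_true * sum_pred / total_pairs   # agreement expected by chance
    max_index = (sum_true + sum_pred) / 2          # agreement of a perfect match
    if max_index == expected:
        return 1.0  # degenerate case, e.g. both labelings use a single cluster
    return (sum_pairs - expected) / (max_index - expected)

# Toy labelings (hypothetical): same partition with swapped cluster names.
same_partition = adjusted_rand_index([0, 0, 1, 1], [1, 1, 0, 0])
# Toy labelings (hypothetical): partitions that cut across each other.
crossed_partition = adjusted_rand_index([0, 0, 1, 1], [0, 1, 0, 1])
```

Because the index only looks at which pairs of points are grouped together, it does not matter that K-means and a GMM may number their clusters differently.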


Enhancing Flight Delay Predictions Using Network Centrality Measures, Joseph Ajayi Jan 2024

Electronic Theses and Dissertations

Accurate prediction of flight delays remains a formidable challenge within the aviation industry, owing to its inherent complexity and the interconnectivity of its operations. Traditional flight prediction methods frequently utilize meteorological conditions—such as temperature, humidity, and dew point—alongside flight-specific data like departure and arrival times. However, these predictors often fall short of capturing the nuanced dynamics that lead to delays. This thesis introduces network centrality measures as novel predictors for enhancing the binary classification of flight arrival delays. Furthermore, it emphasizes the application of tree-based ensemble models, which are recognized for their superior ability to model complex relationships compared to …


Classification In Supervised Statistical Learning With The New Weighted Newton-Raphson Method, Toma Debnath Jan 2024

Electronic Theses and Dissertations

In this thesis, the Weighted Newton-Raphson Method (WNRM), an innovative optimization technique, is introduced in statistical supervised learning for categorization and applied to a diabetes predictive model to find maximum likelihood estimates. The iterative optimization method solves nonlinear systems of equations with singular Jacobian matrices and is a modification of the ordinary Newton-Raphson algorithm. The quadratic convergence of the WNRM and its high efficiency in optimizing nonlinear likelihood functions whenever the Jacobians are singular allow for easy inclusion in classical categorization and generalized linear models such as the logistic regression model in supervised learning. The WNRM is thoroughly investigated …


Performing Holt-Winters Time Series Forecasting Using Neural Network Based Models, Kazeem Olanrewaju Bankole Jan 2024

Electronic Theses and Dissertations

We show how to create Artificial Neural Network based models for performing the well-known Holt-Winters time series analysis. Our approach compares well with the classical Holt-Winters time series prediction method while avoiding the burden of searching for the parameters of the model. We present the theoretical justification of the connection between the two models and experimental results showing the similarities of these models.
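As background, the classical additive Holt-Winters recursion that the neural models are compared against can be sketched as follows. This is a textbook-style illustration, not the thesis's neural network; the function name, the simple initialization scheme, and the hand-picked smoothing parameters `alpha`, `beta`, and `gamma` are all assumptions. Those three parameters are exactly what normally has to be searched for.

```python
def holt_winters_additive(series, season_len, alpha, beta, gamma, horizon):
    """Forecast `horizon` steps ahead with additive level, trend, and seasonality."""
    m = season_len
    if len(series) < 2 * m:
        raise ValueError("need at least two full seasons of data")

    # Simple initialization from the first two seasons (one of several conventions).
    level = sum(series[:m]) / m
    trend = (sum(series[m:2 * m]) - sum(series[:m])) / (m * m)
    seasonals = [series[i] - level for i in range(m)]

    # Exponential smoothing updates for each later observation.
    for t in range(m, len(series)):
        value = series[t]
        season = seasonals[t % m]
        previous_level = level
        level = alpha * (value - season) + (1 - alpha) * (level + trend)
        trend = beta * (level - previous_level) + (1 - beta) * trend
        seasonals[t % m] = gamma * (value - level) + (1 - gamma) * season

    n = len(series)
    return [level + h * trend + seasonals[(n + h - 1) % m]
            for h in range(1, horizon + 1)]

# Toy data (hypothetical): a repeating pattern with period 4 and no trend.
history = [1.0, 2.0, 3.0, 4.0] * 3
forecast = holt_winters_additive(history, season_len=4,
                                 alpha=0.3, beta=0.1, gamma=0.2, horizon=4)
```

On this purely periodic toy series the forecast reproduces the next cycle; on real data the quality depends strongly on the three smoothing parameters.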


Simulation Of Wave Propagation In Granular Particles Using A Discrete Element Model, Syed Tahmid Hussan Jan 2024

Electronic Theses and Dissertations

Understanding the Bender Element mechanism and using Particle Flow Code (PFC) to simulate seismic wave behavior are important for testing the dynamic behavior of soil particles. Both discrete and finite element methods can be used to simulate wave behavior. However, the Discrete Element Method (DEM) is more suitable, as micro-scale soil particles cannot be treated as a continuous specimen like a rod or a piece of aluminum. Recently, DEM has been widely used to study the mechanical properties of soils at the particle level, treating the particles as balls. This study presents a comparative analysis of Voigt and Best …


Statistical Modeling Of Bankruptcy Data, Andrew Elsfelder Jan 2024

Williams Honors College, Honors Research Projects

My project uses a dataset of bankrupt and non-bankrupt companies in Taiwan from 1999 to 2009. This data was collected from the Taiwan Economic Journal. The statistical methods I used to model the data are CHAID, CART, and logistic regression. The models created are tools that can predict if a company is bankrupt, or not-bankrupt based on other data about the company. I created multiple models for each of the methods to find the best model for each method. I then analyzed the output from each method. Lastly, I determined which model was the best for this data based on …


In Pursuit Of Consumption-Based Forecasting, Charles Chase, Kenneth B. Kahn Jan 2024

Marketing Faculty Publications

[Introduction] Today's most mature, most sophisticated, best-in-class forecasting is what we call consumption-based forecasting (CBF). In contrast, the least sophisticated companies typically do not forecast at all, but rather set financial targets based on management expectations. Companies beginning to use statistical forecasting techniques usually take a supply-centric orientation, relying on time series techniques applied to shipment and/or order history. The next stage of progression is to incorporate promotions data, economic data, and market data alongside supply-centric data so that regression and other advanced analytics can be used. Companies pursuing CBF utilize even more advanced capabilities to capture, examine, and understand …


Data Driven And Machine Learning Based Modeling And Predictive Control Of Combustion At Reactivity Controlled Compression Ignition Engines, Behrouz Khoshbakht Irdmousa Jan 2024

Dissertations, Master's Theses and Master's Reports

Reactivity Controlled Compression Ignition (RCCI) engines have the capacity to provide higher thermal efficiency, lower particulate matter (PM), and lower oxides of nitrogen (NOx) emissions compared to conventional diesel combustion (CDC) operation. Achieving these benefits is difficult since real-time optimal control of RCCI engines is challenging during transient operation. To overcome these challenges, data-driven machine learning based control-oriented models are developed in this study. These models are based on the Linear Parameter-Varying (LPV) modeling approach and the input-output based Kernelized Canonical Correlation Analysis (KCCA) approach. The developed dynamic models are used to predict combustion timing (CA50), indicated mean effective pressure (IMEP), …


Mhair: A Dataset Of Audio-Image Representations For Multimodal Human Actions, Muhammad Bilal Shaikh, Douglas Chai, Syed M. S. Islam, Naveed Akhtar Jan 2024

Research outputs 2022 to 2026

The audio-image representations for multimodal human actions (MHAiR) dataset contains six different image representations of audio signals that capture the temporal dynamics of the actions in a very compact and informative way. The dataset was extracted from audio recordings taken from an existing video dataset, i.e., UCF101. Each data sample is approximately 10 s long, and the overall dataset was split into 4893 training samples and 1944 testing samples. The resulting feature sequences were then converted into images, which can be used for human action recognition and other related tasks. These images can …


Classification Models Using Python In Industrial/Organizational Psychology, Beyza Ceylan Jan 2024

Williams Honors College, Honors Research Projects

Companies, industries, and places of business use artificial intelligence and statistics to predict the characteristics of their employees and staff. Data collected from these individuals is also used to make decisions about them regarding their work life, such as promotions, salaries, or within the hiring process. Two models that are commonly used throughout the field of psychology and specifically in industrial/organizational psychology are the linear regression and the logistic regression. Examining different classification models using Python shows the potential that there may be different models that are more accurate in their predictions of employee success, including a Random Forest model …


Data Science In Finance: Challenges And Opportunities, Xianrong Zheng, Elizabeth Gildea, Sheng Chai, Tongxiao Zhang, Shuxi Wang Jan 2024

Information Technology & Decision Sciences Faculty Publications

Data science has become increasingly popular due to emerging technologies, including generative AI, big data, deep learning, etc. It can provide insights from data that are hard to determine from a human perspective. Data science in finance helps to provide more personal and safer experiences for customers and develop cutting-edge solutions for a company. This paper surveys the challenges and opportunities in applying data science to finance. It provides a state-of-the-art review of financial technologies, algorithmic trading, and fraud detection. Also, the paper identifies two research topics. One is how to use generative AI in algorithmic trading. The other is …


Judging Our New Judges: Why We Must Remove Artificial Intelligence From Our Courtrooms Now, Kieran Duffy Newcomb Jan 2024

Honors Theses and Capstones

In this paper, I explore some of the ways in which artificial intelligence might enhance the sentencing process through recidivism prediction technology. Notably, this technology can increase the accuracy of risk predictions and the speed with which sentencing decisions are reached. I then show, however, that the recidivism prediction technology is likely to turn into what data scientist Cathy O’Neil calls a Weapon of Math Destruction. The potential harmfulness of this technology is due not to the inherent nature of the technology, but the symbiotic relationship it will have with our already harmful criminal justice system. I argue that the …


Comparison Of Support Vector Machine (Svm), K-Nearest Neighbor (K-Nn), And Stochastic Gradient Descent (Sgd) For Classifying Corn Leaf Disease Based On Histogram Of Oriented Gradients (Hog) Feature Extraction, Firdaus Solihin, Muhammad Syarief, Eka Mala Sari Rochman, Aeri Rachmad Dec 2023

Elinvo (Electronics, Informatics, and Vocational Education)

Image classification involves categorizing an image's pixels into specific classes based on their unique characteristics. It has diverse applications in everyday life. One such application is the classification of diseases on corn leaves. Corn is a widely consumed staple food in Indonesia, and healthy corn plants are crucial for meeting market demands. Currently, disease identification in corn plants relies on manual checks, which are time-consuming and less effective. This research aims to automate disease identification on corn leaves using the Support Vector Machine (SVM), K-Nearest Neighbor (K-NN) with K=2, and Stochastic Gradient Descent (SGD) algorithms. The classification process utilizes the …


Classification Of Organic And Inorganic Waste Types Based On Neural Networks, Fatchul Arifin, M. Habiburrahman, Wahyu Ramadhani Gusti Dec 2023

Elinvo (Electronics, Informatics, and Vocational Education)

Garbage is the residue of unused industrial production and household consumption. In Indonesia, waste is divided into 2 types, namely organic and inorganic waste. The two types of waste can be recycled in diverse ways, so they must be separated. So far, it is often difficult for the community to sort waste. This paper presents the process of recognizing and sorting waste automatically by utilizing Artificial Intelligence technology, especially Artificial Neural Networks (ANN). The ANN architecture used in this study consists of 4 layers. The number of neurons in each layer consists of 3 neurons in the input layer, 4 …


Soybean Collect Recommender Based On Distance And Productivity Cluster Using K-Means Clustering And Simple Addictive Weighting Method, Mega Wahyu Ningtyas, Feddy Setio Pribadi Dec 2023

Elinvo (Electronics, Informatics, and Vocational Education)

Soybeans are an essential agricultural product and one of the primary food sources in Indonesia, used in tempeh, tofu, soy milk, soy sauce, and other preparations. However, production yields, harvested land area, and soybean productivity in each district or city in Central Java Province vary widely. Differences in soybean productivity in each area are due to production factors such as area, use of fertilizers, seeds, and labor. This study tries to provide recommendations for soybean harvesting based on the distance and productivity of an area using K-means clustering and the simple additive weighting method. In the Central Java Province, …
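The weighting step can be illustrated with a minimal sketch of simple additive weighting (SAW) in its common textbook form. It is not the authors' implementation: the district names, the distance and productivity figures, and the criterion weights below are all hypothetical, and the K-means stage is omitted. Distance is treated as a cost criterion (smaller is better) and productivity as a benefit criterion (larger is better); all values are assumed to be positive.

```python
def saw_scores(alternatives, weights, is_benefit):
    """Score alternatives by the weighted sum of normalized criteria.

    alternatives: dict mapping a name to a tuple of positive criterion values
    weights:      one weight per criterion (assumed here to sum to 1)
    is_benefit:   True for benefit criteria, False for cost criteria
    """
    names = list(alternatives)
    columns = list(zip(*(alternatives[name] for name in names)))
    scores = {}
    for name in names:
        total = 0.0
        for value, column, weight, benefit in zip(
                alternatives[name], columns, weights, is_benefit):
            # Benefit: divide by the column maximum. Cost: column minimum over value.
            normalized = value / max(column) if benefit else min(column) / value
            total += weight * normalized
        scores[name] = total
    return scores

# Hypothetical data: (distance in km, productivity in tonnes per hectare).
districts = {"A": (10.0, 2.0), "B": (20.0, 4.0), "C": (40.0, 3.0)}
scores = saw_scores(districts, weights=(0.4, 0.6), is_benefit=(False, True))
ranking = sorted(scores, key=scores.get, reverse=True)
```

Shifting weight from productivity to distance would favour nearer districts, so the chosen weights encode the trade-off the recommender is meant to make.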


Reducing Food Scarcity: The Benefits Of Urban Farming, S.A. Claudell, Emilio Mejia Dec 2023

Journal of Nonprofit Innovation

Urban farming can enhance the lives of communities and help reduce food scarcity. This paper presents a conceptual prototype of an efficient urban farming community that can be scaled for a single apartment building or an entire community across all global geoeconomic regions, including densely populated cities and rural, developing towns and communities. When deployed in coordination with smart crop choices, local farm support, and efficient transportation, the result is not just sustainability but also increased fresh produce accessibility, optimized nutritional value, elimination of ‘forever chemicals’, reduced transportation costs, and global environmental benefits.

Imagine Doris, who is …


Tiny Machine Learning For Underwater Image Enhancement: Pruning And Quantizaition Approach, Dr Khaled Nagaty, The British University In Egypt, Andreas Pester Dr Dec 2023

Computer Science

Many people have expressed an interest in underwater image processing in a variety of fields, including underwater vehicle control, archaeology, marine biological studies, etc. Underwater exploration is becoming an increasingly important element of our lives, with applications ranging from underwater marine and creature research to pipeline and communication logistics, military use, touristic and entertainment use. Underwater images suffer from poor visibility, distortion, and poor quality for a variety of causes, including light propagation. The major issue arises when these images must be captured at depths greater than 500 feet and artificial lighting needs to be provided. Efficient algorithms and models …


Interpretable Word-Level Sentiment Analysis With Attention-Based Multiple Instance Classification Models, Chenyu Yang Dec 2023

Statistical Science Theses and Dissertations

In this study, our main objective is to tackle the black-box nature of popular machine learning models in sentiment analysis and enhance model interpretability. We aim to gain more insight into the decision-making process of sentiment analysis models, which is often obscure in those complex models. To achieve this goal, we introduce two word-level sentiment analysis models.

The first model is called the attention-based multiple instance classification (AMIC) model. It combines the transparent model structure of multiple instance classification and the self-attention mechanism in deep learning to incorporate the contextual information from documents. As demonstrated by a wine review dataset …


Roadside Lidar Data Processing For Intelligent Transportation System, Md Parvez Mollah Dec 2023

Computer Science ETDs

Roadside LiDAR (Light Detection and Ranging) sensors are being explored for Intelligent Transportation Systems, aiming at safer and faster traffic management and vehicular operations. However, massive data volume, occlusion, and limited viewing angles are significant obstacles to the widespread use of roadside LiDARs. In this dissertation, we address three major challenges to enable applications of Intelligent Transportation Systems through roadside LiDAR data: (i) real-time transmission of the massive point-cloud data from the roadside LiDAR devices to the cloud using a 5G network, (ii) mitigating the sensor occlusion problem to increase coverage and detect events that occur in occluded regions of a sensor, …


Utilizing Multitask Transfer Learning For Sonographic Rheumatoid Arthritis Synovitis Grading, Jordan Marie Claire Sanders Dec 2023

Doctoral Dissertations and Master's Theses

Classifying the four sonographic Rheumatoid Arthritis (RA) synovitis grades (Grade 0, Grade 1, Grade 2, and Grade 3) is a difficult problem due to the complexity of the relevant markers. Therefore, the current research proposes a Multitask Transfer Learning (MTL) framework for sonographic RA synovitis grading of Ultrasound (US) images in Brightness mode (B-Mode) and Power Doppler mode.

In the medical community, the lack of reliability of scoring these images has been an issue and reason for concern for doctors and other medical practitioners. The human/machine variability across the acquisition procedure of these US images creates an additional challenge that …


Learning Mortality Risk For Covid-19 Using Machine Learning And Statistical Methods, Shaoshi Zhang Dec 2023

Electronic Thesis and Dissertation Repository

This research investigates the mortality risk of COVID-19 patients across different variant waves, using the data from Centers for Disease Control and Prevention (CDC) websites. By analyzing the available data, including patient medical records, vaccination rates, and hospital capacities, we aim to discern patterns and factors associated with COVID-19-related deaths.

To explore features linked to COVID-19 mortality, we employ different techniques such as Filter, Wrapper, and Embedded methods for feature selection. Furthermore, we apply various machine learning methods, including support vector machines, decision trees, random forests, logistic regression, K-nearest neighbours, naïve Bayes methods, and artificial neural networks, to uncover underlying …


Ohio Recovery Housing: Resident Risk And Outcomes Assessment, Elyjiah Potter, Bivin Sadler Dec 2023

SMU Data Science Review

Addiction and substance abuse disorder is a significant problem in the United States. Over the past two decades, the United States has faced a boom in substance abuse, which has resulted in an increase in deaths and the disruption of families across the nation. The State of Ohio has been particularly hard hit by the crisis, with overdose rates nearly double the national average. Established in the mid-1970s, Sober Living Housing is an alcohol and substance use recovery model emphasizing personal responsibility, sober living, and community support. This model has been adopted by the Ohio Recovery Housing organization, which seeks …


Deep Learning Image Analysis To Isolate And Characterize Different Stages Of S-Phase In Human Cells, Kevin A. Boyd, Rudranil Mitra, John Santerre, Christopher L. Sansam Dec 2023

SMU Data Science Review

This research used deep learning image analysis to isolate and characterize distinct DNA replication patterns in human cells. By leveraging high-resolution microscopy images of multiple cells stained with 5-Ethynyl-2′-deoxyuridine (EdU), a replication marker, this analysis utilized Convolutional Neural Networks (CNNs) to perform image segmentation and to provide robust and reliable classification results. First, multiple cells in a field of focus were identified using a pretrained CNN called Cellpose. Once each cell had been located in the image, a Python script cropped each cell into an individual .tif file. After careful annotation, a CNN was …