Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 10 of 10

Full-Text Articles in Computer Sciences

Integrating Machine Learning Methods For Medical Diagnosis, Jazmin Quezada Dec 2023

Integrating Machine Learning Methods For Medical Diagnosis, Jazmin Quezada

Open Access Theses & Dissertations

Abstract:The rapid advancement of machine learning techniques has revolutionized the field of medical diagnosis by offering powerful tools to analyze complex data sets and make accurate predictions. In this proposed method, we present a novel approach that integrates machine learning and optimization models to enhance the accuracy of medical diagnoses. Our method focuses on fine-tuning and optimizing the parameters of machine learning algorithms commonly used in medical diagnosis, such as logistic regression, support vector machines, and neural networks. By employing optimization techniques, we systematically explore the parameter space of these algorithms to discover the most optimal configurations. Moreover, by representing …


A Machine Learning Approach To Stochastic Optimal Control, Pablo Ever Avalos May 2022

A Machine Learning Approach To Stochastic Optimal Control, Pablo Ever Avalos

Open Access Theses & Dissertations

Merton's portfolio optimization problem is a well-renowned problem in financial mathematics which seeks to optimize the investment decision for an investor. In the simplest situation, the market consists of a risk-less asset (i.e. a bond) that pays back a relatively low interest rate, and a risky asset (i.e. a stock) that follows a geometric Brownian motion. The optimal allocation strategy of the investor's wealth is found by optimizing the expected utility along the stochastic evolution of the market. This thesis focuses on several different applications of this optimization problem. We look at pre-constructed analytical solutions and showcase the results. We …


Making Valid Inferences With Decision Tree, George Ekow Quaye May 2021

Making Valid Inferences With Decision Tree, George Ekow Quaye

Open Access Theses & Dissertations

HypoThesis testing and Confidence Interval (CI) estimates are key statistics in predicting future values in data analysis. Most often, CI estimates are directly obtained from the summary statistics of a particular statistical methodology output. However, when it comes to the summary of decision tree outputs, these CI estimates are not directly obtained. So a na\"{i}ve way of making node-level inference is to construct a $(1-\alpha) \times 100\%$ confidence interval for a node mean $\bar{y}_t$ using the relation: $\bar{y}_t \, \pm \, z_{1-\alpha/2} \, \frac{s_t}{\sqrt{n_t}}$, where $\bar{y}_t$ is the node mean and $s_t$ is the standard deviation estimates from the decision …


Time-Reflective Text Representations For Semantic Evolution Tracking And Trend Analytics, Roberto Camacho Barranco Jan 2019

Time-Reflective Text Representations For Semantic Evolution Tracking And Trend Analytics, Roberto Camacho Barranco

Open Access Theses & Dissertations

The extraction of significant, relevant, and useful trends from massive document collections, such as a streaming newswire or scientific publications, is a challenging and significant problem in many different fields, including intelligence analysis, recommendation systems, and scientific research. However, techniques that tackle trend analytics of such large text corpora are limited because research that addresses the temporal nature of these publications is still in its early stages. In this work, we first show that it is possible to capture the evolution of a story (or trend) by connecting the dots between different documents in a text corpus. The observed results …


Estimating The Optimal Cutoff Point For Logistic Regression, Zheng Zhang Jan 2018

Estimating The Optimal Cutoff Point For Logistic Regression, Zheng Zhang

Open Access Theses & Dissertations

Binary classification is one of the main themes of supervised learning. This research is concerned about determining the optimal cutoff point for the continuous-scaled outcomes (e.g., predicted probabilities) resulting from a classifier such as logistic regression. We make note of the fact that the cutoff point obtained from various methods is a statistic, which can be unstable with substantial variation. Nevertheless, due partly to complexity involved in estimating the cutpoint, there has been no formal study on the variance or standard error of the estimated cutoff point.

In this Thesis, a bootstrap aggregation method is put forward to estimate the …


Extraction Of Fiber Morphology From Sem Images For Quality Control Of Fiber Reinforced Composites Manufacturing, Md Fashiar Rahman Jan 2018

Extraction Of Fiber Morphology From Sem Images For Quality Control Of Fiber Reinforced Composites Manufacturing, Md Fashiar Rahman

Open Access Theses & Dissertations

The morphology of fibers (e.g. spatial uniformity, orientation, and length) plays a decisive role in determining the material properties or fabrication quality of fiber-reinforced nanocomposites. Hence, determining the morphology becomes a very critical issue in the field of nanocomposite quality control. The conventional way of quality inspection is to take the scanning electron microscopic (SEM) images of the cross-section of composite material and do the visual checking of these SEM images to evaluate the nanofiber alignment and length distribution. But this type of inspection is often subjective, inaccurate and time consuming. Moreover, the extremely small size of nanofibers makes the …


Propagation Of Probabilistic Uncertainty: The Simplest Case (A Brief Pedagogical Introduction), Olga Kosheleva, Vladik Kreinovich Nov 2017

Propagation Of Probabilistic Uncertainty: The Simplest Case (A Brief Pedagogical Introduction), Olga Kosheleva, Vladik Kreinovich

Departmental Technical Reports (CS)

The main objective of this text is to provide a brief introduction to formulas describing the simplest case of propagation of probabilistic uncertainty -- for students who have not yet taken a probability course.


Towards Analytical Techniques For Optimizing Knowledge Acquisition, Processing, Propagation, And Use In Cyberinfrastructure, Leonardo Octavio Lerma Jan 2015

Towards Analytical Techniques For Optimizing Knowledge Acquisition, Processing, Propagation, And Use In Cyberinfrastructure, Leonardo Octavio Lerma

Open Access Theses & Dissertations

For many decades, there has been a continuous progress in science and engineering applications.

A large part of this progress comes from the new knowledge that researchers acquire, propagate, and use. This new knowledge has revolutionized many aspects of our life, from driving to communications to shopping.

Somewhat surprisingly, there is one area of human activity which is the least impacted by the modern technological progress: the very processes of acquiring, processing, and propagating information. When we decide where to place sensors, which algorithm to use for processing the data – we rely mostly on our own intuition and on …


Secondary Structure Prediction Of Long Rna Sequences Based On Inversion Excursions And A Modularized Mapreduce Framework, Daniel Tesfai Yehdego Jan 2012

Secondary Structure Prediction Of Long Rna Sequences Based On Inversion Excursions And A Modularized Mapreduce Framework, Daniel Tesfai Yehdego

Open Access Theses & Dissertations

Ribonucleic acid (RNA) molecules and their secondary structures play important roles in many biological processes including gene expression and regulation. The genomes of many viruses are also RNA molecules. Since secondary structures are crucial for RNA functionality, computational predictions of the RNA secondary structures have been widely studied. However, the tremendous demands on computer memory and computing time for complex secondary structures limit the capability of existing thermodynamically based algorithms for structure predictions to handling only short RNA sequences with a few hundred bases. One approach to overcome this limitation is by first cutting long RNA sequences into shorter, non-overlapping …


Estimating Statistical Characteristics Under Interval Uncertainty And Constraints: Mean, Variance, Covariance, And Correlation, Ali Jalal-Kamali Jan 2011

Estimating Statistical Characteristics Under Interval Uncertainty And Constraints: Mean, Variance, Covariance, And Correlation, Ali Jalal-Kamali

Open Access Theses & Dissertations

In many practical situations, we have a sample of objects of a given type. When we measure the values of a certain quantity x for these objects, we get a sequence of values x1, . . . , xn. When the sample is large enough, then the arithmetic mean E of the values xi is a good approximation for the average value of this quantity for all the objects from this class. Other expressions provide a good approximation to statistical characteristics such as variance, covariance, and correlation.

The values xi come from measurements, and measurement is never absolutely accurate.

Often, …