Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Physical Sciences and Mathematics

University of South Florida

Theses/Dissertations

Classification

Full-Text Articles in Entire DC Network

Gradient Boosting For Survival Analysis With Applications In Oncology, Nam Phuong Nguyen Jan 2020

USF Tampa Graduate Theses and Dissertations

Cancer is one of the deadliest diseases, and the world has been fighting it for decades. An enormous amount of research has been conducted through a wide variety of approaches, ranging from genetic analysis to mathematical modeling. Survival analysis is a well-established methodology frequently used to estimate the survival probability of a patient. Although a large number of methods for survival analysis exist, efficient exploration of a high-dimensional feature space has been challenging due to its computational cost and complexity. This thesis adapts the component-wise gradient boosting algorithms for cancer survival analysis, and also proposes a new …


Fractional Random Weighted Bootstrapping For Classification On Imbalanced Data With Ensemble Decision Tree Methods, Sean Charles Carter Nov 2019

USF Tampa Graduate Theses and Dissertations

Ensemble methods are commonly used for building predictive models for classification. Models that are unstable to perturbations in the training set, such as the decision tree, often see considerable reductions in error when grouped, using bootstrapped resamples of the training data to train many models. The non-parametric bootstrap, however, has limited efficacy when used on severely imbalanced data, especially when the number of observations of one or more classes is exceptionally small. We explore the fractional random weighted bootstrap, which randomly assigns fractional weights to observations, as an alternative resampling procedure in training machine learning ensembles, particularly decision tree …
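The weighting idea in this abstract can be illustrated with a minimal sketch. It assumes the weights are drawn from a uniform Dirichlet distribution and scaled to sum to the sample size, which is one common formulation; the abstract does not state the exact scheme used in the thesis. The function name `fractional_random_weights` and the toy labels are hypothetical.

```python
import random

def fractional_random_weights(n, rng):
    """Draw n fractional weights that sum to n.

    Assumption: weights follow a uniform Dirichlet distribution scaled by n,
    obtained here by normalising independent Exponential(1) draws.
    """
    draws = [rng.expovariate(1.0) for _ in range(n)]
    total = sum(draws)
    return [n * d / total for d in draws]

def resample_indices(n, rng):
    """Classical non-parametric bootstrap: n indices drawn with replacement."""
    return [rng.randrange(n) for _ in range(n)]

rng = random.Random(0)

# Hypothetical, severely imbalanced toy labels: 98 majority, 2 minority.
labels = [0] * 98 + [1] * 2
n = len(labels)

# Fractional weights: every observation keeps a non-zero share,
# so the two minority cases can never drop out of a replicate.
weights = fractional_random_weights(n, rng)
minority_weight = sum(w for w, y in zip(weights, labels) if y == 1)

# Classical resample: minority cases appear only if their indices are drawn.
indices = resample_indices(n, rng)
minority_count = sum(1 for i in indices if labels[i] == 1)

print("minority weight under fractional weighting:", minority_weight)
print("minority count under classical resampling:", minority_count)
```

In an ensemble, one such weight vector would be drawn per base learner and passed to any learner that accepts per-observation weights, for example through the `sample_weight` argument of a scikit-learn decision tree's `fit` method.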


Multimodal Emotion Recognition Using 3D Facial Landmarks, Action Units, And Physiological Data, Diego Fabiano Oct 2019

USF Tampa Graduate Theses and Dissertations

To fully understand the complexities of human emotion, the integration of multiple physical features from different modalities can be advantageous. Considering this, this thesis presents an approach to emotion recognition using handcrafted features that consist of 3D facial data, action units, and physiological data. Each modality was analyzed independently, as was the combination of all modalities, for recognizing human emotion.

This analysis includes the use of principal component analysis to determine which dimensions of the feature vector are most important for emotion recognition. It is shown that the proposed features can be used to recognize emotion accurately and that …


A Machine Learning Approach To Predicting Community Engagement On Social Media During Disasters, Adel Alshehri Jul 2019

USF Tampa Graduate Theses and Dissertations

The use of social media is expanding significantly and can serve a variety of purposes. Over the last few years, users of social media have played an increasing role in the dissemination of emergency and disaster information. It is becoming more common for affected populations and other stakeholders to turn to Twitter to gather information about a crisis when decisions need to be made and action taken. However, social media platforms, especially Twitter, present some drawbacks when it comes to gathering information during disasters. These drawbacks include information overload, messages written in an informal format, the presence …


Change Descriptors For Determining Nodule Malignancy In Lung CT Screening Images, Benjamin Geiger Dec 2018

USF Tampa Graduate Theses and Dissertations

Computed tomography (CT) imagery is an important weapon in the fight against lung cancer; various forms of lung cancer are routinely diagnosed from CT imagery. The growth of the suspect nodule is known to be a prognostic factor in the diagnosis of pulmonary cancer, but the change in other aspects of the nodule, such as its aspect ratio, density, spiculation, or other features usable for machine learning, may also provide prognostic information.

We hypothesized that adding combined feature information from multiple CT image sets separated in time could provide a more accurate determination of nodule malignancy. To this end, we …


On The Feasibility Of Profiling, Forecasting And Authenticating Internet Usage Based On Privacy Preserving NetFlow Logs, Soheil Sarmadi Nov 2018

USF Tampa Graduate Theses and Dissertations

Understanding Internet user behavior and Internet usage patterns is fundamental in developing future access networks and services that meet technical as well as Internet user needs. User behavior is routinely studied and measured, but with different methods depending on the research discipline of the investigator, and these disciplines rarely cross. We tackle this challenge by developing frameworks that use Internet usage statistics as the main features in understanding Internet user behavior, with the purpose of forming a complete picture of that behavior and working towards a unified analysis methodology. In this dissertation we collected Internet usage statistics via …


Application Of Image Recognition Technology To Foraminiferal Assemblage Analyses, Christian Helmut Gfatter Oct 2018

USF Tampa Graduate Theses and Dissertations

Analyses of foraminiferal assemblages involve time-consuming microscopic assessment of sediment samples. Image recognition software, which systematically matches features within sample images against an image library, is widely used in contexts ranging from law enforcement to medical research. At present, scientific applications such as identification of specimens in plankton samples utilize flow-through systems in which samples are suspended in liquid and pass through a beam of light, where the images are captured using transmitted light. Identification of foraminifers generally utilizes reflected light, because most shells are relatively opaque.

My goal was to design and test a protocol to directly …


Real-Time Classification Of Biomedical Signals, Parkinson’s Analytical Model, Abolfazl Saghafi Jun 2017

USF Tampa Graduate Theses and Dissertations

The reach of technological innovation continues to grow, changing all industries as it evolves. In healthcare, technology is increasingly playing a role in almost all processes, from patient registration to data monitoring, from lab tests to self-care tools. The increase in the amount and diversity of generated clinical data requires development of new technologies and procedures capable of integrating and analyzing the large volume of generated information as well as providing support in its interpretation.

To that end, this dissertation focuses on the analysis and processing of biomedical signals, specifically brain and heart signals, using advanced machine learning techniques. That is, the …


Statistical Learning And Behrens-Fisher Distribution Methods For Heteroscedastic Data In Microarray Analysis, Nabin K. Manandhr-Shrestha Mar 2010

USF Tampa Graduate Theses and Dissertations

The aim of the present study is to identify the differentially expressed genes between two different conditions and apply it in predicting the class of new samples using the microarray data. Microarray data analysis poses many challenges to the statisticians because of its high dimensionality and small sample size, dubbed as "small n large p problem". Microarray data has been extensively studied by many statisticians and geneticists. Generally, it is said to follow a normal distribution with equal variances in two conditions, but it is not true in general. Since the number of replications is very small, …