Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 10 of 10

Full-Text Articles in Physical Sciences and Mathematics

On The Use Of Minimum Penalties In Statistical Learning, Ben Sherwood, Bradley S. Price Jul 2021

On The Use Of Minimum Penalties In Statistical Learning, Ben Sherwood, Bradley S. Price

Faculty & Staff Scholarship

Modern multivariate machine learning and statistical methodologies estimate parameters of interest while leveraging prior knowledge of the association between outcome variables. The methods that do allow for estimation of relationships do so typically through an error covariance matrix in multivariate regression which does not scale to other types of models. In this article we proposed the MinPEN framework to simultaneously estimate regression coefficients associated with the multivariate regression model and the relationships between outcome variables using mild assumptions. The MinPen framework utilizes a novel penalty based on the minimum function to exploit detected relationships between responses. An iterative algorithm that …


Association Of Incident Cancer To Low-Value Care And Healthcare Cost Burden Among Elderly Medicare Beneficiaries, Chibuzo Iloabuchi Jan 2021

Association Of Incident Cancer To Low-Value Care And Healthcare Cost Burden Among Elderly Medicare Beneficiaries, Chibuzo Iloabuchi

Graduate Theses, Dissertations, and Problem Reports

In the United States (US), 25% of healthcare spending is considered wasteful because it is spent reimbursing low-value care. Low-value care is the utilization of healthcare services, medical tests, and procedures that have unclear or no clinical benefit to patients but still exposes them to risk. World-wide, low-value care imposes a significant economic burden on patients, payers, governments, and society. Cancer care among older adults > 65 years is one of the biggest drivers of healthcare expenditure in the US and accounts for nearly 40% of all spending, and low-value care among cancer patients is prevalent and contributes to the financial …


Topic Modeling And Cultural Nature Of Citations, Marie Coraline Dumaz Jan 2021

Topic Modeling And Cultural Nature Of Citations, Marie Coraline Dumaz

Graduate Theses, Dissertations, and Problem Reports

Ever since the beginning of research journals, the number of academic publications has been increasing steadily. Nowadays, especially, with the new importance of online open-access journals and databases, research papers are more easily available to read and share. It also becomes harder to keep up with novelties and grasp an idea of the general impact of a given researcher, institution, journal, or field. For this reason, different bibliometric indicators are now routinely used to classify and evaluate the impact or significance of individual researchers, conferences, journals, or entire scientific communities. In this thesis, we provide tools to study trends in …


Ensemble Encoder-Decoder Models For Predicting Land Transformation, Pariya Pourmohammadi Jan 2021

Ensemble Encoder-Decoder Models For Predicting Land Transformation, Pariya Pourmohammadi

Graduate Theses, Dissertations, and Problem Reports

In studying dynamic and complex processes which are influenced by a system of inter-connected driving variables, it is crucial to apply models that can learn the complexity of the interactions. Land transformation is one of such complex processes, prediction of which can help to mitigate severe climate situations and improve the resiliency of communities. In this study, a multi-spectral set of data cubes is used to capture various characteristics of a geographic region. Based on the data cube, a feature space is constructed using socio-economic attributes, terrain characteristics, and landscape traits of the study region. Two-dimensional and three-dimensional convolutional neural …


Identification And Classification Of Radio Pulsar Signals Using Machine Learning, Di Pang Jan 2021

Identification And Classification Of Radio Pulsar Signals Using Machine Learning, Di Pang

Graduate Theses, Dissertations, and Problem Reports

Automated single-pulse search approaches are necessary as ever-increasing amount of observed data makes the manual inspection impractical. Detecting radio pulsars using single-pulse searches, however, is a challenging problem for machine learning because pul- sar signals often vary significantly in brightness, width, and shape and are only detected in a small fraction of observed data.

The research work presented in this dissertation is focused on development of ma- chine learning algorithms and approaches for single-pulse searches in the time domain. Specifically, (1) We developed a two-stage single-pulse search approach, named Single- Pulse Event Group IDentification (SPEGID), which automatically identifies and clas- …


Analysis And Classification Of Software Fault-Proneness And Vulnerabilities, Mohammad Jamil Ahmad Jan 2021

Analysis And Classification Of Software Fault-Proneness And Vulnerabilities, Mohammad Jamil Ahmad

Graduate Theses, Dissertations, and Problem Reports

Software bugs are expensive to fix and can lead to catastrophic consequences. Therefore, their analysis and the use of machine learning for prediction are of the utmost importance. Many prediction models have been proposed and different factors affecting the prediction performance have been extensively studied. This work addresses four topics in two areas in software engineering: software fault-proneness prediction and analysis and classification of security-related bug reports. The first topic focuses on the effect of the learning approach (i.e., the way software fault-proneness prediction models are trained and tested) on the performance of software fault-proneness prediction which lacks extensive research …


Review Of Forecasting Univariate Time-Series Data With Application To Water-Energy Nexus Studies & Proposal Of Parallel Hybrid Sarima-Ann Model, Cory Sumner Yarrington Jan 2021

Review Of Forecasting Univariate Time-Series Data With Application To Water-Energy Nexus Studies & Proposal Of Parallel Hybrid Sarima-Ann Model, Cory Sumner Yarrington

Graduate Theses, Dissertations, and Problem Reports

The necessary materials for most human activities are water and energy. Integrated analysis to accurately forecast water and energy consumption enables the implementation of efficient short and long-term resource management planning as well as expanding policy and research possibilities for the supportive infrastructure. However, the integral relationship between water and energy (water-energy nexus) poses a difficult problem for modeling. The accessibility and physical overlay of data sets related to water-energy nexus is another main issue for a reliable water-energy consumption forecast. The framework of urban metabolism (UM) uses several types of data to build a global view and highlight issues …


Plant Species Identification In The Wild Based On Images Of Organs, Meghana Kovur Jan 2021

Plant Species Identification In The Wild Based On Images Of Organs, Meghana Kovur

Graduate Theses, Dissertations, and Problem Reports

Image-based plant species identification in the wild is a difficult problem for several reasons. First, the input data is subject to a very high degree of variability because it is captured under fully unconstrained conditions. The same plant species may look very different in different images, while different species can often appear very similar, challenging even the recognition skills of human experts in the field. The large intra-class and small inter-class image variability makes this a fine-grained visual classification problem. One way to cope with this variability and to reduce image background noise is to predict species based on the …


Estimating The Azimuthal Mode Structure Of Ultra Low Frequency Waves And Its Effects On The Radial Diffusion Of Radiation Belt Electrons, Mohammad Barani Jan 2021

Estimating The Azimuthal Mode Structure Of Ultra Low Frequency Waves And Its Effects On The Radial Diffusion Of Radiation Belt Electrons, Mohammad Barani

Graduate Theses, Dissertations, and Problem Reports

Characterizing the azimuthal mode number 𝑚 of Ultra Low Frequency (ULF) waves is critical to quantifying the radial diffusion of radiation belt electrons. A Wavelet cross-spectral technique is applied to the compressional ULF waves observed by multiple pairs of GOES and MMS satellites to estimate the mode structure of ULF waves. A more realistic distribution of mode numbers is achieved by inclusion of the modes corresponding to different wave propagation directions as well as at 𝑚 higher than fundamental mode number. For the event study of a geomagnetic storm using GOES data, ULF wave power is found to dominate at …


Searching Harder, Localizing Better, Classifying Faster: Optimizing Fast Radio Burst Detection And Analysis, Kshitij Aggarwal Jan 2021

Searching Harder, Localizing Better, Classifying Faster: Optimizing Fast Radio Burst Detection And Analysis, Kshitij Aggarwal

Graduate Theses, Dissertations, and Problem Reports

Fast Radio Bursts (or FRBs) are millisecond-duration transients of extragalactic origin. They exhibit dispersion caused by propagation through an ionized medium, and quantified by Dispersion Measure (DM). Around 800 FRBs (24 repeaters) have been discovered; so far, 24 FRBs have been confidently associated with a host galaxy. In this thesis, we discuss multiple new FRB search and analysis techniques and the corresponding tools that enable us to search for FRBs harder, localize them better, and classify candidates faster.

We discuss five open-source software suites that can be used in FRB analysis. These suites are used to distinguish between FRBs and …