Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 22 of 22

Full-Text Articles in Physical Sciences and Mathematics

The Impact Of Data Preparation And Model Complexity On The Natural Language Classification Of Chinese News Headlines, Torrey J. Wagner, Dennis Guhl, Brent T. Langhals Mar 2024

The Impact Of Data Preparation And Model Complexity On The Natural Language Classification Of Chinese News Headlines, Torrey J. Wagner, Dennis Guhl, Brent T. Langhals

Faculty Publications

Given the emergence of China as a political and economic power in the 21st century, there is increased interest in analyzing Chinese news articles to better understand developing trends in China. Because of the volume of the material, automating the categorization of Chinese-language news articles by headline text or titles can be an effective way to sort the articles into categories for efficient review. A 383,000-headline dataset labeled with 15 categories from the Toutiao website was evaluated via natural language processing to predict topic categories. The influence of six data preparation variations on the predictive accuracy of four algorithms was …


Lightning Forecast From Chaotic And Incomplete Time Series Using Wavelet De-Noising And Spatiotemporal Kriging, Jared K. Nystrom, Raymond Hill, Andrew J. Geyer, Joseph J. Pignatiello Jr., Eric Chicken Oct 2023

Lightning Forecast From Chaotic And Incomplete Time Series Using Wavelet De-Noising And Spatiotemporal Kriging, Jared K. Nystrom, Raymond Hill, Andrew J. Geyer, Joseph J. Pignatiello Jr., Eric Chicken

Faculty Publications

Purpose: Present a method to impute missing data from a chaotic time series, in this case lightning prediction data, and then use that completed dataset to create lightning prediction forecasts.

Design/Methodology/Approach: Using the technique of spatiotemporal kriging to estimate data that is autocorrelated but in space and time. Using the estimated data in an imputation methodology completes a dataset used in lighting prediction.

Findings: The techniques provided prove robust to the chaotic nature of the data, and the resulting time series displays evidence of smoothing while also preserving the signal of interest for lightning prediction.

Abstract © Emerald Publishing …


A Comparison Of Quaternion Neural Network Backpropagation Algorithms, Jeremiah Bill, Bruce A. Cox, Lance Champaign Jun 2023

A Comparison Of Quaternion Neural Network Backpropagation Algorithms, Jeremiah Bill, Bruce A. Cox, Lance Champaign

Faculty Publications

This research paper focuses on quaternion neural networks (QNNs) - a type of neural network wherein the weights, biases, and input values are all represented as quaternion numbers. Previous studies have shown that QNNs outperform real-valued neural networks in basic tasks and have potential in high-dimensional problem spaces. However, research on QNNs has been fragmented, with contributions from different mathematical and engineering domains leading to unintentional overlap in QNN literature. This work aims to unify existing research by evaluating four distinct QNN backpropagation algorithms, including the novel GHR-calculus backpropagation algorithm, and providing concise, scalable implementations of each algorithm using a …


Multicollinearity Applied Stepwise Stochastic Imputation: A Large Dataset Imputation Through Correlation‑Based Regression, Benjamin D. Leiby, Darryl K. Ahner Feb 2023

Multicollinearity Applied Stepwise Stochastic Imputation: A Large Dataset Imputation Through Correlation‑Based Regression, Benjamin D. Leiby, Darryl K. Ahner

Faculty Publications

This paper presents a stochastic imputation approach for large datasets using a correlation selection methodology when preferred commercial packages struggle to iterate due to numerical problems. A variable range-based guard rail modification is proposed that benefits the convergence rate of data elements while simultaneously providing increased confidence in the plausibility of the imputations. A large country conflict dataset motivates the search to impute missing values well over a common threshold of 20% missingness. The Multicollinearity Applied Stepwise Stochastic imputation methodology (MASS-impute) capitalizes on correlation between variables within the dataset and uses model residuals to estimate unknown values. Examination of the …


Using Conformal Win Probability To Predict The Winners Of The Canceled 2020 Ncaa Basketball Tournaments, Chancellor Johnstone, Dan Nettleton Jan 2023

Using Conformal Win Probability To Predict The Winners Of The Canceled 2020 Ncaa Basketball Tournaments, Chancellor Johnstone, Dan Nettleton

Faculty Publications

The COVID-19 pandemic was responsible for the cancellation of both the men’s and women’s 2020 National Collegiate Athletic Association (NCAA) Division I basketball tournaments. Starting from the point when the Division I tournaments and unfinished conference tournaments were canceled, we deliver closed-form probabilities for each team of making the Division I tournaments, had they not been canceled, under a simplified method for tournament selection. We also determine probabilities of a team winning March Madness, given a tournament bracket. Our calculations make use of conformal win probabilities derived from conformal predictive distributions. We compare these conformal win probabilities to those generated …


Improving Data-Driven Infrastructure Degradation Forecast Skill With Stepwise Asset Condition Prediction Models, Kurt R. Lamm, Justin D. Delorit, Michael N. Grussing, Steven J. Schuldt Aug 2022

Improving Data-Driven Infrastructure Degradation Forecast Skill With Stepwise Asset Condition Prediction Models, Kurt R. Lamm, Justin D. Delorit, Michael N. Grussing, Steven J. Schuldt

Faculty Publications

Organizations with large facility and infrastructure portfolios have used asset management databases for over ten years to collect and standardize asset condition data. Decision makers use these data to predict asset degradation and expected service life, enabling prioritized maintenance, repair, and renovation actions that reduce asset life-cycle costs and achieve organizational objectives. However, these asset condition forecasts are calculated using standardized, self-correcting distribution models that rely on poorly-fit, continuous functions. This research presents four stepwise asset condition forecast models that utilize historical asset inspection data to improve prediction accuracy: (1) Slope, (2) Weighted Slope, (3) Condition-Intelligent Weighted Slope, and (4) …


Pilot Development: An Empirical Mixed-Method Analysis, Jonathan Slottje, Jason Anderson, John M. Dickens, Adam D. Reiman Jun 2022

Pilot Development: An Empirical Mixed-Method Analysis, Jonathan Slottje, Jason Anderson, John M. Dickens, Adam D. Reiman

Faculty Publications

Purpose — Pilot upgrade training is critical to aircraft and passenger safety. This study aims to identify variances in the US Air Force C-130J pilot upgrade training based on geographic location and provide a model to enhance policy that will impact future pilot training efforts that lower cost and increase operator quality and proficiency.
Design/methodology/approach This research employed a mixed-method approach. First, the authors collected data and analyzed 90 C-130J pilots' aviation records and then contextualized this analysis with interviews of experts. Finally, the authors present a modified version of Six Sigma's define–measure–analyze–improve–control (DMAIC) that identifies and reduces the …


Forecasting Country Conflict Using Statistical Learning Methods, Sarah Neumann, Darryl K. Ahner, Raymond R. Hill Jun 2022

Forecasting Country Conflict Using Statistical Learning Methods, Sarah Neumann, Darryl K. Ahner, Raymond R. Hill

Faculty Publications

Purpose — This paper aims to examine whether changing the clustering of countries within a United States Combatant Command (COCOM) area of responsibility promotes improved forecasting of conflict. Design/methodology/approach — In this paper statistical learning methods are used to create new country clusters that are then used in a comparative analysis of model-based conflict prediction. Findings — In this study a reorganization of the countries assigned to specific areas of responsibility are shown to provide improvements in the ability of models to predict conflict. Research limitations/implications — The study is based on actual historical data and is purely data driven. …


Transportation Service Level Impact On Aircraft Availability, Vincent Mclean, Adam D. Reiman Jun 2022

Transportation Service Level Impact On Aircraft Availability, Vincent Mclean, Adam D. Reiman

Faculty Publications

Purpose — Aircraft fail to meet mission capable rate goals due to a lack of supply of aircraft parts in inventory where the aircraft breaks. This triggers an order at the repair location. To maximize mission capable rate, the time from order to delivery needs to be minimized. The purpose of this research is to examine the case of three airfields for the order to delivery time of mission critical aircraft parts for a specific aircraft type. Design/methodology/approach — This research captured data from three information systems to assess the order fulfillment process. The data were analyzed to determine the …


Investigation And Statistical Modeling Of The Mechanical Properties Of Additively Manufactured Lattices, Derek G. Spear, Anthony N. Palazotto Jul 2021

Investigation And Statistical Modeling Of The Mechanical Properties Of Additively Manufactured Lattices, Derek G. Spear, Anthony N. Palazotto

Faculty Publications

This paper describes the background, test methodology, and experimental results associated with the testing and analysis of quasi-static compression testing of additively manufactured open-cell lattice structures. The study aims to examine the effect of lattice topology, cell size, cell density, and surface thickness on the mechanical properties of lattice structures. Three lattice designs were chosen, the Diamond, I-WP, and Primitive Triply Periodic Minimal Surfaces (TPMSs). Uniaxial compression tests were conducted for every combination of the three lattice designs, three cell sizes, three cell densities, and three surface thicknesses. In order to perform an efficient experiment and gain the most information …


Cost Estimating Using A New Learning Curve Theory For Non-Constant Production Rates, Dakotah Hogan, John J. Elshaw, Clay M. Koschnick, Jonathan D. Ritschel, Adedeji B. Badiru, Shawn M. Valentine Oct 2020

Cost Estimating Using A New Learning Curve Theory For Non-Constant Production Rates, Dakotah Hogan, John J. Elshaw, Clay M. Koschnick, Jonathan D. Ritschel, Adedeji B. Badiru, Shawn M. Valentine

Faculty Publications

Traditional learning curve theory assumes a constant learning rate regardless of the number of units produced. However, a collection of theoretical and empirical evidence indicates that learning rates decrease as more units are produced in some cases. These diminishing learning rates cause traditional learning curves to underestimate required resources, potentially resulting in cost overruns. A diminishing learning rate model, namely Boone’s learning curve, was recently developed to model this phenomenon. This research confirms that Boone’s learning curve systematically reduced error in modeling observed learning curves using production data from 169 Department of Defense end-items. However, high amounts of variability in …


Applications Of Portable Libs For Actinide Analysis, Ashwin P. Rao, John D. Auxier Ii, Dung Vu, Michael B. Shattan Jul 2020

Applications Of Portable Libs For Actinide Analysis, Ashwin P. Rao, John D. Auxier Ii, Dung Vu, Michael B. Shattan

Faculty Publications

A portable LIBS device was used for rapid elemental impurity analysis of plutonium alloys. This device demonstrates the potential for fast, accurate in-situ chemical analysis and could significantly reduce the fabrication time of plutonium alloys.


Generating Electromagnetic Schell-Model Sources Using Complex Screens With Spatially Varying Auto- And Cross-Correlation Functions, Milo W. Hyde Iv Sep 2019

Generating Electromagnetic Schell-Model Sources Using Complex Screens With Spatially Varying Auto- And Cross-Correlation Functions, Milo W. Hyde Iv

Faculty Publications

We present a method to generate any physically realizable electromagnetic Schell-model source. Our technique can be directly implemented on existing vector-beam generators that utilize spatial light modulators for coherence control, beam shaping, and relative phasing. This work significantly extends published research on the subject, where control over the partially coherent source’s cross-spectral density matrix was limited. We begin by presenting the statistical optics theory necessary to derive and implement our method. We then apply our technique, both analytically and in simulation, to produce two electromagnetic Schell-model sources from the literature. We demonstrate control over the full cross-spectral density matrices of …


Ergodicity For The 3d Stochastic Navier-Stokes Equations Perturbed By Lévy Noise, Manil T. Mohan, K. Sakthivel, Sivaguru S. Sritharan May 2019

Ergodicity For The 3d Stochastic Navier-Stokes Equations Perturbed By Lévy Noise, Manil T. Mohan, K. Sakthivel, Sivaguru S. Sritharan

Faculty Publications

In this work we construct a Markov family of martingale solutions for 3D stochastic Navier–Stokes equations (SNSE) perturbed by Lévy noise with periodic boundary conditions. Using the Kolmogorov equations of integrodifferential type associated with the SNSE perturbed by Lévy noise, we construct a transition semigroup and establish the existence of a unique invariant measure. We also show that it is ergodic and strongly mixing.
Abstract © Wiley.


Toxoplasma Gondii Igg Associations With Sleepwake Problems, Sleep Duration And Timing, Celine C. Corona, Ma Zhang, Abhishek Wadhawan, Melanie L. Daue, Maureen W. Groer, Aline Dagang, Christopher A. Lowry, Kathleen A. Ryan, Andrew J. Hoisington, John W. Stiller, Dietmar Fuchs, Braxton D. Mitchell, Teodor T. Postolache Feb 2019

Toxoplasma Gondii Igg Associations With Sleepwake Problems, Sleep Duration And Timing, Celine C. Corona, Ma Zhang, Abhishek Wadhawan, Melanie L. Daue, Maureen W. Groer, Aline Dagang, Christopher A. Lowry, Kathleen A. Ryan, Andrew J. Hoisington, John W. Stiller, Dietmar Fuchs, Braxton D. Mitchell, Teodor T. Postolache

Faculty Publications

Background: Evidence links Toxoplasma gondii (T. gondii), a neurotropic parasite, with schizophrenia, mood disorders and suicidal behavior, all of which are associated and exacerbated by disrupted sleep. Moreover, low-grade immune activation and dopaminergic overstimulation, which are consequences of T. gondii infection, could alter sleep patterns and duration. Methods: Sleep data on 833 Amish participants [mean age (SD) = 44.28 (16.99) years; 59.06% women] were obtained via self-reported questionnaires that assessed sleep problems, duration and timing. T. gondii IgG was measured with ELISA. Data were analyzed using multivariable logistic regressions and linear mixed models, with adjustment for age, sex and family …


Improved N-Dimensional Data Visualization From Hyper-Radial Values, Todd J. Paciencia, Trevor J. Bihl, Kenneth W. Bauer Jan 2019

Improved N-Dimensional Data Visualization From Hyper-Radial Values, Todd J. Paciencia, Trevor J. Bihl, Kenneth W. Bauer

Faculty Publications

Higher-dimensional data, which is becoming common in many disciplines due to big data problems, are inherently difficult to visualize in a meaningful way. While many visualization methods exist, they are often difficult to interpret, involve multiple plots and overlaid points, or require simultaneous interpretations. This research adapts and extends hyper-radial visualization, a technique used to visualize Pareto fronts in multi-objective optimizations, to become an n-dimensional visualization tool. Hyper-radial visualization is seen to offer many advantages by presenting a low-dimensionality representation of data through easily understood calculations. First, hyper-radial visualization is extended for use with general multivariate data. Second, a method …


Monte Carlo Simulations Of Three-Dimensional Electromagnetic Gaussian Schell-Model Sources, Milo W. Hyde Iv, Santasri Bose-Pillai, Olga Korotkova Feb 2018

Monte Carlo Simulations Of Three-Dimensional Electromagnetic Gaussian Schell-Model Sources, Milo W. Hyde Iv, Santasri Bose-Pillai, Olga Korotkova

Faculty Publications

This article presents a method to simulate a three-dimensional (3D) electromagnetic Gaussian-Schell model (EGSM) source with desired characteristics. Using the complex screen method, originally developed for the synthesis of two-dimensional stochastic electromagnetic fields, a set of equations is derived which relate the desired 3D source characteristics to those of the statistics of the random complex screen. From these equations and the 3D EGSM source realizability conditions, a single criterion is derived, which when satisfied guarantees both the realizability and simulatability of the desired 3D EGSM source. Lastly, a 3D EGSM source, with specified properties, is simulated; the Monte Carlo simulation …


Unmasking Cost Growth Behavior: A Longitudinal Study, Cory N. D'Amico, Edward D. White, Jonathan D. Ritschel, Scott R. Kozlak Jan 2018

Unmasking Cost Growth Behavior: A Longitudinal Study, Cory N. D'Amico, Edward D. White, Jonathan D. Ritschel, Scott R. Kozlak

Faculty Publications

This article examines how cost growth factors (CGF) change over a program’s acquisition life cycle for 36 Department of Defense aircraft programs. Starting from Milestone B, the authors examine CGFs at five gateways: Critical Design Review, First Flight (FF), the end of Developmental Test and Evaluation (DT&E), Initial Operational Capability, and Full Operational Capability. Each CGF is assigned a color rating based upon the program’s cost growth: Green (low), Amber (moderate), or Red (high). Significant findings include dependencies among similar CGF color ratings and cost growth occurring primarily between FF and the end of DT&E during a program’s life cycle.


Anomalydetection: Implementation Of Augmented Network Log Anomaly Detection Procedures, Robert J. Gutierrez, Bradley C. Boehmke, Kenneth W. Bauer, Cade M. Saie, Trevor J. Bihl Aug 2017

Anomalydetection: Implementation Of Augmented Network Log Anomaly Detection Procedures, Robert J. Gutierrez, Bradley C. Boehmke, Kenneth W. Bauer, Cade M. Saie, Trevor J. Bihl

Faculty Publications

As the number of cyber-attacks continues to grow on a daily basis, so does the delay in threat detection. For instance, in 2015, the Office of Personnel Management discovered that approximately 21.5 million individual records of Federal employees and contractors had been stolen. On average, the time between an attack and its discovery is more than 200 days. In the case of the OPM breach, the attack had been going on for almost a year. Currently, cyber analysts inspect numerous potential incidents on a daily basis, but have neither the time nor the resources available to perform such a task. …


A Recommendation System For Meta-Modeling: A Meta-Learning Based Approach, Can Cui, Mengqi Hu, Jeffery D. Weir, Teresa Wu Jan 2016

A Recommendation System For Meta-Modeling: A Meta-Learning Based Approach, Can Cui, Mengqi Hu, Jeffery D. Weir, Teresa Wu

Faculty Publications

Various meta-modeling techniques have been developed to replace computationally expensive simulation models. The performance of these meta-modeling techniques on different models is varied which makes existing model selection/recommendation approaches (e.g., trial-and-error, ensemble) problematic. To address these research gaps, we propose a general meta-modeling recommendation system using meta-learning which can automate the meta-modeling recommendation process by intelligently adapting the learning bias to problem characterizations. The proposed intelligent recommendation system includes four modules: (1) problem module, (2) meta-feature module which includes a comprehensive set of meta-features to characterize the geometrical properties of problems, (3) meta-learner module which compares the performance of instance-based …


Unequal A Priori Probability Multiple Hypothesis Testing In Space Domain Awareness With The Space Surveillance Telescope, Tyler J. Hardy, Stephen C. Cain, Travis F. Blake Jan 2016

Unequal A Priori Probability Multiple Hypothesis Testing In Space Domain Awareness With The Space Surveillance Telescope, Tyler J. Hardy, Stephen C. Cain, Travis F. Blake

Faculty Publications

This paper investigates the ability to improve Space Domain Awareness (SDA) by increasing the number of detectable Resident Space Objects (RSOs) from space surveillance sensors. With matched filter based techniques, the expected impulse response, or Point Spread Function (PSF), is compared against the received data. In the situation where the images are spatially undersampled, the modeled PSF may not match the received data if the RSO does not fall in the center of the pixel. This aliasing can be accounted for with a Multiple Hypothesis Test (MHT). Previously, proposed MHTs have implemented a test with an equal a priori prior …


Taming The Hurricane Of Acquisition Cost Growth – Or At Least Predicting It, Allen J. Deneve, Erin T. Ryan, Jonathan D. Ritschel, Christine M. Schubert Kabban Jan 2015

Taming The Hurricane Of Acquisition Cost Growth – Or At Least Predicting It, Allen J. Deneve, Erin T. Ryan, Jonathan D. Ritschel, Christine M. Schubert Kabban

Faculty Publications

Cost growth is a persistent adversary to efficient budgeting in the Department of Defense. Despite myriad studies to uncover causes of this cost growth, few of the proposed remedies have made a meaningful impact. A key reason may be that DoD cost estimates are formulated using the highly unrealistic assumption that a program’s current baseline characteristics will not change in the future. Using a weather forecasting analogy, the authors demonstrate how a statistical approach may be used to account for these inevitable baseline changes and identify related cost growth trends. These trends are then used to reduce the error in …