Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Institution
-
- City University of New York (CUNY) (2)
- South Dakota State University (2)
- Southern Methodist University (2)
- The University of Akron (2)
- University of Central Florida (2)
-
- University of New Hampshire (2)
- California State University, San Bernardino (1)
- Chapman University (1)
- Chinese Academy of Sciences (1)
- Clemson University (1)
- Dartmouth College (1)
- Georgia Southern University (1)
- Ohio Northern University (1)
- Old Dominion University (1)
- Purdue University (1)
- The University of Southern Mississippi (1)
- University of Alkafeel (1)
- University of Arkansas, Fayetteville (1)
- University of Denver (1)
- Utah State University (1)
- Wilfrid Laurier University (1)
- Keyword
-
- Decision Trees (3)
- Machine Learning (2)
- Statistics (2)
- Texas (2)
- ADNI database (1)
-
- Alzheimer’s disease (1)
- Analytics (1)
- Artificial Intelligence (1)
- Asymptotic Theorems (1)
- Bayesian Inferences (1)
- Bayesian inference (1)
- Bellman equation (1)
- Bender Element (1)
- Best-worst scaling (1)
- Bootstrapping (1)
- Brain data (1)
- Brain graph (1)
- CART (1)
- CHAID (1)
- CO-CUB model (1)
- Categorical Data Analysis (1)
- Central Limit Theorem (1)
- Consumer behavior (1)
- Correlated Random Variables (1)
- Cost-risk analysis (1)
- Coupled neuronalglial models (1)
- Covid-19 (1)
- Cross Validation (1)
- DCM (1)
- DOCCS (1)
- Publication
-
- Data Science and Data Mining (2)
- Dissertations, Theses, and Capstone Projects (2)
- Electronic Theses and Dissertations (2)
- Honors Theses and Capstones (2)
- SDSU Data Science Symposium (2)
-
- Williams Honors College, Honors Research Projects (2)
- Al-Bahir Journal for Engineering and Pure Sciences (1)
- All Dissertations (1)
- All Graduate Reports and Creative Projects, Fall 2023 to Present (1)
- Bulletin of Chinese Academy of Sciences (Chinese Version) (1)
- Computational and Data Sciences (PhD) Dissertations (1)
- Dartmouth College Master’s Theses (1)
- Electronic Theses, Projects, and Dissertations (1)
- Industrial Engineering Undergraduate Honors Theses (1)
- Master's Theses (1)
- Mathematics & Statistics Faculty Publications (1)
- ONU Student Research Colloquium (1)
- SMU Data Science Review (1)
- SMU Journal of Undergraduate Research (1)
- The Journal of Purdue Undergraduate Research (1)
- Theses and Dissertations (Comprehensive) (1)
- Publication Type
- File Type
Articles 1 - 27 of 27
Full-Text Articles in Physical Sciences and Mathematics
Reevaluating Texas Energy Market Forecasts In The Wake Of Recent Extreme Weather Events, Robert A. Derner, Richard W. Butler Ii, Alexandria Neff, Adam R. Ruthford
Reevaluating Texas Energy Market Forecasts In The Wake Of Recent Extreme Weather Events, Robert A. Derner, Richard W. Butler Ii, Alexandria Neff, Adam R. Ruthford
SMU Data Science Review
This paper provides updated forecasts of energy demand in Texas and recognizes the impact of sustainable energy. It is important that the forecasts of the adoption of sustainable energy are reexamined after Winter Storm Uri crippled the Texas power grid and left many without power. This storm highlighted the issues the Texas power grid had and has continued to struggle with in supplying the state with energy. This paper will offer an overview of the relevant literature on the adoption of sustainable energy and relevant events that have occurred in the state of Texas that will give the reader the …
Context Aware Music Recommendation And Playlist Generation, Elias Mann
Context Aware Music Recommendation And Playlist Generation, Elias Mann
SMU Journal of Undergraduate Research
There are many reasons people listen to music, and the type of music is largely determined by what the listener may be doing while they listen. For example, one may listen to one type of music while commuting, another while exercising, and yet another while relaxing. Without access to the physiological state of the user, current music recommendation methods rely on collaborative filtering - recommending music based on what other similar users listen to - and content based filtering - recommending songs based on their similarities to songs the user already prefers. With the rise in popularity of smart devices …
A Novel Correction For The Multivariate Ljung-Box Test, Minhao Huang
A Novel Correction For The Multivariate Ljung-Box Test, Minhao Huang
Computational and Data Sciences (PhD) Dissertations
This research introduces an analytical improvement to the Multivariate Ljung-Box test that addresses significant deviations of the original test from the nominal Type I error rates under almost all scenarios. Prior attempts to mitigate this issue have been directed at modification of the test statistics or correction of the test distribution to achieve precise results in finite samples. In previous studies, focused on designing corrections to the univariate Ljung-Box, a method that specifically adjusts the test rejection region has been the most successful of attaining the best Type I error rates. We adopt the same approach for the more complex, …
Code For Care: Hypertension Prediction In Women Aged 18-39 Years, Kruti Sheth
Code For Care: Hypertension Prediction In Women Aged 18-39 Years, Kruti Sheth
Electronic Theses, Projects, and Dissertations
The longstanding prevalence of hypertension, often undiagnosed, poses significant risks of severe chronic and cardiovascular complications if left untreated. This study investigated the causes and underlying risks of hypertension in females aged between 18-39 years. The research questions were: (Q1.) What factors affect the occurrence of hypertension in females aged 18-39 years? (Q2.) What machine learning algorithms are suited for effectively predicting hypertension? (Q3.) How can SHAP values be leveraged to analyze the factors from model outputs? The findings are: (Q1.) Performing Feature selection using binary classification Logistic regression algorithm reveals an array of 30 most influential factors at an …
Using The History Of Statistics To Teach Introductory Statistics, Melissa Hansen
Using The History Of Statistics To Teach Introductory Statistics, Melissa Hansen
All Graduate Reports and Creative Projects, Fall 2023 to Present
While often taught in high school and required as part of a college degree, statistics classes are sometimes viewed by students as an obstacle rather than a support for their overall goals. One way to increase student engagement in a statistics course is to use the history of statistics. Within the literature review, the advantages to using the history of statistics are discussed as well as the more extensive research on using the history of mathematics in mathematics courses. Included are instructional strategies for using the context around the development of mathematical ideas in math classrooms which can be extended …
Comparing North American Professional Sports League Season Formats Using Monte Carlo Simulation, Lathan Gregg
Comparing North American Professional Sports League Season Formats Using Monte Carlo Simulation, Lathan Gregg
Industrial Engineering Undergraduate Honors Theses
Each NFL, NBA, and MLB season consists of a regular season, in which teams play a set number of scheduled games and a playoff, in which qualifying teams compete for a championship. At the conclusion of each season, teams are ranked based on their performance throughout the season. This study aims to investigate the ability of each league's season format to accurately rank teams using Monte Carlo simulation. Matches between two teams are simulated by using the team’s assigned strength ranks to calculate a winning probability for each team. The winning probabilities are simulated with different skill values, dictating how …
Cost-Risk Analysis Of The Ercot Region Using Modern Portfolio Theory, Megan Sickinger
Cost-Risk Analysis Of The Ercot Region Using Modern Portfolio Theory, Megan Sickinger
Master's Theses
In this work, we study the use of modern portfolio theory in a cost-risk analysis of the Electric Reliability Council of Texas (ERCOT). Based upon the risk-return concepts of modern portfolio theory, we develop an n-asset minimization problem to create a risk-cost frontier of portfolios of technologies within the ERCOT electricity region. The levelized cost of electricity for each technology in the region is a step in evaluating the expected cost of the portfolio, and the historical data of cost factors estimate the variance of cost for each technology. In addition, there are several constraints in our minimization problem to …
Efficient Fully Bayesian Approaches To Brain Activity Mapping With Complex-Valued Fmri Data: Analysis Of Real And Imaginary Components In A Cartesian Model And Extension To Magnitude And Phase In A Polar Model, Zhengxin Wang
All Dissertations
Functional magnetic resonance imaging (fMRI) plays a crucial role in neuroimaging, enabling the exploration of brain activity through complex-valued signals. Traditional fMRI analyses have largely focused on magnitude information, often overlooking the potential insights offered by phase data, and therefore, lead to underutilization of available data and flawed statistical assumptions. This dissertation proposes two efficient, fully Bayesian approaches for the analysis of complex-valued functional magnetic resonance imaging (cv-fMRI) time series.
Chapter 2 introduces the model, referred to as CV-sSGLMM, using the real and imaginary components of cv-fMRI data and sparse spatial generalized linear mixed model prior. This model extends the …
Evaluating The Effect Of Skipping Ticagrelor Doses And Need For Bolus Doses Upon Treatment Resumption Through Population Pk/Pd Simulation, Hiroyoshi Matsui, Le Thien Truc Pham, Eyob D. Adane
Evaluating The Effect Of Skipping Ticagrelor Doses And Need For Bolus Doses Upon Treatment Resumption Through Population Pk/Pd Simulation, Hiroyoshi Matsui, Le Thien Truc Pham, Eyob D. Adane
ONU Student Research Colloquium
Ticagrelor (Brilinta (R)) is the first reversibly binding oral P2Y12 receptor antagonist. It is used, mostly in combination with aspirin, in patients with acute coronary syndromes to reduce thrombosis. The manufacturer of ticagrelor recommends discontinuing it at least 5 days before any surgery when possible. While the effect of dose interruptions on the risk of thrombosis is not directly studied, it is important to understand the impact of skipping doses on ticagrelor's PK/PD profile for clinical-decision making. The objectives of the current study were to simulate the impact of therapy interruption on the PK/PD of ticagrelor and examine the need …
Research On Chinese Data Sovereignty Policy Based On Lda Model And Policy Instruments, Han Qiao, Junru Xu
Research On Chinese Data Sovereignty Policy Based On Lda Model And Policy Instruments, Han Qiao, Junru Xu
Bulletin of Chinese Academy of Sciences (Chinese Version)
Data sovereignty has become an important component of national sovereignty in the dual context of the digital economy development and the overall national security concept. Major countries and regions are actively carrying out data sovereignty strategic deployment and engaging in fierce competition in data resources, data technology, and data rules. This work adopts the policy text analysis method to study China’s data sovereignty policy, and employs the LDA model and policy instruments to quantitatively analyze the process evolution and thematic characteristics of China’s data sovereignty policy. Drawing on these findings, this study comprehensively considers the global data sovereignty policy and …
Assessment Of Method Effects Of Keying And Wording In Instruments: A Mixed-Methods Explanatory Sequential Study, Lin Ma
Electronic Theses and Dissertations
This dissertation presents an innovative approach to examining the keying method, wording method, and construct validity on psychometric instruments. By employing a mixed methods explanatory sequential design, the effects of keying and wording in two psychometric assessments were examined and validated. Those two self-report psychometric assessments were the Effortful Control assessment (Ellis & Rothbart, 2001) and the Grit assessment (Duckworth & Quinn, 2009). Moreover, the quantitative phase utilized structural equation modeling to analyze 2,104 students’ responses and assess the construct of keying and wording. Various hypothetical models were investigated and evaluated. The reliability of each construct in each method was …
Predicting Crop Yield Using Remote Sensing Data, Mary Row, Jung-Han Kimn, Hossein Moradi
Predicting Crop Yield Using Remote Sensing Data, Mary Row, Jung-Han Kimn, Hossein Moradi
SDSU Data Science Symposium
Accurate crop yield predictions can help farmers make adjustments or changes in their farming practices to optimize their harvest. Remote sensing data is an inexpensive approach to collecting massive amounts of data that could be utilized for predicting crop yield. This study employed linear regression and spatial linear models were used to predict soybean yield with data from Landsat 8 OLI. Each model was built using only spectral bands of the satellite, only vegetation indices, and both spectral bands and vegetation indices. All analysis was based on data collected from two fields in South Dakota from the 2019 and 2021 …
Session 6: The Size-Biased Lognormal Mixture With The Entropy Regularized Algorithm, Tatjana Miljkovic, Taehan Bae
Session 6: The Size-Biased Lognormal Mixture With The Entropy Regularized Algorithm, Tatjana Miljkovic, Taehan Bae
SDSU Data Science Symposium
A size-biased left-truncated Lognormal (SB-ltLN) mixture is proposed as a robust alternative to the Erlang mixture for modeling left-truncated insurance losses with a heavy tail. The weak denseness property of the weighted Lognormal mixture is studied along with the tail behavior. Explicit analytical solutions are derived for moments and Tail Value at Risk based on the proposed model. An extension of the regularized expectation–maximization (REM) algorithm with Shannon's entropy weights (ewREM) is introduced for parameter estimation and variability assessment. The left-truncated internal fraud data set from the Operational Riskdata eXchange is used to illustrate applications of the proposed model. Finally, …
Modeling Of Covid-19 Clinical Outcomes In Mexico: An Analysis Of Demographic, Clinical, And Chronic Disease Factors, Livia Clarete
Modeling Of Covid-19 Clinical Outcomes In Mexico: An Analysis Of Demographic, Clinical, And Chronic Disease Factors, Livia Clarete
Dissertations, Theses, and Capstone Projects
This study explores COVID-19 clinical outcomes in Mexico, focusing on demographic, clinical, and chronic disease variables to develop predictive models. In the binary classification task, the Ada Boost Classifier distinguishes survivors from non-survivors, with age, sex, ethnicity, and chronic medical conditions influencing outcomes. In multiclass classification, the Gradient Boosting Classifier categorizes patients into outcome groups.
Demographic variables, especially age, are crucial for predicting COVID-19 outcomes for both the binary and multiclass classification tasks. Clinical information about previous conditions, including chronic diseases, also holds relevance, especially diabetes, immunocompromise, and cardiovascular diseases. These insights inform public health measures and healthcare strategies, emphasizing …
Making Sense Of Making Parole In New York, Alexandra Mcglinchy
Making Sense Of Making Parole In New York, Alexandra Mcglinchy
Dissertations, Theses, and Capstone Projects
For many individuals incarcerated in New York, the initial step toward freedom begins with an interview with the Board of Parole. This process, however, is frequently a complex and challenging one, characterized by repeated denials and extended incarcerations. The disparity in outcomes – where one individual may receive over 20 denials and another is granted parole on their first attempt – highlights the ambiguity and inconsistency in the parole decision-making process. This project aims to clarify the factors that influence parole decisions by concentrating on measurable variables. These include age, race, duration of sentence served, proportion of sentence served, type …
Model Selection Through Cross-Validation For Supervised Learning Tasks With Manifold Data, Derek Brown
Model Selection Through Cross-Validation For Supervised Learning Tasks With Manifold Data, Derek Brown
The Journal of Purdue Undergraduate Research
No abstract provided.
Sensitivity Analysis Of Prior Distributions In Regression Model Estimation, Ayoade I Adewole, Oluwatoyin K. Bodunwa
Sensitivity Analysis Of Prior Distributions In Regression Model Estimation, Ayoade I Adewole, Oluwatoyin K. Bodunwa
Al-Bahir Journal for Engineering and Pure Sciences
Bayesian inferences depend solely on specification and accuracy of likelihoods and prior distributions of the observed data. The research delved into Bayesian estimation method of regression models to reduce the impact of some of the problems, posed by convectional method of estimating regression models, such as handling complex models, availability of small sample sizes and inclusion of background information in the estimation procedure. Posterior distributions are based on prior distributions and the data accuracy, which is the fundamental principles of Bayesian statistics to produce accurate final model estimates. Sensitivity analysis is an essential part of mathematical model validation in obtaining …
Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe
Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe
Data Science and Data Mining
This project estimates a regression model to predict the superconducting critical temperature based on variables extracted from the superconductor’s chemical formula. The regression model along with the stepwise variable selection gives a reasonable and good predictive model with a lower prediction error (MSE). Variables extracted based on atomic radius, valence, atomic mass and thermal conductivity appeared to have the most contribution to the predictive model.
A Bayesian Inversion For Emissions And Export Productivity Across The End-Cretaceous Boundary, Alexander A. Cox
A Bayesian Inversion For Emissions And Export Productivity Across The End-Cretaceous Boundary, Alexander A. Cox
Dartmouth College Master’s Theses
The end-Cretaceous mass extinction was marked by both the Chicxulub impact and the ongoing emplacement of the Deccan Traps flood basalt province. Both of these events perturbed the environment by the emission of climate-active volatiles, primarily CO2 and SO2. To understand the mechanism of extinction, we must disentangle the timing, duration, and intensity of volcanic and meteoritic environmental forcings. In this thesis, we used a parallel Markov chain Monte Carlo approach to invert for the aforementioned volatile emissions, export productivity, and remineralization from 67 to 65 million years ago using the LOSCAR (Long-term Ocean-atmosphere-Sediment CArbon cycle Reservoir) model. The parallel …
Imputation Strategies For Different Categories Of Missing Data, Karthik Chalumuri
Imputation Strategies For Different Categories Of Missing Data, Karthik Chalumuri
Honors Theses and Capstones
Addressing missing data in research is crucial for ensuring the reliability and validity of study findings, yet it remains a significant challenge. This study investigates the impact of missing data on research outcomes and explores the underutilization of existing tools for managing missingness, potentially leading to gaps in critical information with tangible implications for decision-making processes (Dziura et al.).
Focusing on the different categories of missing data—Missing Completely At Random (MCAR), Missing At Random (MAR), and Missing Not At Random (MNAR)—this research examines various imputation strategies tailored to each category. Specifically, we compare the efficacy of several model-based imputation methods, …
Defensive Impact Wins: Developing A New Method To Rate Individual Defense In Nba Games, Dylan J. Stiles
Defensive Impact Wins: Developing A New Method To Rate Individual Defense In Nba Games, Dylan J. Stiles
Honors Theses and Capstones
With the analytics revolution in sports in the past 20 years, it seems that everything that can be quantified is. In basketball though, trying to break the game down into a set of numbers comes with a unique problem. While we've come up with a good set of advanced numbers to measure offensive efficiency, defense is fundamentally harder to quantify. The game is played five on five, but it has often been popular or convenient to model defense as a set of five one on one games. As defenses became more complex into the 2010s, this methodology became more insignificant. …
Multiscale Modelling Of Brain Networks And The Analysis Of Dynamic Processes In Neurodegenerative Disorders, Hina Shaheen
Multiscale Modelling Of Brain Networks And The Analysis Of Dynamic Processes In Neurodegenerative Disorders, Hina Shaheen
Theses and Dissertations (Comprehensive)
The complex nature of the human brain, with its intricate organic structure and multiscale spatio-temporal characteristics ranging from synapses to the entire brain, presents a major obstacle in brain modelling. Capturing this complexity poses a significant challenge for researchers. The complex interplay of coupled multiphysics and biochemical activities within this intricate system shapes the brain's capacity, functioning within a structure-function relationship that necessitates a specific mathematical framework. Advanced mathematical modelling approaches that incorporate the coupling of brain networks and the analysis of dynamic processes are essential for advancing therapeutic strategies aimed at treating neurodegenerative diseases (NDDs), which afflict millions of …
Utility In Time Description In Priority Best-Worst Discrete Choice Models: An Empirical Evaluation Using Flynn's Data, Sasanka Adikari, Norou Diawara
Utility In Time Description In Priority Best-Worst Discrete Choice Models: An Empirical Evaluation Using Flynn's Data, Sasanka Adikari, Norou Diawara
Mathematics & Statistics Faculty Publications
Discrete choice models (DCMs) are applied in many fields and in the statistical modelling of consumer behavior. This paper focuses on a form of choice experiment, best-worst scaling in discrete choice experiments (DCEs), and the transition probability of a choice of a consumer over time. The analysis was conducted by using simulated data (choice pairs) based on data from Flynn's (2007) 'Quality of Life Experiment'. Most of the traditional approaches assume the choice alternatives are mutually exclusive over time, which is a questionable assumption. We introduced a new copula-based model (CO-CUB) for the transition probability, which can handle the dependent …
Machine Learning Approaches For Cyberbullying Detection, Roland Fiagbe
Machine Learning Approaches For Cyberbullying Detection, Roland Fiagbe
Data Science and Data Mining
Cyberbullying refers to the act of bullying using electronic means and the internet. In recent years, this act has been identifed to be a major problem among young people and even adults. It can negatively impact one’s emotions and lead to adverse outcomes like depression, anxiety, harassment, and suicide, among others. This has led to the need to employ machine learning techniques to automatically detect cyberbullying and prevent them on various social media platforms. In this study, we want to analyze the combination of some Natural Language Processing (NLP) algorithms (such as Bag-of-Words and TFIDF) with some popular machine learning …
Simulation Of Wave Propagation In Granular Particles Using A Discrete Element Model, Syed Tahmid Hussan
Simulation Of Wave Propagation In Granular Particles Using A Discrete Element Model, Syed Tahmid Hussan
Electronic Theses and Dissertations
The understanding of Bender Element mechanism and utilization of Particle Flow Code (PFC) to simulate the seismic wave behavior is important to test the dynamic behavior of soil particles. Both discrete and finite element methods can be used to simulate wave behavior. However, Discrete Element Method (DEM) is mostly suitable, as the micro scaled soil particle cannot be fully considered as continuous specimen like a piece of rod or aluminum. Recently DEM has been widely used to study mechanical properties of soils at particle level considering the particles as balls. This study represents a comparative analysis of Voigt and Best …
Statistical Modeling Of Bankruptcy Data, Andrew Elsfelder
Statistical Modeling Of Bankruptcy Data, Andrew Elsfelder
Williams Honors College, Honors Research Projects
My project uses a dataset of bankrupt and non-bankrupt companies in Taiwan from 1999 to 2009. This data was collected from the Taiwan Economic Journal. The statistical methods I used to model the data are CHAID, CART, and logistic regression. The models created are tools that can predict if a company is bankrupt, or not-bankrupt based on other data about the company. I created multiple models for each of the methods to find the best model for each method. I then analyzed the output from each method. Lastly, I determined which model was the best for this data based on …
Ensemble Classification: An Analysis Of The Random Forest Model, Jarod Korn
Ensemble Classification: An Analysis Of The Random Forest Model, Jarod Korn
Williams Honors College, Honors Research Projects
The random forest model proposed by Dr. Leo Breiman in 2001 is an ensemble machine learning method for classification prediction and regression. In the following paper, we will conduct an analysis on the random forest model with a focus on how the model works, how it is applied in software, and how it performs on a set of data. To fully understand the model, we will introduce the concept of decision trees, give a summary of the CART model, explain in detail how the random forest model operates, discuss how the model is implemented in software, demonstrate the model by …