Open Access. Powered by Scholars. Published by Universities.®
- Institution
-
- Technological University Dublin (8)
- Old Dominion University (6)
- Southern Methodist University (6)
- Stephen F. Austin State University (6)
- Western University (6)
-
- Claremont Colleges (5)
- City University of New York (CUNY) (4)
- University of Nebraska - Lincoln (4)
- California Polytechnic State University, San Luis Obispo (3)
- Georgia Southern University (3)
- Harrisburg University of Science and Technology (3)
- Illinois State University (3)
- Selected Works (3)
- University of Massachusetts Amherst (3)
- COBRA (2)
- East Tennessee State University (2)
- Embry-Riddle Aeronautical University (2)
- Louisiana Tech University (2)
- South Dakota State University (2)
- The University of Akron (2)
- University of Arkansas, Fayetteville (2)
- University of Central Florida (2)
- University of New Hampshire (2)
- University of New Mexico (2)
- University of New Orleans (2)
- University of Tennessee, Knoxville (2)
- Virginia Commonwealth University (2)
- Western Kentucky University (2)
- Wilfrid Laurier University (2)
- Bard College (1)
- Keyword
-
- Statistics (8)
- Archaeology (5)
- Probability (5)
- American Southeast (4)
- Caddo (4)
-
- Date Combination (4)
- Machine Learning (4)
- Radiocarbon (4)
- Summed Probability Distributions (4)
- Machine learning (3)
- Poisson (3)
- Regression (3)
- Sports (3)
- Baseball (2)
- Bayesian (2)
- Bayesian analysis (2)
- Boyne Valley (2)
- Classification (2)
- ETD (2)
- Energy (2)
- Mathematics (2)
- Passage tomb (2)
- Probabilistic (2)
- Random forest (2)
- Review (2)
- Stochastic analysis (2)
- Variable selection (2)
- "hot hand" (1)
- 2-prop Z-Test (1)
- ABX index (1)
- Publication Year
- Publication
-
- Doctoral Dissertations (7)
- Electronic Theses and Dissertations (7)
- Electronic Thesis and Dissertation Repository (6)
- CRHR: Archaeology (5)
- SMU Data Science Review (5)
-
- Articles (4)
- Byron E. Bell (3)
- Harrisburg University Research Symposium: Highlighting Research, Innovation, & Creativity (3)
- Journal of Humanistic Mathematics (3)
- Mathematics & Statistics Theses & Dissertations (3)
- Publications and Research (3)
- Annual Symposium on Biomathematics and Ecology Education and Research (2)
- CMC Senior Theses (2)
- Data Science and Data Mining (2)
- Graduate Theses and Dissertations (2)
- Masters Theses & Specialist Projects (2)
- SDSU Data Science Symposium (2)
- Statistics (2)
- Theses and Dissertations (2)
- Theses and Dissertations (Comprehensive) (2)
- University of New Orleans Theses and Dissertations (2)
- Williams Honors College, Honors Research Projects (2)
- Access*: Interdisciplinary Journal of Student Research and Scholarship (1)
- All Faculty Scholarship (1)
- Applications and Applied Mathematics: An International Journal (AAM) (1)
- Basic Science Engineering (1)
- Beyond: Undergraduate Research Journal (1)
- Biology and Medicine Through Mathematics Conference (1)
- Book/Book Chapter (1)
- Books/Book chapters (1)
- Publication Type
Articles 1 - 30 of 123
Full-Text Articles in Applied Statistics
Machine Learning Approaches For Cyberbullying Detection, Roland Fiagbe
Machine Learning Approaches For Cyberbullying Detection, Roland Fiagbe
Data Science and Data Mining
Cyberbullying refers to the act of bullying using electronic means and the internet. In recent years, this act has been identifed to be a major problem among young people and even adults. It can negatively impact one’s emotions and lead to adverse outcomes like depression, anxiety, harassment, and suicide, among others. This has led to the need to employ machine learning techniques to automatically detect cyberbullying and prevent them on various social media platforms. In this study, we want to analyze the combination of some Natural Language Processing (NLP) algorithms (such as Bag-of-Words and TFIDF) with some popular machine learning …
Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe
Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe
Data Science and Data Mining
This project estimates a regression model to predict the superconducting critical temperature based on variables extracted from the superconductor’s chemical formula. The regression model along with the stepwise variable selection gives a reasonable and good predictive model with a lower prediction error (MSE). Variables extracted based on atomic radius, valence, atomic mass and thermal conductivity appeared to have the most contribution to the predictive model.
Multiscale Modelling Of Brain Networks And The Analysis Of Dynamic Processes In Neurodegenerative Disorders, Hina Shaheen
Multiscale Modelling Of Brain Networks And The Analysis Of Dynamic Processes In Neurodegenerative Disorders, Hina Shaheen
Theses and Dissertations (Comprehensive)
The complex nature of the human brain, with its intricate organic structure and multiscale spatio-temporal characteristics ranging from synapses to the entire brain, presents a major obstacle in brain modelling. Capturing this complexity poses a significant challenge for researchers. The complex interplay of coupled multiphysics and biochemical activities within this intricate system shapes the brain's capacity, functioning within a structure-function relationship that necessitates a specific mathematical framework. Advanced mathematical modelling approaches that incorporate the coupling of brain networks and the analysis of dynamic processes are essential for advancing therapeutic strategies aimed at treating neurodegenerative diseases (NDDs), which afflict millions of …
Reducing Food Scarcity: The Benefits Of Urban Farming, S.A. Claudell, Emilio Mejia
Reducing Food Scarcity: The Benefits Of Urban Farming, S.A. Claudell, Emilio Mejia
Journal of Nonprofit Innovation
Urban farming can enhance the lives of communities and help reduce food scarcity. This paper presents a conceptual prototype of an efficient urban farming community that can be scaled for a single apartment building or an entire community across all global geoeconomics regions, including densely populated cities and rural, developing towns and communities. When deployed in coordination with smart crop choices, local farm support, and efficient transportation then the result isn’t just sustainability, but also increasing fresh produce accessibility, optimizing nutritional value, eliminating the use of ‘forever chemicals’, reducing transportation costs, and fostering global environmental benefits.
Imagine Doris, who is …
Ohio Recovery Housing: Resident Risk And Outcomes Assessment, Elyjiah Potter, Bivin Sadler
Ohio Recovery Housing: Resident Risk And Outcomes Assessment, Elyjiah Potter, Bivin Sadler
SMU Data Science Review
Addiction and substance abuse disorder is a significant problem in the United States. Over the past two decades, the United States has faced a boom in substance abuse, which has resulted in an increase in death and disruption of families across the nation. The State of Ohio has been particularly hard hit by the crisis, with overdose rates nearly doubling the national average. Established in the mid 1970’s Sober Living Housing is an alcohol and substance use recovery model emphasizing personal responsibility, sober living, and community support. This model has been adopted by the Ohio Recovery Housing organization, which seeks …
Exploration And Statistical Modeling Of Profit, Caleb Gibson
Exploration And Statistical Modeling Of Profit, Caleb Gibson
Undergraduate Honors Theses
For any company involved in sales, maximization of profit is the driving force that guides all decision-making. Many factors can influence how profitable a company can be, including external factors like changes in inflation or consumer demand or internal factors like pricing and product cost. Understanding specific trends in one's own internal data, a company can readily identify problem areas or potential growth opportunities to help increase profitability.
In this discussion, we use an extensive data set to examine how a company might analyze their own data to identify potential changes the company might investigate to drive better performance. Based …
Foundations Of Memory Capacity In Models Of Neural Cognition, Chandradeep Chowdhury
Foundations Of Memory Capacity In Models Of Neural Cognition, Chandradeep Chowdhury
Master's Theses
A central problem in neuroscience is to understand how memories are formed as a result of the activities of neurons. Valiant’s neuroidal model attempted to address this question by modeling the brain as a random graph and memories as subgraphs within that graph. However the question of memory capacity within that model has not been explored: how many memories can the brain hold? Valiant introduced the concept of interference between memories as the defining factor for capacity; excessive interference signals the model has reached capacity. Since then, exploration of capacity has been limited, but recent investigations have delved into the …
Reu-Deim Classification Of Hispanic Voters In Hispanic Groups Using Name And Zip Code Data In Palm Beach, Florida, Kamila Soto-Ortiz
Reu-Deim Classification Of Hispanic Voters In Hispanic Groups Using Name And Zip Code Data In Palm Beach, Florida, Kamila Soto-Ortiz
Beyond: Undergraduate Research Journal
When it comes to registering to vote, Hispanic voters can only register as “Hispanic” in the “Race/Ethnicity” category, causing difficulties when analyzing voting trends amongst the Hispanic community. Upon the recent idea that not all Hispanic Groups vote the same, the goal is to create a model that can possibly identify a voter’s Hispanic Group with the information provided on the public Florida voter file. This is accomplished using name and zip code data for all voters in Palm Beach, Florida. This paper will explore the model implemented, its findings and limitations. Palm Beach, Florida, is met with low confidence …
The "Benfordness" Of Bach Music, Chadrack Bantange, Darby Burgett, Luke Haws, Sybil Prince Nelson
The "Benfordness" Of Bach Music, Chadrack Bantange, Darby Burgett, Luke Haws, Sybil Prince Nelson
Journal of Humanistic Mathematics
In this paper we analyze the distribution of musical note frequencies in Hertz to see whether they follow the logarithmic Benford distribution. Our results show that the music of Johann Sebastian Bach and Johann Christian Bach is Benford distributed while the computer-generated music is not. We also find that computer-generated music is statistically less Benford distributed than human- composed music.
Development And Testing Of A New Method For Velocity-Selecting White Dwarfs From Gaia By Galactic Population, Joseph Hammill
Development And Testing Of A New Method For Velocity-Selecting White Dwarfs From Gaia By Galactic Population, Joseph Hammill
Doctoral Dissertations and Master's Theses
The detailed processes by which spiral galaxies form remains an open question in modern cosmology. Observations of the current configuration of spiral galaxies including the Milky Way reveal thin and thick disk and halo populations which must all be accounted for in formation theories and likely have distinct ages. Using the Milky Way as an example to probe this question, we are studying the formation history of these structures.
This work details our approach to age-dating the galaxy, velocity-selecting targets from a sample of white dwarfs from the Gaia DR3 catalog that have also been age-analysed using BASE-9. BASE-9 uses …
(R2027) A New Class Of Pareto Distribution: Estimation And Its Applications, Anitta Susan Aniyan, Dais George
(R2027) A New Class Of Pareto Distribution: Estimation And Its Applications, Anitta Susan Aniyan, Dais George
Applications and Applied Mathematics: An International Journal (AAM)
The classical Pareto distribution is a positively skewed and right heavy-tailed lifetime distribution having a lot many applications in various fields of science and social science. In this work, via logarithmic trans-formed method, a new three parameter lifetime distribution, an extension of classical Pareto distribution is generated. The different structural properties of the new distribution are studied. The model parameters are estimated by the method of maximum likelihood and Bayesian procedure. When all the three parameters of the distribution are unknown, the Bayes estimators cannot be obtained in a closed form and hence, the Lindley’s approximation under squared error loss …
Finite Mixture Modeling For Hierarchically Structured Data With Application To Keystroke Dynamics, Andrew Simpson, Semhar Michael
Finite Mixture Modeling For Hierarchically Structured Data With Application To Keystroke Dynamics, Andrew Simpson, Semhar Michael
SDSU Data Science Symposium
Keystroke dynamics has been used to both authenticate users of computer systems and detect unauthorized users who attempt to access the system. Monitoring keystroke dynamics adds another level to computer security as passwords are often compromised. Keystrokes can also be continuously monitored long after a password has been entered and the user is accessing the system for added security. Many of the current methods that have been proposed are supervised methods in that they assume that the true user of each keystroke is known apriori. This is not always true for example with businesses and government agencies which have internal …
Application Of Probabilistic Ranking Systems On Women’S Junior Division Beach Volleyball, Cameron Stewart, Michael Mazel, Bivin Sadler
Application Of Probabilistic Ranking Systems On Women’S Junior Division Beach Volleyball, Cameron Stewart, Michael Mazel, Bivin Sadler
SMU Data Science Review
Women’s beach volleyball is one of the fastest growing collegiate sports today. The increase in popularity has come with an increase in valuable scholarship opportunities across the country. With thousands of athletes to sort through, college scouts depend on websites that aggregate tournament results and rank players nationally. This project partnered with the company Volleyball Life, who is the current market leader in the ranking space of junior beach volleyball players. Utilizing the tournament information provided by Volleyball Life, this study explored replacements to the current ranking systems, which are designed to aggregate player points from recent tournament placements. Three …
Defining Viable Solar Resource Locations In The Southeast United States Using The Satellite-Based Glass Product, Jolie Kavanagh
Defining Viable Solar Resource Locations In The Southeast United States Using The Satellite-Based Glass Product, Jolie Kavanagh
Theses and Dissertations
This research uses satellite data and the moment statistics to determine if solar farms can be placed in the Southeast US. From 2001-2019, the data are analyzed in reference to the Southwest US, where solar farms are located. The clean energy need is becoming more common; therefore, more locations than arid environments must be observed. The Southeast US is the main location of interest due to the warm, moist environment throughout the year. This research uses the Global Land Surface Satellite (GLASS) photosynthetically active radiation product (PAR) to determine viable locations for solar panels. A probability density function (PDF) along …
Rewriting The Rules For Diagnostics: Implications Of Probability And Measure Theory For Sars-Cov-2 Testing, Paul Patrone, Anthony Kearsley
Rewriting The Rules For Diagnostics: Implications Of Probability And Measure Theory For Sars-Cov-2 Testing, Paul Patrone, Anthony Kearsley
Biology and Medicine Through Mathematics Conference
No abstract provided.
Early-Warning Alert Systems For Financial-Instability Detection: An Hmm-Driven Approach, Xing Gu
Early-Warning Alert Systems For Financial-Instability Detection: An Hmm-Driven Approach, Xing Gu
Electronic Thesis and Dissertation Repository
Regulators’ early intervention is crucial when the financial system is experiencing difficulties. Financial stability must be preserved to avert banks’ bailouts, which hugely drain government's financial resources. Detecting in advance periods of financial crisis entails the development and customisation of accurate and robust quantitative techniques. The goal of this thesis is to construct automated systems via the interplay of various mathematical and statistical methodologies to signal financial instability episodes in the near-term horizon. These signal alerts could provide regulatory bodies with the capacity to initiate appropriate response that will thwart or at least minimise the occurrence of a financial crisis. …
Percentage Of Yellow Sour Patch Kids, Easton Kratzer, Sarah Baxter
Percentage Of Yellow Sour Patch Kids, Easton Kratzer, Sarah Baxter
Harrisburg University Research Symposium: Highlighting Research, Innovation, & Creativity
After being given the Qualitative Research Project in Introduction to Statistics, I came up with the question asking what percentage of Sour Patch Kids are yellow. This resulted in me going through an entire bag and counting the amount of every color to figure out the percentages. (Class Project)
Preference For Violence By Gender, Ayrton Hall, Christian Watson, Akeisha Belgrave
Preference For Violence By Gender, Ayrton Hall, Christian Watson, Akeisha Belgrave
Harrisburg University Research Symposium: Highlighting Research, Innovation, & Creativity
In our survey we asked students of HU what their favorite video games were as well as their favorite genre and most played game. We then analyzed the data to see how gender affects preference for violent games. (Class Project)
Favorite Programming Language Among Students, Anwar Jawhar, Akeisha Belgrave
Favorite Programming Language Among Students, Anwar Jawhar, Akeisha Belgrave
Harrisburg University Research Symposium: Highlighting Research, Innovation, & Creativity
This project involves understanding the favorite programming language among students. I hypothesize that the favorite programming language will be Python. (Class Project)
Intra-Hour Solar Forecasting Using Cloud Dynamics Features Extracted From Ground-Based Infrared Sky Images, Guillermo Terrén-Serrano
Intra-Hour Solar Forecasting Using Cloud Dynamics Features Extracted From Ground-Based Infrared Sky Images, Guillermo Terrén-Serrano
Electrical and Computer Engineering ETDs
Due to the increasing use of photovoltaic systems, power grids are vulnerable to the projection of shadows from moving clouds. An intra-hour solar forecast provides power grids with the capability of automatically controlling the dispatch of energy, reducing the additional cost for a guaranteed, reliable supply of energy (i.e., energy storage). This dissertation introduces a novel sky imager consisting of a long-wave radiometric infrared camera and a visible light camera with a fisheye lens. The imager is mounted on a solar tracker to maintain the Sun in the center of the images throughout the day, reducing the scattering effect produced …
Session 5: Equipment Finance Credit Risk Modeling - A Case Study In Creative Model Development & Nimble Data Engineering, Edward Krueger, Landon Thompson, Josh Moore
Session 5: Equipment Finance Credit Risk Modeling - A Case Study In Creative Model Development & Nimble Data Engineering, Edward Krueger, Landon Thompson, Josh Moore
SDSU Data Science Symposium
This presentation will focus first on providing an overview of Channel and the Risk Analytics team that performed this case study. Given that context, we’ll then dive into our approach for building the modeling development data set, techniques and tools used to develop and implement the model into a production environment, and some of the challenges faced upon launch. Then, the presentation will pivot to the data engineering pipeline. During this portion, we will explore the application process and what happens to the data we collect. This will include how we extract & store the data along with how it …
Searching For Anomalous Extensive Air Showers Using The Pierre Auger Observatory Fluorescence Detector, Andrew Puyleart
Searching For Anomalous Extensive Air Showers Using The Pierre Auger Observatory Fluorescence Detector, Andrew Puyleart
Dissertations, Master's Theses and Master's Reports
Anomalous extensive air showers have yet to be detected by cosmic ray observatories. Fluorescence detectors provide a way to view the air showers created by cosmic rays with primary energies reaching up to hundreds of EeV . The resulting air showers produced by these highly energetic collisions can contain features that deviate from average air showers. Detection of these anomalous events may provide information into unknown regions of particle physics, and place constraints on cross-sectional interaction lengths of protons. In this dissertation, I propose measurements of extensive air shower profiles that are used in a machine learning pipeline to distinguish …
Analysis Of Minor League Rule Changes Effect On Stolen Bases, Zachary Houghtaling
Analysis Of Minor League Rule Changes Effect On Stolen Bases, Zachary Houghtaling
Williams Honors College, Honors Research Projects
This study uses various statistical analyses to evaluate the justification of rule changes for Major League Baseball that were implemented within the Minor Leagues during the 2021 minor league season. The primary focus of the study is predicting how some of these Minor League rule changes could affect the stolen base success rate and the number of attempts per game within the Major Leagues. A survey was conducted to evaluate how fans feel about stolen bases within the current game and if rules should be altered to increase the number of stolen bases that occur. Additionally, recorded Major and Minor …
Lake Huron Shoreline Analysis, Shubham Satish Nandanwar
Lake Huron Shoreline Analysis, Shubham Satish Nandanwar
Theses and Dissertations (Comprehensive)
Lake Huron is a popular tourist destination and is home to several businesses and residents. Since the shoreline is dynamic and is subject to change over the years due to several factors such as a change in water level, soil type, human encroachment, etc., these locations tend to encounter floods due to increased water levels and wind speed. This causes erosion and loss to the properties along the shoreline.
This study is based on two areas of interest named Pinery Provincial Park and Sauble Beach which are located on the shoreline of Lake Huron where Pinery Provincial Park is a …
Identification And Characterization Of Forest Fire Risk Zones Leveraging Machine Learning Methods, Joshua Balson, Matt Chinchilla, Cam Lu, Jeff Washburn, Nibhrat Lohia
Identification And Characterization Of Forest Fire Risk Zones Leveraging Machine Learning Methods, Joshua Balson, Matt Chinchilla, Cam Lu, Jeff Washburn, Nibhrat Lohia
SMU Data Science Review
Across the United States, record numbers of wildfires are observed costing billions of dollars in property damage, polluting the environment, and putting lives at risk. The ability of emergency management professionals, city planners, and private entities such as insurance companies to determine if an area is at higher risk of a fire breaking out has never been greater. This paper proposes a novel methodology for identifying and characterizing zones with increased risks of forest fires. Methods involving machine learning techniques use the widely available and recorded data, thus making it possible to implement the tool quickly.
Risk-Based Machine Learning Approaches For Probabilistic Transient Stability, Umair Shahzad
Risk-Based Machine Learning Approaches For Probabilistic Transient Stability, Umair Shahzad
Department of Electrical and Computer Engineering: Dissertations, Theses, and Student Research
Power systems are getting more complex than ever and are consequently operating close to their limit of stability. Moreover, with the increasing demand of renewable wind generation, and the requirement to maintain a secure power system, the importance of transient stability cannot be overestimated. Considering its significance in power system security, it is important to propose a different approach for enhancing the transient stability, considering uncertainties. Current deterministic industry practices of transient stability assessment ignore the probabilistic nature of variables (fault type, fault location, fault clearing time, etc.). These approaches typically provide a conservative criterion and can result in expensive …
A Computational Study Of Genotype-Phenotype Mutation Patterns, Kamaludin Dingle, Omar Tawfik, Ahmed Aldabagh
A Computational Study Of Genotype-Phenotype Mutation Patterns, Kamaludin Dingle, Omar Tawfik, Ahmed Aldabagh
Undergraduate Research Symposium
Understanding properties of genotype-phenotype maps is important for understanding biology and evolution. In this project we make a computational study of the statistical effects of genetic mutations, in particular computing the probabilities of each phenotype transitioning to any other phenotype. We also investigate the importance of the local phenotypic environment of a single genotype, and its role in determining mutation transition probabilities. We use HP protein folding, RNA structure, and a simplified GRN matrix model to study these questions.
Identification And Characterization Of De Novo Germline Tp53 Mutation Carriers In Families With Li-Fraumeni Syndrome, Carlos C. Vera Recio
Identification And Characterization Of De Novo Germline Tp53 Mutation Carriers In Families With Li-Fraumeni Syndrome, Carlos C. Vera Recio
Dissertations & Theses (Open Access)
Li-Fraumeni syndrome (LFS) is an inherited cancer syndrome caused by a deleterious mutation in TP53. An estimated 48% of LFS patients present due to a de novo mutation (DNM) in TP53. The knowledge of DNM status, DNM or familial mutation (FM), of an LFS patient requires genetic testing of both parents which is often inaccessible, making de novo LFS patients difficult to study. Famdenovo.TP53 is a Mendelian Risk prediction model used to predict DNM status of TP53 mutation carriers based on the cancer-family history and several input genetic parameters, including disease-gene penetrance. The good predictive performance of Famdenovo.TP53 was demonstrated …
Bayesian Variable Selection Strategies In Longitudinal Mixture Models And Categorical Regression Problems., Md Nazir Uddin
Bayesian Variable Selection Strategies In Longitudinal Mixture Models And Categorical Regression Problems., Md Nazir Uddin
Electronic Theses and Dissertations
In this work, we seek to develop a variable screening and selection method for Bayesian mixture models with longitudinal data. To develop this method, we consider data from the Health and Retirement Survey (HRS) conducted by University of Michigan. Considering yearly out-of-pocket expenditures as the longitudinal response variable, we consider a Bayesian mixture model with $K$ components. The data consist of a large collection of demographic, financial, and health-related baseline characteristics, and we wish to find a subset of these that impact cluster membership. An initial mixture model without any cluster-level predictors is fit to the data through an MCMC …
Application Of Randomness In Finance, Jose Sanchez, Daanial Ahmad, Satyanand Singh
Application Of Randomness In Finance, Jose Sanchez, Daanial Ahmad, Satyanand Singh
Publications and Research
Brownian Motion which is also considered to be a Wiener process and can be thought of as a random walk. In our project we had briefly discussed the fluctuations of financial indices and related it to Brownian Motion and the modeling of Stock prices.