Open Access. Powered by Scholars. Published by Universities.®

Other Statistics and Probability Commons

Open Access. Powered by Scholars. Published by Universities.®

404 Full-Text Articles 888 Authors 250,540 Downloads 91 Institutions

All Articles in Other Statistics and Probability

Faceted Search

404 full-text articles. Page 8 of 17.

Functional Random Forest With Applications In Dose-Response Predictions, Raziur Rahman, Saugato Rahman Dhruba, Souparno Ghosh, Ranadip Pal 2019 Texas Tech University

Functional Random Forest With Applications In Dose-Response Predictions, Raziur Rahman, Saugato Rahman Dhruba, Souparno Ghosh, Ranadip Pal

Department of Statistics: Faculty Publications

Drug sensitivity prediction for individual tumors is a significant challenge in personalized medicine. Current modeling approaches consider prediction of a single metric of the drug response curve such as AUC or IC50. However, the single summary metric of a dose-response curve fails to provide the entire drug sensitivity profile which can be used to design the optimal dose for a patient. In this article, we assess the problem of predicting the complete dose-response curve based on genetic characterizations. We propose an enhancement to the popular ensemble-based Random Forests approach that can directly predict the entire functional profile of …


Health Risk Tolerance As A Key Determinant Of (Un)Willingness To Behavior Change: Conceptualization And Scale Development, Hyoyeun Jun, Yan Jin 2019 University of Georgia

Health Risk Tolerance As A Key Determinant Of (Un)Willingness To Behavior Change: Conceptualization And Scale Development, Hyoyeun Jun, Yan Jin

International Crisis and Risk Communication Conference

After the study of testing determinants of risk tolerance affecting information sharing, this study was conducted as a second step to actually develop the scale for risk tolerance. Firstly, this study followed qualitative steps, such as in-depth interview and focus group, to capture how public describes the situation when they are tolerating the risk, when they knew what the recommended behavior is to relieve the risk. Secondly, this study collected 1000 U.S. public sample for the survey questionnaire that are the items generated from the qualitative steps.


Cost-Effective Surveillance For Infectious Diseases Through Specimen Pooling And Multiplex Assays, Christopher Bilder, Joshua Tebbs, Christopher McMahan 2019 University of Nebraska - Lincoln

Cost-Effective Surveillance For Infectious Diseases Through Specimen Pooling And Multiplex Assays, Christopher Bilder, Joshua Tebbs, Christopher Mcmahan

Department of Statistics: Faculty Publications

To develop specimen pooling algorithms that reduce the number of tests needed to test individuals for infectious diseases with multiplex assays.


Genomic Prediction Using Canopy Coverage Image And Genotypic Information In Soybean Via A Hybrid Model, Reka Howard, Diego Jarquin 2019 University of Nebraska-Lincoln

Genomic Prediction Using Canopy Coverage Image And Genotypic Information In Soybean Via A Hybrid Model, Reka Howard, Diego Jarquin

Department of Statistics: Faculty Publications

Prediction techniques are important in plant breeding as they provide a tool for selection that is more efficient and economical than traditional phenotypic and pedigree based selection. The conventional genomic prediction models include molecular marker information to predict the phenotype. With the development of new phenomics techniques we have the opportunity to collect image data on the plants, and extend the traditional genomic prediction models where we incorporate diverse set of information collected on the plants. In our research, we developed a hybrid matrix model that incorporates molecular marker and canopy coverage information as a weighted linear combination to predict …


Post-Er Stress Biogenesis Of Golgi Is Governed By Giantin, Cole P. Frisbie, Alexander Y. Lushnikov, Alexey V. Krasnoslobodtsev, Jean-Jack Riethoven, Jennifer L. Clarke, Elena I. Stepchenkova, Armen Petrosyan 2019 University of Nebraska Medical Center

Post-Er Stress Biogenesis Of Golgi Is Governed By Giantin, Cole P. Frisbie, Alexander Y. Lushnikov, Alexey V. Krasnoslobodtsev, Jean-Jack Riethoven, Jennifer L. Clarke, Elena I. Stepchenkova, Armen Petrosyan

Department of Statistics: Faculty Publications

Background: The Golgi apparatus undergoes disorganization in response to stress, but it is able to restore compact and perinuclear structure under recovery. This self-organization mechanism is significant for cellular homeostasis, but remains mostly elusive, as does the role of giantin, the largest Golgi matrix dimeric protein. Methods: In HeLa and different prostate cancer cells, we used the model of cellular stress induced by Brefeldin A (BFA). The conformational structure of giantin was assessed by proximity ligation assay and atomic force microscopy. The post-BFA distribution of Golgi resident enzymes was examined by 3D SIM high-resolution microscopy. Results: We detected that giantin …


Recursive Model For Dose-Time Responses In Pharmacological Studies, Saugato Rahman Dhruba, Aminur Rahman, Raziur Rahman, Souparno Ghosh, Ranadip Pal 2019 Texas Tech University

Recursive Model For Dose-Time Responses In Pharmacological Studies, Saugato Rahman Dhruba, Aminur Rahman, Raziur Rahman, Souparno Ghosh, Ranadip Pal

Department of Statistics: Faculty Publications

Background: Clinical studies often track dose-response curves of subjects over time. One can easily model the dose-response curve at each time point with Hill equation, but such a model fails to capture the temporal evolution of the curves. On the other hand, one can use Gompertz equation to model the temporal behaviors at each dose without capturing the evolution of time curves across dosage

Results: In this article, we propose a parametric model for dose-time responses that follows Gompertz law in time and Hill equation across dose approximately. We derive a recursion relation for dose-response curves over time capturing the …


Using Neural Networks To Classify Discrete Circular Probability Distributions, Madelyn Gaumer 2019 Claremont Colleges

Using Neural Networks To Classify Discrete Circular Probability Distributions, Madelyn Gaumer

HMC Senior Theses

Given the rise in the application of neural networks to all sorts of interesting problems, it seems natural to apply them to statistical tests. This senior thesis studies whether neural networks built to classify discrete circular probability distributions can outperform a class of well-known statistical tests for uniformity for discrete circular data that includes the Rayleigh Test1, the Watson Test2, and the Ajne Test3. Each neural network used is relatively small with no more than 3 layers: an input layer taking in discrete data sets on a circle, a hidden layer, and an output …


Design Of Experiment And Analysis Techniques For Fuel Consumption Data Using Heavy-Duty Diesel Vehicles And On-Road Testing, Sarah Ann Mills 2019 West Virginia University

Design Of Experiment And Analysis Techniques For Fuel Consumption Data Using Heavy-Duty Diesel Vehicles And On-Road Testing, Sarah Ann Mills

Graduate Theses, Dissertations, and Problem Reports

Chassis dynamometer and on-road testing are usually employed to test vehicle operation. Testing on a chassis dynamometer reduces data variability compared to on-road testing due to the controlled environment but it does not account for other important variables that affects real-world vehicle operation. This study used on-road testing to investigate the differences between two test fuels under real-world conditions. Three heavy-duty diesel vehicles were driven on different routes for a period of three months. Each vehicle was instrumented with flow meters to gather fuel consumption data, which was then compared to the fuel rate broadcasted by the engine control unit …


Regression Tree Construction For Reinforcement Learning Problems With A General Action Space, Anthony S. Bush Jr 2019 Georgia Southern University

Regression Tree Construction For Reinforcement Learning Problems With A General Action Space, Anthony S. Bush Jr

Electronic Theses and Dissertations

Part of the implementation of Reinforcement Learning is constructing a regression of values against states and actions and using that regression model to optimize over actions for a given state. One such common regression technique is that of a decision tree; or in the case of continuous input, a regression tree. In such a case, we fix the states and optimize over actions; however, standard regression trees do not easily optimize over a subset of the input variables\cite{Card1993}. The technique we propose in this thesis is a hybrid of regression trees and kernel regression. First, a regression tree splits over …


The Dark Sky Character Of Archaeological Landscapes: Cultural Meaning And Conservation Strategies, Frank Prendergast 2019 Technological University Dublin

The Dark Sky Character Of Archaeological Landscapes: Cultural Meaning And Conservation Strategies, Frank Prendergast

Book/Book Chapter

This paper presents the first ever study of light pollution at selected Irish prehistoric archaeological landscapes. The concepts of cosmology and landscape are first briefly described and followed by a summary of early human settlement of the island. Building on this, the extant corpus of early prehistoric megalithic burial tombs is illustrated to show their contrasting distribution patterns and typology. Analysis of tomb locations using nearest-neighbour statistical methods reveals evidence of intentional clustering. Further geo-statistical analysis identifies the geographical locations and the density ranking of these nucleated clusters - a feature especially evident in the passage tomb tradition on this …


Global Warming Statistical Analysis, Jared Skinner 2019 The University of Akron

Global Warming Statistical Analysis, Jared Skinner

Williams Honors College, Honors Research Projects

This paper will investigate global warming and its effects on natural disasters. I will review the historic movements of climate change and activism, as well as the current discussions surrounding global warming. Secondly, I will examine various datasets, paying attention to the severity and frequency of specific natural disasters. I will then touch briefly on the topic of catastrophe modeling as it relates to the increased risk and losses associated with the discussed natural disasters and how those put the problem of global warming in a framework which financial and government institutions can grasp. I will also be analyzing economic …


Rfviz: An Interactive Visualization Package For Random Forests In R, Christopher Beckett 2018 Utah State University

Rfviz: An Interactive Visualization Package For Random Forests In R, Christopher Beckett

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

Random forests are very popular tools for predictive analysis and data science. They work for both classification (where there is a categorical response variable) and regression (where the response is continuous). Random forests provide proximities, and both local and global measures of variable importance. However, these quantities require special tools to be effectively used to interpret the forest. Rfviz is a sophisticated interactive visualization package and toolkit in R, specially designed for interpreting the results of a random forest in a user-friendly way. Rfviz uses a recently developed R package (loon) from the Comprehensive R Archive Network (CRAN) to create …


Season-Ahead Forecasting Of Water Storage And Irrigation Requirements – An Application To The Southwest Monsoon In India, Arun Ravindranath, Naresh Devineni, Upmanu Lall, Paulina Concha Larrauri 2018 CUNY City College

Season-Ahead Forecasting Of Water Storage And Irrigation Requirements – An Application To The Southwest Monsoon In India, Arun Ravindranath, Naresh Devineni, Upmanu Lall, Paulina Concha Larrauri

Publications and Research

Water risk management is a ubiquitous challenge faced by stakeholders in the water or agricultural sector. We present a methodological framework for forecasting water storage requirements and present an application of this methodology to risk assessment in India. The application focused on forecasting crop water stress for potatoes grown during the monsoon season in the Satara district of Maharashtra. Pre-season large-scale climate predictors used to forecast water stress were selected based on an exhaustive search method that evaluates for highest ranked probability skill score and lowest root-mean-squared error in a leave-one-out cross-validation mode. Adaptive forecasts were made in the years …


Yelp’S Review Filtering Algorithm, Yao Yao, Ivelin Angelov, Jack Rasmus-Vorrath, Mooyoung Lee, Daniel W. Engels 2018 Southern Methodist University

Yelp’S Review Filtering Algorithm, Yao Yao, Ivelin Angelov, Jack Rasmus-Vorrath, Mooyoung Lee, Daniel W. Engels

SMU Data Science Review

In this paper, we present an analysis of features influencing Yelp's proprietary review filtering algorithm. Classifying or misclassifying reviews as recommended or non-recommended affects average ratings, consumer decisions, and ultimately, business revenue. Our analysis involves systematically sampling and scraping Yelp restaurant reviews. Features are extracted from review metadata and engineered from metrics and scores generated using text classifiers and sentiment analysis. The coefficients of a multivariate logistic regression model were interpreted as quantifications of the relative importance of features in classifying reviews as recommended or non-recommended. The model classified review recommendations with an accuracy of 78%. We found that reviews …


Quantitative Jeopardy Feud, Jonathan M. Gallimore 2018 Embry-Riddle Aeronautical University

Quantitative Jeopardy Feud, Jonathan M. Gallimore

MSF 600 PR - Gallimore - Fall 2018

This activity - Quantitative Jeopardy Feud - is a method for using a game as a final exam.


Secondary Data Analysis Project, Jonathan M. Gallimore 2018 Embry-Riddle Aeronautical University

Secondary Data Analysis Project, Jonathan M. Gallimore

SF 420 PR - Gallimore - Fall 2018

This activity is designed to give students an opportunity to apply what they have learned in statistics to a real dataset.

This activity will help students apply what they have learned in statistics to real world data and answer their own research questions. Students will also practice reporting their results in a paper using APA format.


Goalie Analytics: Statistical Evaluation Of Context-Specific Goalie Performance Measures In The National Hockey League, Marc Naples, Logan Gage, Amy Nussbaum 2018 Southern Methodist University

Goalie Analytics: Statistical Evaluation Of Context-Specific Goalie Performance Measures In The National Hockey League, Marc Naples, Logan Gage, Amy Nussbaum

SMU Data Science Review

In this paper, we attempt to improve upon the classic formulation of save percentage in the NHL by controlling the context of the shots and use alternative measures than save percentage. In particular, we find save percentage to be both a weakly repeatable skill and predictor of future performance, and we seek other goalie performance calculations that are more robust. To do so, we use three primary tests to test intra-season consistency, intra-season predictability, and inter-season consistency, and extend the analysis to disentangle team effects on goalie statistics. We find that there are multiple ways to improve upon classic save …


Evaluation Of Using The Bootstrap Procedure To Estimate The Population Variance, Nghia Trong Nguyen 2018 Stephen F Austin State University

Evaluation Of Using The Bootstrap Procedure To Estimate The Population Variance, Nghia Trong Nguyen

Electronic Theses and Dissertations

The bootstrap procedure is widely used in nonparametric statistics to generate an empirical sampling distribution from a given sample data set for a statistic of interest. Generally, the results are good for location parameters such as population mean, median, and even for estimating a population correlation. However, the results for a population variance, which is a spread parameter, are not as good due to the resampling nature of the bootstrap method. Bootstrap samples are constructed using sampling with replacement; consequently, groups of observations with zero variance manifest in these samples. As a result, a bootstrap variance estimator will carry a …


Standard And Anomalous Wave Transport Inside Random Media, Xujun Ma 2018 The Graduate Center, City University of New York

Standard And Anomalous Wave Transport Inside Random Media, Xujun Ma

Dissertations, Theses, and Capstone Projects

This thesis is a study of wave transport inside random media using random matrix theory. Anderson localization plays a central role in wave transport in random media. As a consequence of destructive interference in multiple scattering, the wave function decays exponentially inside random systems. Anderson localization is a wave effect that applies to both classical waves and quantum waves. Random matrix theory has been successfully applied to study the statistical properties of transport and localization of waves. Particularly, the solution of the Dorokhov-Mello-Pereyra-Kumar (DMPK) equation gives the distribution of transmission.

For wave transport in standard one dimensional random systems in …


Initial Evidence Of Construct Validity Of Data From A Self-Assessment Instrument Of Technological Pedagogical Content Knowledge (Tpack) In 2-Year Public College Faculty In Texas, Kristin C. Scott 2018 University of Texas at Tyler

Initial Evidence Of Construct Validity Of Data From A Self-Assessment Instrument Of Technological Pedagogical Content Knowledge (Tpack) In 2-Year Public College Faculty In Texas, Kristin C. Scott

Human Resource Development Theses and Dissertations

Technological pedagogical content knowledge (TPACK) has been studied in K-12 faculty in the U.S. and around the world using survey methodology. Very few studies of TPACK in post-secondary faculty have been conducted and no peer-reviewed studies in U.S. post-secondary faculty have been published to date. The present study is the first reliability and validity of data from a TPACK survey to be conducted with a large sample of U.S. post-secondary faculty. The professorate of 2-year public college faculty in Texas will help their institutions meet the goals of the state’s higher education strategic plan, 60x30TX. In order to do …


Digital Commons powered by bepress