Open Access. Powered by Scholars. Published by Universities.®

Other Statistics and Probability Commons

Open Access. Powered by Scholars. Published by Universities.®

410 Full-Text Articles 911 Authors 250,540 Downloads 91 Institutions

All Articles in Other Statistics and Probability

Faceted Search

410 full-text articles. Page 8 of 17.

Probabilistic Modeling Of Personalized Drug Combinations From Integrated Chemical Screen And Molecular Data In Sarcoma, Noah E. Berlow, Rishi Rikhi, Mathew Geltzeiler, Jinu Abraham, Matthew N. Svalina, Lara E. Davis, Erin Wise, Maria Mancini, Jonathan Noujaim, Atiya Mansoor, Michael J. Quist, Kevin L. Matlock, Martin W. Goros, Brian S. Hernandez, Yee C. Doung, Khin Thway, Tomohide Tsukahara, Jun Nishio, Elaine T. Huang, Susan Airhart, Carol J. Bult, Regina Gandour-Edwards, Robert G. Maki, Robin L. Jones, Joel E. Michalek, Milan Milovancev, Souparno Ghosh, Ranadip Pal, Charles Keller 2019 Children's Cancer Therapy Development Institut

Probabilistic Modeling Of Personalized Drug Combinations From Integrated Chemical Screen And Molecular Data In Sarcoma, Noah E. Berlow, Rishi Rikhi, Mathew Geltzeiler, Jinu Abraham, Matthew N. Svalina, Lara E. Davis, Erin Wise, Maria Mancini, Jonathan Noujaim, Atiya Mansoor, Michael J. Quist, Kevin L. Matlock, Martin W. Goros, Brian S. Hernandez, Yee C. Doung, Khin Thway, Tomohide Tsukahara, Jun Nishio, Elaine T. Huang, Susan Airhart, Carol J. Bult, Regina Gandour-Edwards, Robert G. Maki, Robin L. Jones, Joel E. Michalek, Milan Milovancev, Souparno Ghosh, Ranadip Pal, Charles Keller

Department of Statistics: Faculty Publications

Background: Cancer patients with advanced disease routinely exhaust available clinical regimens and lack actionable genomic medicine results, leaving a large patient population without effective treatments options when their disease inevitably progresses. To address the unmet clinical need for evidence-based therapy assignment when standard clinical approaches have failed, we have developed a probabilistic computational modeling approach which integrates molecular sequencing data with functional assay data to develop patient-specific combination cancer treatments. Methods: Tissue taken from a murine model of alveolar rhabdomyosarcoma was used to perform single agent drug screening and DNA/RNA sequencing experiments; results integrated via our computational modeling approach identified …


Cs + Sociology: Using Big Data To Identify And Understand Educational Inequality In America (1), Joseph Cleary, Elin Waring 2019 CUNY Lehman College

Cs + Sociology: Using Big Data To Identify And Understand Educational Inequality In America (1), Joseph Cleary, Elin Waring

Open Educational Resources

This is the first of two lessons/labs for teaching and learning of computer science and sociology. Either and be used on their own or they can be used in sequence, in which case this should be used first.

Students will develop CS skills and behaviors including but not limited to: learning what an API is, learning how to access and utilize data on an API, and developing their R coding skills and knowledge. Students will also learn basic, but important, sociological principles such as how poverty is related to educational opportunities in America. Although prior knowledge of CS and sociology …


Development Of A School Boredom Proneness Scale For Children, Taylor Carrington 2019 James Madison University

Development Of A School Boredom Proneness Scale For Children, Taylor Carrington

Educational Specialist, 2009-2019

One common phrase heard from students is, “I’m bored.” However, there is no real understanding of what this actually means. In this study, elementary-age students were asked to respond to a newly developed School Boredom Proneness Scale (SBPS) including questions relating to a five-factor model of boredom. Students were also asked to rate how often they become bored at school and how bored they seem compared to classmates. In addition to student responses, parents and teachers were asked to rate how bored they thought the student was, and teachers were additionally asked to rate students’ level of work completion. The …


Valuation And Risk Management Of Some Longevity And P&C Insurance Products, Yixing Zhao 2019 The University of Western Ontario

Valuation And Risk Management Of Some Longevity And P&C Insurance Products, Yixing Zhao

Electronic Thesis and Dissertation Repository

Numerous insurance products linked to risky assets have emerged rapidly in the last couple of decades. These products have option-embedded features and typically involve at least two risk factors, namely interest and mortality risks. The need for models to capture risk factors' behaviours accurately is enormous and critical for insurance companies. The primary objective of this thesis is to develop pricing and hedging frameworks for option-embedded longevity products addressing correlated risk factors. Various methods are employed to facilitate the computation of prices and risk measures of longevity products including those with maturity benefits. Furthermore, in order to be prepared for …


Unified Methods For Feature Selection In Large-Scale Genomic Studies With Censored Survival Outcomes, Lauren Spirko-Burns, Karthik Devarajan 2019 Temple University

Unified Methods For Feature Selection In Large-Scale Genomic Studies With Censored Survival Outcomes, Lauren Spirko-Burns, Karthik Devarajan

COBRA Preprint Series

One of the major goals in large-scale genomic studies is to identify genes with a prognostic impact on time-to-event outcomes which provide insight into the disease's process. With rapid developments in high-throughput genomic technologies in the past two decades, the scientific community is able to monitor the expression levels of tens of thousands of genes and proteins resulting in enormous data sets where the number of genomic features is far greater than the number of subjects. Methods based on univariate Cox regression are often used to select genomic features related to survival outcome; however, the Cox model assumes proportional hazards …


Informative Group Testing For Multiplex Assays, Christopher R. Bilder, Joshua M. Tebbs, Christopher S. McMahan 2019 University of Nebraska - Lincoln

Informative Group Testing For Multiplex Assays, Christopher R. Bilder, Joshua M. Tebbs, Christopher S. Mcmahan

Department of Statistics: Faculty Publications

Infectious disease testing frequently takes advantage of two tools–group testing and multiplex assays–to make testing timely and cost effective. Until the work of Tebbs et al. (2013) and Hou et al. (2017), there was no research available to understand how best to apply these tools simultaneously. This recent work focused on applications where each individual is considered to be identical in terms of the probability of disease. However, risk-factor information, such as past behavior and presence of symptoms, is very often available on each individual to allow one to estimate individual-specific probabilities. The purpose of our paper is to propose …


Functional Random Forest With Applications In Dose-Response Predictions, Raziur Rahman, Saugato Rahman Dhruba, Souparno Ghosh, Ranadip Pal 2019 Texas Tech University

Functional Random Forest With Applications In Dose-Response Predictions, Raziur Rahman, Saugato Rahman Dhruba, Souparno Ghosh, Ranadip Pal

Department of Statistics: Faculty Publications

Drug sensitivity prediction for individual tumors is a significant challenge in personalized medicine. Current modeling approaches consider prediction of a single metric of the drug response curve such as AUC or IC50. However, the single summary metric of a dose-response curve fails to provide the entire drug sensitivity profile which can be used to design the optimal dose for a patient. In this article, we assess the problem of predicting the complete dose-response curve based on genetic characterizations. We propose an enhancement to the popular ensemble-based Random Forests approach that can directly predict the entire functional profile of …


Health Risk Tolerance As A Key Determinant Of (Un)Willingness To Behavior Change: Conceptualization And Scale Development, Hyoyeun Jun, Yan Jin 2019 University of Georgia

Health Risk Tolerance As A Key Determinant Of (Un)Willingness To Behavior Change: Conceptualization And Scale Development, Hyoyeun Jun, Yan Jin

International Crisis and Risk Communication Conference

After the study of testing determinants of risk tolerance affecting information sharing, this study was conducted as a second step to actually develop the scale for risk tolerance. Firstly, this study followed qualitative steps, such as in-depth interview and focus group, to capture how public describes the situation when they are tolerating the risk, when they knew what the recommended behavior is to relieve the risk. Secondly, this study collected 1000 U.S. public sample for the survey questionnaire that are the items generated from the qualitative steps.


Cost-Effective Surveillance For Infectious Diseases Through Specimen Pooling And Multiplex Assays, Christopher Bilder, Joshua Tebbs, Christopher McMahan 2019 University of Nebraska - Lincoln

Cost-Effective Surveillance For Infectious Diseases Through Specimen Pooling And Multiplex Assays, Christopher Bilder, Joshua Tebbs, Christopher Mcmahan

Department of Statistics: Faculty Publications

To develop specimen pooling algorithms that reduce the number of tests needed to test individuals for infectious diseases with multiplex assays.


Genomic Prediction Using Canopy Coverage Image And Genotypic Information In Soybean Via A Hybrid Model, Reka Howard, Diego Jarquin 2019 University of Nebraska-Lincoln

Genomic Prediction Using Canopy Coverage Image And Genotypic Information In Soybean Via A Hybrid Model, Reka Howard, Diego Jarquin

Department of Statistics: Faculty Publications

Prediction techniques are important in plant breeding as they provide a tool for selection that is more efficient and economical than traditional phenotypic and pedigree based selection. The conventional genomic prediction models include molecular marker information to predict the phenotype. With the development of new phenomics techniques we have the opportunity to collect image data on the plants, and extend the traditional genomic prediction models where we incorporate diverse set of information collected on the plants. In our research, we developed a hybrid matrix model that incorporates molecular marker and canopy coverage information as a weighted linear combination to predict …


Post-Er Stress Biogenesis Of Golgi Is Governed By Giantin, Cole P. Frisbie, Alexander Y. Lushnikov, Alexey V. Krasnoslobodtsev, Jean-Jack Riethoven, Jennifer L. Clarke, Elena I. Stepchenkova, Armen Petrosyan 2019 University of Nebraska Medical Center

Post-Er Stress Biogenesis Of Golgi Is Governed By Giantin, Cole P. Frisbie, Alexander Y. Lushnikov, Alexey V. Krasnoslobodtsev, Jean-Jack Riethoven, Jennifer L. Clarke, Elena I. Stepchenkova, Armen Petrosyan

Department of Statistics: Faculty Publications

Background: The Golgi apparatus undergoes disorganization in response to stress, but it is able to restore compact and perinuclear structure under recovery. This self-organization mechanism is significant for cellular homeostasis, but remains mostly elusive, as does the role of giantin, the largest Golgi matrix dimeric protein. Methods: In HeLa and different prostate cancer cells, we used the model of cellular stress induced by Brefeldin A (BFA). The conformational structure of giantin was assessed by proximity ligation assay and atomic force microscopy. The post-BFA distribution of Golgi resident enzymes was examined by 3D SIM high-resolution microscopy. Results: We detected that giantin …


Recursive Model For Dose-Time Responses In Pharmacological Studies, Saugato Rahman Dhruba, Aminur Rahman, Raziur Rahman, Souparno Ghosh, Ranadip Pal 2019 Texas Tech University

Recursive Model For Dose-Time Responses In Pharmacological Studies, Saugato Rahman Dhruba, Aminur Rahman, Raziur Rahman, Souparno Ghosh, Ranadip Pal

Department of Statistics: Faculty Publications

Background: Clinical studies often track dose-response curves of subjects over time. One can easily model the dose-response curve at each time point with Hill equation, but such a model fails to capture the temporal evolution of the curves. On the other hand, one can use Gompertz equation to model the temporal behaviors at each dose without capturing the evolution of time curves across dosage

Results: In this article, we propose a parametric model for dose-time responses that follows Gompertz law in time and Hill equation across dose approximately. We derive a recursion relation for dose-response curves over time capturing the …


The Dark Sky Character Of Archaeological Landscapes: Cultural Meaning And Conservation Strategies, Frank Prendergast 2019 Technological University Dublin

The Dark Sky Character Of Archaeological Landscapes: Cultural Meaning And Conservation Strategies, Frank Prendergast

Book/Book Chapter

This paper presents the first ever study of light pollution at selected Irish prehistoric archaeological landscapes. The concepts of cosmology and landscape are first briefly described and followed by a summary of early human settlement of the island. Building on this, the extant corpus of early prehistoric megalithic burial tombs is illustrated to show their contrasting distribution patterns and typology. Analysis of tomb locations using nearest-neighbour statistical methods reveals evidence of intentional clustering. Further geo-statistical analysis identifies the geographical locations and the density ranking of these nucleated clusters - a feature especially evident in the passage tomb tradition on this …


Using Neural Networks To Classify Discrete Circular Probability Distributions, Madelyn Gaumer 2019 Claremont Colleges

Using Neural Networks To Classify Discrete Circular Probability Distributions, Madelyn Gaumer

HMC Senior Theses

Given the rise in the application of neural networks to all sorts of interesting problems, it seems natural to apply them to statistical tests. This senior thesis studies whether neural networks built to classify discrete circular probability distributions can outperform a class of well-known statistical tests for uniformity for discrete circular data that includes the Rayleigh Test1, the Watson Test2, and the Ajne Test3. Each neural network used is relatively small with no more than 3 layers: an input layer taking in discrete data sets on a circle, a hidden layer, and an output …


Design Of Experiment And Analysis Techniques For Fuel Consumption Data Using Heavy-Duty Diesel Vehicles And On-Road Testing, Sarah Ann Mills 2019 West Virginia University

Design Of Experiment And Analysis Techniques For Fuel Consumption Data Using Heavy-Duty Diesel Vehicles And On-Road Testing, Sarah Ann Mills

Graduate Theses, Dissertations, and Problem Reports

Chassis dynamometer and on-road testing are usually employed to test vehicle operation. Testing on a chassis dynamometer reduces data variability compared to on-road testing due to the controlled environment but it does not account for other important variables that affects real-world vehicle operation. This study used on-road testing to investigate the differences between two test fuels under real-world conditions. Three heavy-duty diesel vehicles were driven on different routes for a period of three months. Each vehicle was instrumented with flow meters to gather fuel consumption data, which was then compared to the fuel rate broadcasted by the engine control unit …


Global Warming Statistical Analysis, Jared Skinner 2019 The University of Akron

Global Warming Statistical Analysis, Jared Skinner

Williams Honors College, Honors Research Projects

This paper will investigate global warming and its effects on natural disasters. I will review the historic movements of climate change and activism, as well as the current discussions surrounding global warming. Secondly, I will examine various datasets, paying attention to the severity and frequency of specific natural disasters. I will then touch briefly on the topic of catastrophe modeling as it relates to the increased risk and losses associated with the discussed natural disasters and how those put the problem of global warming in a framework which financial and government institutions can grasp. I will also be analyzing economic …


Regression Tree Construction For Reinforcement Learning Problems With A General Action Space, Anthony S. Bush Jr 2019 Georgia Southern University

Regression Tree Construction For Reinforcement Learning Problems With A General Action Space, Anthony S. Bush Jr

Electronic Theses and Dissertations

Part of the implementation of Reinforcement Learning is constructing a regression of values against states and actions and using that regression model to optimize over actions for a given state. One such common regression technique is that of a decision tree; or in the case of continuous input, a regression tree. In such a case, we fix the states and optimize over actions; however, standard regression trees do not easily optimize over a subset of the input variables\cite{Card1993}. The technique we propose in this thesis is a hybrid of regression trees and kernel regression. First, a regression tree splits over …


Rfviz: An Interactive Visualization Package For Random Forests In R, Christopher Beckett 2018 Utah State University

Rfviz: An Interactive Visualization Package For Random Forests In R, Christopher Beckett

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

Random forests are very popular tools for predictive analysis and data science. They work for both classification (where there is a categorical response variable) and regression (where the response is continuous). Random forests provide proximities, and both local and global measures of variable importance. However, these quantities require special tools to be effectively used to interpret the forest. Rfviz is a sophisticated interactive visualization package and toolkit in R, specially designed for interpreting the results of a random forest in a user-friendly way. Rfviz uses a recently developed R package (loon) from the Comprehensive R Archive Network (CRAN) to create …


Season-Ahead Forecasting Of Water Storage And Irrigation Requirements – An Application To The Southwest Monsoon In India, Arun Ravindranath, Naresh Devineni, Upmanu Lall, Paulina Concha Larrauri 2018 CUNY City College

Season-Ahead Forecasting Of Water Storage And Irrigation Requirements – An Application To The Southwest Monsoon In India, Arun Ravindranath, Naresh Devineni, Upmanu Lall, Paulina Concha Larrauri

Publications and Research

Water risk management is a ubiquitous challenge faced by stakeholders in the water or agricultural sector. We present a methodological framework for forecasting water storage requirements and present an application of this methodology to risk assessment in India. The application focused on forecasting crop water stress for potatoes grown during the monsoon season in the Satara district of Maharashtra. Pre-season large-scale climate predictors used to forecast water stress were selected based on an exhaustive search method that evaluates for highest ranked probability skill score and lowest root-mean-squared error in a leave-one-out cross-validation mode. Adaptive forecasts were made in the years …


Yelp’S Review Filtering Algorithm, Yao Yao, Ivelin Angelov, Jack Rasmus-Vorrath, Mooyoung Lee, Daniel W. Engels 2018 Southern Methodist University

Yelp’S Review Filtering Algorithm, Yao Yao, Ivelin Angelov, Jack Rasmus-Vorrath, Mooyoung Lee, Daniel W. Engels

SMU Data Science Review

In this paper, we present an analysis of features influencing Yelp's proprietary review filtering algorithm. Classifying or misclassifying reviews as recommended or non-recommended affects average ratings, consumer decisions, and ultimately, business revenue. Our analysis involves systematically sampling and scraping Yelp restaurant reviews. Features are extracted from review metadata and engineered from metrics and scores generated using text classifiers and sentiment analysis. The coefficients of a multivariate logistic regression model were interpreted as quantifications of the relative importance of features in classifying reviews as recommended or non-recommended. The model classified review recommendations with an accuracy of 78%. We found that reviews …


Digital Commons powered by bepress