Functional Regression, 2015 The University of Texas

#### Functional Regression, Jeffrey S. Morris

*Jeffrey S. Morris*

Functional data analysis (FDA) involves the analysis of data whose ideal units of observation are functions defined on some continuous domain, and the observed data consist of a sample of functions taken from some population, sampled on a discrete grid. Ramsay and Silverman's 1997 textbook sparked the development of this field, which has accelerated in the past 10 years to become one of the fastest growing areas of statistics, fueled by the growing number of applications yielding this type of data. One unique characteristic of FDA is the need to combine information both across and within functions, which Ramsay ...

Targeted Learning Of An Optimal Dynamic Treatment, And Statistical Inference For Its Mean Outcome, 2014 COBRA

#### Targeted Learning Of An Optimal Dynamic Treatment, And Statistical Inference For Its Mean Outcome, Mark J. Van Der Laan, Alexander R. Luedtke

*U.C. Berkeley Division of Biostatistics Working Paper Series*

Suppose we observe n independent and identically distributed observations of a time-dependent random variable consisting of baseline covariates, initial treatment and censoring indicator, intermediate covariates, subsequent treatment and censoring indicator, and a final outcome. For example, this could be data generated by a sequentially randomized controlled trial, where subjects are sequentially randomized to a first line and second line treatment, possibly assigned in response to an intermediate biomarker, and are subject to right-censoring. In this article we consider estimation of an optimal dynamic multiple time-point treatment rule defined as the rule that maximizes the mean outcome under the dynamic treatment ...

Comparing Partial Least Square Approaches In Gene-Or Region-Based Association Study For Multiple Quantitative Phenotypes, 2014 Wayne State University

#### Comparing Partial Least Square Approaches In Gene-Or Region-Based Association Study For Multiple Quantitative Phenotypes, Zhongshang Yuan, Xiaoshuai Zhang, Fangyu Li, Jinghua Zhao, Fuzhong Xue

*Human Biology Open Access Pre-Prints*

When complex diseases are considered quantitatively, there are at least three statistical strategies for association studies: a single SNP on a single trait, a gene or region (with multiple SNPs) on a single trait, and a gene or region on multiple traits. The third is the most general for dissecting the genetic mechanisms underlying complex diseases that involve multiple quantitative traits. Gene- or region-based association methods built on partial least squares (PLS) approaches have been shown to have an apparent power advantage. However, few methods have been developed for multiple quantitative phenotypes or traits underlying a condition or disease, and the performance of various PLS approaches used in association study for ...

Inferences In Log-Rate Models, 2014 Minnesota State University, Mankato

#### Inferences In Log-Rate Models, Herbert C. Heien, William A. Baumann

*Journal of Undergraduate Research at Minnesota State University, Mankato*

Log-rate models are used to analyze rates for individuals who are exposed to a risk of having a certain characteristic. The explanatory variables can be categorical or measured on a continuous scale. In fitting a log-rate model, parameters are estimated and goodness of fit is studied to select the model that best fits the data. Here we revisit three aspects of log-rate models using the data set given at the end of the paper: parameter estimation, goodness of fit of the model, and the marginal effect of the factors.
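
For readers unfamiliar with the model class, the standard textbook form of a log-rate model can be written as below. The notation is an assumption for illustration and is not taken from the paper: $y_i$ is the event count, $t_i$ the exposure, $\mu_i = E[y_i]$, $\mathbf{x}_i$ the explanatory variables, and the counts are taken to be Poisson.

$$
\log\!\left(\frac{\mu_i}{t_i}\right) = \mathbf{x}_i^{\top}\boldsymbol{\beta}
\quad\Longleftrightarrow\quad
\log \mu_i = \log t_i + \mathbf{x}_i^{\top}\boldsymbol{\beta}
$$

Under the same Poisson assumption, goodness of fit is commonly summarized by the deviance of the fitted means $\hat{\mu}_i$:

$$
G^2 = 2\sum_i \left[\, y_i \log\!\left(\frac{y_i}{\hat{\mu}_i}\right) - \left(y_i - \hat{\mu}_i\right) \right]
$$

The term $\log t_i$ enters with a fixed coefficient of 1 (an offset), which is what distinguishes a model for rates from a model for raw counts.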

Modeling Count Data; Errata And Comments, 2014 SelectedWorks

#### Modeling Count Data; Errata And Comments, Joseph M. Hilbe

*Joseph M Hilbe*

Modeling Count Data: Errata and Comments PDF. Will be updated on a continuing basis.

Bird Keeping And Lung Cancer, 2014 Minnesota State University, Mankato

#### Bird Keeping And Lung Cancer, Andrew Tackmann, Jonathan Hellman, Jamie Johnson

*Journal of Undergraduate Research at Minnesota State University, Mankato*

Logistic regression is reviewed for estimating parameters and making inferences about them. A contingency table approach to computing goodness of fit in logistic regression is elaborated. An existing data set on a sample of lung cancer patients and a control group is used to apply the procedures discussed. The data reveal that, between the groups considered, the factors ‘bird keeping’ and ‘the number of years of smoking’ are significant as causes of lung cancer.

Simulating Burr Type VII Distributions Through The Method Of L-Moments And L-Correlations, 2014 SelectedWorks

#### Simulating Burr Type VII Distributions Through The Method Of L-Moments And L-Correlations, Mohan D. Pant, Todd C. Headrick

*Mohan Dev Pant*

Burr Type VII, a one-parameter non-normal distribution, is among the less studied distributions, especially in the contexts of statistical modeling and simulation studies. The main purpose of this study is to introduce a methodology for simulating univariate and multivariate Burr Type VII distributions through the method of L-moments and L-correlations. The methodology can be applied in statistical modeling of events in a variety of applied mathematical contexts and in Monte Carlo simulation studies. Numerical examples are provided to demonstrate that L-moment-based Burr Type VII distributions are superior to their conventional moment-based analogs in terms of distribution fitting and estimation. Simulation results ...

The Doubly Adaptive Lasso Methods For Time Series Analysis, 2014 Western University

#### The Doubly Adaptive Lasso Methods For Time Series Analysis, Zi Zhen Liu

*University of Western Ontario - Electronic Thesis and Dissertation Repository*

In this thesis, we propose a systematic approach called the doubly adaptive LASSO tailored to time series analysis, which includes four specific methods for four time series models, respectively:

The PAC-weighted adaptive LASSO for univariate autoregressive (AR) models. Although the LASSO methodology has been applied to AR models, the existing methods in the literature ignore the temporal dependence information embedded in AR time series data. Consequently, these methods may not reflect the characteristics of the underlying AR processes, especially the lag order of AR models. The PAC-weighted adaptive LASSO incorporates the partial autocorrelation (PAC) into the adaptive LASSO weights. The PAC-weighted ...

Perfect And Nearly Perfect Sampling Of Work-Conserving Queues, 2014 Western University

#### Perfect And Nearly Perfect Sampling Of Work-Conserving Queues, Yaofei Xiong

*University of Western Ontario - Electronic Thesis and Dissertation Repository*

We present sampling-based methods to treat work-conserving queueing systems. A variety of models are studied. Besides First Come First Served (FCFS) queues, considerable effort is devoted to the accumulating priority queue (APQ), where a customer accumulates priority linearly while waiting. APQs have Poisson arrivals, multi-class customers with corresponding service durations, and single or multiple servers.

Perfect sampling is an approach to draw a sample directly from the steady-state distribution of a Markov chain without explicitly solving for it. Statistical inference can be conducted without initialization bias. If an error can be tolerated within some limit, i.e. the total ...

Mathematical Modeling And Simulation Of Multialleic Migration-Selection Models, 2014 Minnesota State University, Mankato

#### Mathematical Modeling And Simulation Of Multialleic Migration-Selection Models, Chad N. Vidden

*Journal of Undergraduate Research at Minnesota State University, Mankato*

Population ecology is concerned with the growth and decay of specific populations. This field has a variety of applications ranging from evolution and survival at the environmental level to the spread of infectious disease at the cellular and molecular levels. Many ecological circumstances require the use of mathematical methods and reasoning in order to acquire better knowledge of the issue at hand. This study considered and analyzed multiple different mathematical models of population dynamics along with their purposes. This foundation was then applied in order to explore the migration of populations from one isolated region to another along with the ...

Identification Of Informativeness In Text Using Natural Language Stylometry, 2014 Western University

#### Identification Of Informativeness In Text Using Natural Language Stylometry, Rushdi Shams

*University of Western Ontario - Electronic Thesis and Dissertation Repository*

In this age of information overload, one experiences a rapidly growing over-abundance of written text. To assist with handling this bounty, this plethora of texts is now widely used to develop and optimize statistical natural language processing (NLP) systems. Surprisingly, the use of more fragments of text to train these statistical NLP systems may not necessarily lead to improved performance. We hypothesize that those fragments that help the most with training are those that contain the desired information. Therefore, determining informativeness in text has become a central issue in our view of NLP. Recent developments in this field have spawned ...

Mathematical Modeling Of Tick-Borne Encephalitis In Humans, 2014 Minnesota State University, Mankato

#### Mathematical Modeling Of Tick-Borne Encephalitis In Humans, Amanda Kriesel, Michael Meyer, Geoffrey Peterson

*Journal of Undergraduate Research at Minnesota State University, Mankato*

Tick-borne encephalitis is a viral infection that affects one's nervous system and is transmitted from tick to human through a tick bite. In recent years, the number of cases of tick-borne encephalitis in Europe has been increasing. This mathematical biological model of tick-borne encephalitis was created to further our understanding of this phenomenon, as well as to study the relationship between vectors and their hosts. Specifically, we will investigate the population model of ticks in certain regions and its correlation to tick-borne encephalitis infections in the region.

Choosing Between Parametric And Non-Parametric Tests, 2014 Minnesota State University, Mankato

#### Choosing Between Parametric And Non-Parametric Tests, Russ Johnson

*Journal of Undergraduate Research at Minnesota State University, Mankato*

A common question in comparing two sets of measurements is whether to use a parametric or a non-parametric testing procedure. The question is even more important when dealing with smaller samples. Here, using simulation, several parametric and non-parametric tests, such as the t-test, Normal test, Wilcoxon rank sum test, van der Waerden score test, and exponential score test, are compared.
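
A comparison of this kind can be sketched as a small Monte Carlo study. The block below is a minimal illustration, not the paper's code: it covers only two of the tests named above (a large-sample Normal test and the Wilcoxon rank sum test with its Normal approximation), and the sample size, shift, distributions, and function names are all assumptions chosen for the example.

```python
import math
import random
from statistics import NormalDist, mean, variance


def normal_test_p(x, y):
    """Two-sided p-value for a difference in means (large-sample Normal test)."""
    se = math.sqrt(variance(x) / len(x) + variance(y) / len(y))
    z = (mean(x) - mean(y)) / se
    return 2 * (1 - NormalDist().cdf(abs(z)))


def rank_sum_p(x, y):
    """Two-sided Wilcoxon rank sum p-value, Normal approximation, no tie correction."""
    n, m = len(x), len(y)
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    # Sum of the ranks held by the first sample in the pooled ordering.
    w = sum(rank for rank, (_, group) in enumerate(pooled, start=1) if group == 0)
    mu = n * (n + m + 1) / 2
    sigma = math.sqrt(n * m * (n + m + 1) / 12)
    return 2 * (1 - NormalDist().cdf(abs((w - mu) / sigma)))


def gauss_draw(rng):
    return rng.gauss(0.0, 1.0)


def skewed_draw(rng):
    return rng.expovariate(1.0)


def rejection_rates(draw, shift, n=15, reps=2000, alpha=0.05, seed=1):
    """Fraction of simulated data sets in which each test rejects at level alpha."""
    rng = random.Random(seed)
    hits = {"normal": 0, "rank_sum": 0}
    for _ in range(reps):
        x = [draw(rng) + shift for _ in range(n)]
        y = [draw(rng) for _ in range(n)]
        hits["normal"] += normal_test_p(x, y) < alpha
        hits["rank_sum"] += rank_sum_p(x, y) < alpha
    return {name: count / reps for name, count in hits.items()}


if __name__ == "__main__":
    for label, draw in [("normal data", gauss_draw), ("exponential data", skewed_draw)]:
        print(label, "size :", rejection_rates(draw, shift=0.0))
        print(label, "power:", rejection_rates(draw, shift=1.0))
```

With `shift=0.0` the rejection rate estimates each test's actual size; with a nonzero shift it estimates power. Varying `n` shows how the comparison changes for smaller samples.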

Modeling Student Engagement In The Classroom, 2014 Minnesota State University, Mankato

#### Modeling Student Engagement In The Classroom, Sarah Painter

*Journal of Undergraduate Research at Minnesota State University, Mankato*

Connections to Community is a multi-institutional study that looks at the influence of community on post-secondary science and engineering students and their engagement in academic activity. This paper focuses specifically on student engagement within the classroom as a follow-up to a previous paper by Wendy Hoffman, Identifying Influential Variables of Student Academic Engagement (Hoffman, 2013). The goal of this work is to model student engagement in the classroom using classroom observation data that has been cleaned, and then to compare the results with those found in Hoffman’s paper, which used pre-cleaning data. The cleaned data is used to create two ...

Mdc-R-Code, 2014 SelectedWorks

#### Mdc-R-Code, Joseph M. Hilbe

*Joseph M Hilbe*

Modeling Count Data: R code for download and use.

Extensions To Modeling Count Data, 2014 SelectedWorks

#### Extensions To Modeling Count Data, Joseph M. Hilbe

*Joseph M Hilbe*

Extensions to Modeling Count Data provides additional code and discussion of methodology beyond what appears in Hilbe, Modeling Count Data (2014). The book is designed to be basic and fewer than 300 pages long, so some topics that might be helpful to analysts modeling counts were excluded.

Herd Immunity And The Necessity Of Vaccinations: Modeling The Effects Of MMR Vaccinations, 2014 Minnesota State University, Mankato

#### Herd Immunity And The Necessity Of Vaccinations: Modeling The Effects Of MMR Vaccinations, Caitlyn Cardetti, Katie Groskreutz, Melissa Zins

*Journal of Undergraduate Research at Minnesota State University, Mankato*

The MMR vaccine is a two-dose vaccine that prevents measles, mumps, and rubella; children receive the first dose between 12 and 15 months of age and the second between 4 and 6 years. The objective was to mathematically model the effects of MMR vaccination in a hypothetical school through multiple compartment and spatial models. These models treated each disease individually, with its respective vaccine effectiveness and disease infection rate. These models demonstrated the limits of herd immunity. Herd immunity occurs when a high enough percentage of the population is immune or vaccinated to prevent the ...
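
The idea of a herd immunity limit can be illustrated with the classical threshold from a homogeneous-mixing compartment model, where the immune fraction must exceed 1 - 1/R0. This is a simplified sketch and not the authors' compartment or spatial models; the function names and every numeric input are assumptions for illustration.

```python
def herd_immunity_threshold(r0):
    """Immune fraction needed so that each case infects fewer than one other on average.

    Assumes homogeneous mixing; r0 is the basic reproduction number.
    """
    if r0 <= 1:
        return 0.0  # the disease cannot sustain spread even with no immunity
    return 1.0 - 1.0 / r0


def required_coverage(r0, effectiveness):
    """Vaccination coverage needed when only a fraction `effectiveness` of vaccinees become immune.

    A result above 1.0 means vaccination alone cannot reach the threshold.
    """
    if not 0 < effectiveness <= 1:
        raise ValueError("effectiveness must be in (0, 1]")
    return herd_immunity_threshold(r0) / effectiveness


if __name__ == "__main__":
    # Illustrative inputs only; they are not estimates taken from the paper.
    for r0, eff in [(4.0, 0.90), (15.0, 0.97)]:
        print(f"R0={r0}, effectiveness={eff}: coverage needed = {required_coverage(r0, eff):.3f}")
```

The division by effectiveness is what makes the limit visible: for a highly contagious disease with an imperfect vaccine, the required coverage approaches or exceeds 100 percent.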

Coexistence Of Multi-Allelic Polymorphism With Migration And Selection, 2014 Minnesota State University, Mankato

#### Coexistence Of Multi-Allelic Polymorphism With Migration And Selection, Andrew Flick

*Journal of Undergraduate Research at Minnesota State University, Mankato*

Population ecology is concerned with the growth patterns of populations. This field has many applications, ranging from survival at the environmental level to the spread of infectious diseases at the cellular level. Mathematical modeling and computer simulation can be powerful tools in researching this area. I will be investigating the spatial patterns in populations (or gene frequencies) due to migration and selection. My research concerns conditions for the maintenance of polymorphism under a variety of migration schemes in discrete-space and continuous-time mathematical models. The results will be applicable from the ecological level to the molecular level. Some species are better ...

Statistical Models Of Self-Efficacy In STEM Students, 2014 Minnesota State University, Mankato

#### Statistical Models Of Self-Efficacy In STEM Students, Sarah Painter

*Journal of Undergraduate Research at Minnesota State University, Mankato*

Persistence through undergraduate education may be explained by self-efficacy, the belief in one’s own ability to persevere through challenges. Bandura stated four areas that are thought to influence self-efficacy: mastery experience, social persuasion, vicarious experience, and physiological state. In this study, we focused on general and academic self-efficacy in STEM students, in the hopes of learning more about the relationships between Bandura’s categories, demographics, and self-efficacy. Data was taken from two institutions: one a large research-focused university, and the other a smaller teaching-focused university. In the first phase, surveys on general self-efficacy were taken at ...

Instrumental Variable Estimation In A Survival Context, 2014 COBRA

#### Instrumental Variable Estimation In A Survival Context, Eric J. Tchetgen Tchetgen, Stefan Walter, Stijn Vansteelandt, Torben Martinussen, Maria Glymour

*Harvard University Biostatistics Working Paper Series*

No abstract provided.