Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Longitudinal Data Analysis and Time Series

Theses/Dissertations

Institution
Keyword
Publication Year
Publication

Articles 31 - 60 of 162

Full-Text Articles in Physical Sciences and Mathematics

Liquidity Commonality With Factor Models, Ernesto Garcia Iii Feb 2022

Liquidity Commonality With Factor Models, Ernesto Garcia Iii

Dissertations, Theses, and Capstone Projects

Market microstructure research has recently devoted attention to a phenomenon called commonality in liquidity. In this dissertation, I will analyze commonality in liquidity using a novel factor model approach and a generalized definition of commonality in liquidity. This analysis will show that commonality in liquidity is rarely a marketwide phenomenon and is mostly restricted to stocks with a large market capitalization. Additionally, commonality in liquidity is a very recent phenomenon whose appearance coincides with a rise in passive investing after the Dotcom Bubble burst and, more so, after the 2008 Financial Crisis. I will present evidence that suggests commonality in …


Análisis De Los Días De Mora Para La Cartera De Un Producto Financiero En Colombia, Una Aproximación A Partir De Las Series De Tiempo (2013 - 2018), Eleny Kottaridis Fernandez Jan 2022

Análisis De Los Días De Mora Para La Cartera De Un Producto Financiero En Colombia, Una Aproximación A Partir De Las Series De Tiempo (2013 - 2018), Eleny Kottaridis Fernandez

Economía

La morosidad sobre la cartera de consumo evidencia un patrón que debe ser considerado en la toma de decisiones de las entidades financieras para una adecuada administración del riesgo crediticio teniendo en cuenta su alta volatilidad. En efecto, un desempeño económico desfavorable relacionado con algunos sectores financieros, las bajas tasas de crecimiento económico y mayores niveles de desempleo, incrementa la probabilidad del incumplimiento de las obligaciones de los hogares debido a la menor capacidad de pago por la reducción de sus ingresos. De acuerdo con estos impactos, las entidades financieras necesitan contar con mecanismos para abordar el pronóstico sobre el …


Estimating The Statistics Of Operational Loss Through The Analyzation Of A Time Series, Maurice L. Brown Jan 2022

Estimating The Statistics Of Operational Loss Through The Analyzation Of A Time Series, Maurice L. Brown

Theses and Dissertations

In the world of finance, appropriately understanding risk is key to success or failure because it is a fundamental driver for institutional behavior. Here we focus on risk as it relates to the operations of financial institutions, namely operational risk. Quantifying operational risk begins with data in the form of a time series of realized losses, which can occur for a number of reasons, can vary over different time intervals, and can pose a challenge that is exacerbated by having to account for both frequency and severity of losses. We introduce a stochastic point process model for the frequency distribution …


Lake Huron Shoreline Analysis, Shubham Satish Nandanwar Jan 2022

Lake Huron Shoreline Analysis, Shubham Satish Nandanwar

Theses and Dissertations (Comprehensive)

Lake Huron is a popular tourist destination and is home to several businesses and residents. Since the shoreline is dynamic and is subject to change over the years due to several factors such as a change in water level, soil type, human encroachment, etc., these locations tend to encounter floods due to increased water levels and wind speed. This causes erosion and loss to the properties along the shoreline.

This study is based on two areas of interest named Pinery Provincial Park and Sauble Beach which are located on the shoreline of Lake Huron where Pinery Provincial Park is a …


Realtime Event Detection In Sports Sensor Data With Machine Learning, Mallory Cashman Jan 2022

Realtime Event Detection In Sports Sensor Data With Machine Learning, Mallory Cashman

Honors Theses and Capstones

Machine learning models can be trained to classify time series based sports motion data, without reliance on assumptions about the capabilities of the users or sensors. This can be applied to predict the count of occurrences of an event in a time period. The experiment for this research uses lacrosse data, collected in partnership with SPAITR - a UNH undergraduate startup developing motion tracking devices for lacrosse. Decision Tree and Support Vector Machine (SVM) models are trained and perform with high success rates. These models improve upon previous work in human motion event detection and can be used a reference …


Bayesian Variable Selection Strategies In Longitudinal Mixture Models And Categorical Regression Problems., Md Nazir Uddin Aug 2021

Bayesian Variable Selection Strategies In Longitudinal Mixture Models And Categorical Regression Problems., Md Nazir Uddin

Electronic Theses and Dissertations

In this work, we seek to develop a variable screening and selection method for Bayesian mixture models with longitudinal data. To develop this method, we consider data from the Health and Retirement Survey (HRS) conducted by University of Michigan. Considering yearly out-of-pocket expenditures as the longitudinal response variable, we consider a Bayesian mixture model with $K$ components. The data consist of a large collection of demographic, financial, and health-related baseline characteristics, and we wish to find a subset of these that impact cluster membership. An initial mixture model without any cluster-level predictors is fit to the data through an MCMC …


Multiple Baseline Interrupted Time Series: Describing Changes In New Mexico Medicaid Behavioral Health Home Patients’ Care, Jessica Reno Jul 2021

Multiple Baseline Interrupted Time Series: Describing Changes In New Mexico Medicaid Behavioral Health Home Patients’ Care, Jessica Reno

Mathematics & Statistics ETDs

In 2016, the CareLink New Mexico behavioral health homes program began enrolling Medicaid recipients with the goal of increasing care coordination, improving access to services, and decreasing long-term costs of care for adults with serious mental illness (SMI) and children with severe emotional disturbance (SED). To evaluate these aims, a retrospective interrupted time series study using Medicaid claims data was designed. First, a comparable subset of non-enrolled individuals was selected from the pool of Medicaid recipients with SMI or SED using propensity score matching. Then, segmented regression was applied to three outcomes: total Medicaid charges, number of outpatient behavioral health …


Stock Markets Performance During A Pandemic: How Contagious Is Covid-19?, Yara Abushahba May 2021

Stock Markets Performance During A Pandemic: How Contagious Is Covid-19?, Yara Abushahba

Theses and Dissertations

Background and Motivation: The coronavirus (“COVID-19”) pandemic, the subsequent policies and lockdowns have unarguably led to an unprecedented fluid circumstance worldwide. The panic and fluctuations in the stock markets were unparalleled. It is inarguable that real-time availability of news and social media platforms like Twitter played a vital role in driving the investors’ sentiment during such global shock.

Purpose:The purpose of this thesis is to study how the investor sentiment in relation to COVID-19 pandemic influenced stock markets globally and how stock markets globally are integrated and contagious. We analyze COVID-19 sentiment through the Twitter posts and investigate its …


Characterizing The Northern Hemisphere Circumpolar Vortex Through Space And Time, Nazla Bushra May 2021

Characterizing The Northern Hemisphere Circumpolar Vortex Through Space And Time, Nazla Bushra

LSU Doctoral Dissertations

This hemispheric-scale, steering atmospheric circulation represented by the circumpolar vortices (CPVs) are the middle- and upper-tropospheric wind belts circumnavigating the poles. Variability in the CPV area, shape, and position are important topics in geoenvironmental sciences because of the many links to environmental features. However, a means of characterizing the CPV has remained elusive. The goal of this research is to (i) identify the Northern Hemisphere CPV (NHCPV) and its morphometric characteristics, (ii) understand the daily characteristics of NHCPV area and circularity over time, (iii) identify and analyze spatiotemporal variability in the NHCPV’s centroid, and (iv) analyze how CPV features relate …


Motor Control-Based Assessment Of Therapy Effects In Individuals Post-Stroke: Implications For Prediction Of Response And Subject-Specific Modifications, Ashley Rice May 2021

Motor Control-Based Assessment Of Therapy Effects In Individuals Post-Stroke: Implications For Prediction Of Response And Subject-Specific Modifications, Ashley Rice

Doctoral Dissertations

Producing a coordinated motion such as walking is, at its root, the result of healthy communication pathways between the central nervous system and the musculoskeletal system. The central nervous system produces an electrical signal responsible for the excitation of a muscle, and the musculoskeletal system contains the necessary equipment for producing a movement-driving force to achieve a desired motion. Motor control refers to the ability an individual has to produce a desired motion, and the complexity of motor control is a mathematical concept stemming from how the electrical signals from the central nervous system translate to muscle activations. Exercising a …


Extension Of The Two-Step Approach For Informative Dropout In Survival Analysis, Cristina Murray-Krezan Apr 2021

Extension Of The Two-Step Approach For Informative Dropout In Survival Analysis, Cristina Murray-Krezan

Mathematics & Statistics ETDs

Chronic kidney disease (CKD) in children is known to result in poor growth and quality of life, and frequently results in kidney failure. The Chronic Kidney Disease in Children study (CKiD) is a prospective cohort study enrolling children ages 1 to 16 to assess health outcomes in children with CKD including the effects of declining glomerular filtration rate and the resulting consequences of growth failure on morbidity. Quantification of the magnitude of the risk for decreased kidney function and, ultimately, failure has been achieved through a variety of studies, often including cohort studies such as the CKiD study. Longitudinal studies …


Uncovering Object Categories In Infant Views, Naiti S. Bhatt Jan 2021

Uncovering Object Categories In Infant Views, Naiti S. Bhatt

Scripps Senior Theses

While adults recognize objects in a near-instant, infants must learn how to categorize the objects in their visual environments. Recent work has shown that egocentric head-mounted camera videos contain rich data that illuminate the infant experience (Clerkin et al., 2017; Franchak et al., 2011; Yoshida & Smith, 2008). While past work has focused on the social information in view, in this work, we aim to characterize the objects in infants’ at-home visual environments by modifying modern computer vision models for the infant view. To do so, we collected manual annotations of objects that infants seemed to be interacting within a …


Writing At The Horizon: How Producing Imagined Narratives Affects Mood, David Yu-Zhong Liang Jan 2021

Writing At The Horizon: How Producing Imagined Narratives Affects Mood, David Yu-Zhong Liang

Senior Projects Fall 2021

The present study explores the effect of three different writing activities and their subsequent effects on participant mood. Writing has been of particular interest for psychologists due to its use in interventions aimed at working through traumatic or stressful periods, and recent research has begun to explore the use of narrative in placing traumatic events and experiences in greater context. However, purely therapeutic, intervention-based writing exercises exclude a large amount of more expressive, imagined creations and narratives, which may have the capacity to reorient, contextualize, and otherwise positively affect a person’s mood. This study investigates whether employing the imagination may …


Design And Analyses Of School-Based Violence Prevention Cluster Randomized Trials, Md. Tofial Azam Jan 2021

Design And Analyses Of School-Based Violence Prevention Cluster Randomized Trials, Md. Tofial Azam

Theses and Dissertations--Epidemiology and Biostatistics

Interpersonal violence such as teen dating violence is a severe public health problem. Teen dating violence, including sexual violence (unwanted sexual contacts or activities), physical and psychological dating violence, sexual harassment, and stalking, affects high school students' physical and mental health and academic achievement in the United States. Dating violence is linked to psychological abuse perpetration in the future, depression, anxiety, and hostility. The teen dating violence victimization experience was related to antisocial behavior, drug abuse, increased heavy drinking, depression, suicidal ideation, smoking, and adult interpersonal violence victimization during adolescence. The detrimental effects of interpersonal violence demonstrate the critical importance …


Feature Investigation For Stock Returns Prediction Using Xgboost And Deep Learning Sentiment Classification, Seungho (Samuel) Lee Jan 2021

Feature Investigation For Stock Returns Prediction Using Xgboost And Deep Learning Sentiment Classification, Seungho (Samuel) Lee

CMC Senior Theses

This paper attempts to quantify predictive power of social media sentiment and financial data in stock prediction by utilizing a comprehensive set of stock-related fundamental and technical variables and social media sentiments. For conducting sentiment analysis, this study employs a pretrained finBERT model that provides three different sentiment classifications and respective softmax scores. Hence, the significance of these variables is evaluated with XGBoost regression and Shapley Additive exPlanations (SHAP) frameworks. Through investigating feature importance, this study finds that statistical properties of sentiment variables provide a stronger predictive power than a weighted sentiment score and that it is possible to quantify …


Self-Exciting Point Process For Modelling Terror Attack Data, Siyi Wang Jan 2021

Self-Exciting Point Process For Modelling Terror Attack Data, Siyi Wang

Theses and Dissertations (Comprehensive)

Terrorism becomes more rampant in recent years because of separatism and extreme nationalism, which brings a serious threat to the national security of many countries in the world. The analysis of spatial and temporal patterns of terror data is significant in containing terrorism. This thesis focuses on building and applying a temporal point process called self-exciting point process to fit the terror data from 1970 to 2018 of 10 countries. The data come from the Global Terrorism database. Further, an application in predicting the number of terror events based on the self-exciting model is another main innovative idea, in which …


Novel Nonparametric Testing Approaches For Multivariate Growth Curve Data: Finite-Sample, Resampling And Rank-Based Methods, Ting Zeng Jan 2021

Novel Nonparametric Testing Approaches For Multivariate Growth Curve Data: Finite-Sample, Resampling And Rank-Based Methods, Ting Zeng

Theses and Dissertations--Statistics

Multivariate growth curve data naturally arise in various fields, for example, biomedical science, public health, agriculture, social science and so on. For data of this type, the classical approach is to conduct multivariate analysis of variance (MANOVA) based on Wilks' Lambda and other multivariate statistics, which require the assumptions of multivariate normality and homogeneity of within-cell covariance matrices. However, data being analyzed nowadays show marked departure from multivariate normal distribution and homoscedasticity. In this dissertation, we investigate nonparametric testing approaches for multivariate growth curve data from three aspects, i.e., finite-sample, resampling and rank-based methods.

The first project proposes an approximate …


Improved Statistical Methods For Time-Series And Lifetime Data, Xiaojie Zhu Dec 2020

Improved Statistical Methods For Time-Series And Lifetime Data, Xiaojie Zhu

Statistical Science Theses and Dissertations

In this dissertation, improved statistical methods for time-series and lifetime data are developed. First, an improved trend test for time series data is presented. Then, robust parametric estimation methods based on system lifetime data with known system signatures are developed.

In the first part of this dissertation, we consider a test for the monotonic trend in time series data proposed by Brillinger (1989). It has been shown that when there are highly correlated residuals or short record lengths, Brillinger’s test procedure tends to have significance level much higher than the nominal level. This could be related to the discrepancy between …


Snow-Albedo Feedback In Northern Alaska: How Vegetation Influences Snowmelt, Lucas C. Reckhaus Aug 2020

Snow-Albedo Feedback In Northern Alaska: How Vegetation Influences Snowmelt, Lucas C. Reckhaus

Theses and Dissertations

This paper investigates how the snow-albedo feedback mechanism of the arctic is changing in response to rising climate temperatures. Specifically, the interplay of vegetation and snowmelt, and how these two variables can be correlated. This has the potential to refine climate modelling of the spring transition season. Research was conducted at the ecoregion scale in northern Alaska from 2000 to 2020. Each ecoregion is defined by distinct topographic and ecological conditions, allowing for meaningful contrast between the patterns of spring albedo transition across surface conditions and vegetation types. The five most northerly ecoregions of Alaska are chosen as they encompass …


D-Vine Pair-Copula Models For Longitudinal Binary Data, Huihui Lin Aug 2020

D-Vine Pair-Copula Models For Longitudinal Binary Data, Huihui Lin

Mathematics & Statistics Theses & Dissertations

Dependent longitudinal binary data are prevalent in a wide range of scientific disciplines, including healthcare and medicine. A popular method for analyzing such data is the multivariate probit (MP) model. The motivation for this dissertation stems from the fact that the MP model fails even the binary correlations are within the feasible range. The reason being the underlying correlation matrix of the latent variables in the MP model may not be positive definite. In this dissertation, we study alternatives that are based on D-vine pair-copula models. We consider both the serial dependence modeled by the first order autoregressive (AR(1)) and …


Analyzing The Fractal Dimension Of Various Musical Pieces, Nathan Clark Aug 2020

Analyzing The Fractal Dimension Of Various Musical Pieces, Nathan Clark

Industrial Engineering Undergraduate Honors Theses

One of the most common tools for evaluating data is regression. This technique, widely used by industrial engineers, explores linear relationships between predictors and the response. Each observation of the response is a fixed linear combination of the predictors with an added error element. The method is built on the assumption that this error is normally distributed across all observations and has a mean of zero. In some cases, it has been found that the inherent variation is not the result of a random variable, but is instead the result of self-symmetric properties of the observations. For data with these …


A Novel Correction For The Adjusted Box-Pierce Test — New Risk Factors For Emergency Department Return Visits Within 72 Hours For Children With Respiratory Conditions — General Pediatric Model For Understanding And Predicting Prolonged Length Of Stay, Sidy Danioko Aug 2020

A Novel Correction For The Adjusted Box-Pierce Test — New Risk Factors For Emergency Department Return Visits Within 72 Hours For Children With Respiratory Conditions — General Pediatric Model For Understanding And Predicting Prolonged Length Of Stay, Sidy Danioko

Computational and Data Sciences (PhD) Dissertations

This thesis represents the results of three research projects that underline the breadth and depth of my interests.

Firstly, I devoted some efforts to the well-known Box-Pierce goodness-of-fit tests for time series models which has been an important research topic over the last few decades. All previously proposed tests are focused on changes of the test statistics. Instead, I adopted a different approach that takes the best performing test and modifying the rejection region. Thus, I developed a semiparametric correction of the Adjusted Box-Pierce test that attains the best I error rates for all sample sizes and lags and outperforms …


The Limits Of Location Privacy In Mobile Devices, Keen Yuun Sung Jul 2020

The Limits Of Location Privacy In Mobile Devices, Keen Yuun Sung

Doctoral Dissertations

Mobile phones are widely adopted by users across the world today. However, the privacy implications of persistent connectivity are not well understood. This dissertation focuses on one important concern of mobile phone users: location privacy. I approach this problem from the perspective of three adversaries that users are exposed to via smartphone apps: the mobile advertiser, the app developer, and the cellular service provider. First, I quantify the proportion of mobile users who use location permissive apps and are able to be tracked through their advertising identifier, and demonstrate a mark and recapture attack that allows continued tracking of users …


Predictive Modeling Of Asynchronous Event Sequence Data, Jin Shang May 2020

Predictive Modeling Of Asynchronous Event Sequence Data, Jin Shang

LSU Doctoral Dissertations

Large volumes of temporal event data, such as online check-ins and electronic records of hospital admissions, are becoming increasingly available in a wide variety of applications including healthcare analytics, smart cities, and social network analysis. Those temporal events are often asynchronous, interdependent, and exhibiting self-exciting properties. For example, in the patient's diagnosis events, the elevated risk exists for a patient that has been recently at risk. Machine learning that leverages event sequence data can improve the prediction accuracy of future events and provide valuable services. For example, in e-commerce and network traffic diagnosis, the analysis of user activities can be …


Predicting Disease Progression Using Deep Recurrent Neural Networks And Longitudinal Electronic Health Record Data, Seunghwan Kim May 2020

Predicting Disease Progression Using Deep Recurrent Neural Networks And Longitudinal Electronic Health Record Data, Seunghwan Kim

McKelvey School of Engineering Theses & Dissertations

Electronic Health Records (EHR) are widely adopted and used throughout healthcare systems and are able to collect and store longitudinal information data that can be used to describe patient phenotypes. From the underlying data structures used in the EHR, discrete data can be extracted and analyzed to improve patient care and outcomes via tasks such as risk stratification and prospective disease management. Temporality in EHR is innately present given the nature of these data, however, and traditional classification models are limited in this context by the cross- sectional nature of training and prediction processes. Finding temporal patterns in EHR is …


Data-Driven Investment Decisions In P2p Lending: Strategies Of Integrating Credit Scoring And Profit Scoring, Yan Wang Apr 2020

Data-Driven Investment Decisions In P2p Lending: Strategies Of Integrating Credit Scoring And Profit Scoring, Yan Wang

Doctor of Data Science and Analytics Dissertations

In this dissertation, we develop and discuss several loan evaluation methods to guide the investment decisions for peer-to-peer (P2P) lending. In evaluating loans, credit scoring and profit scoring are the two widely utilized approaches. Credit scoring aims at minimizing the risk while profit scoring aims at maximizing the profit. This dissertation addresses the strengths and weaknesses of each scoring method by integrating them in various ways in order to provide the optimal investment suggestions for different investors. Before developing the methods for loan evaluation at the individual level, we applied the state-of-the-art method called the Long Short Term Memory (LSTM) …


Three Essays On Health Economics And Policy Evaluation, Shishir Shakya Jan 2020

Three Essays On Health Economics And Policy Evaluation, Shishir Shakya

Graduate Theses, Dissertations, and Problem Reports

This dissertation consists of three essays on the U.S. Health care policy. Each paragraph below refers to the three abstracts for the three chapters in this dissertation, respectively. I provide quantitative evidence on how much Prescription Drug Monitoring Programs (PDMPs) affects the retail opioid prescribing behaviors. Using the American Community Survey (ACS), I retrieve county-level high dimensional panel data set from 2010 to 2017. I employ three separate identification strategies: difference-in-difference, double selection post-LASSO, and spatial difference-in-difference. I compare how the retail opioid prescribing behaviors of counties, that are mandatory for prescribers to check the PDMP before prescribing controlled substances …


Zero-Inflated Longitudinal Mixture Model For Stochastic Radiographic Lung Compositional Change Following Radiotherapy Of Lung Cancer, Viviana A. Rodríguez Romero Jan 2020

Zero-Inflated Longitudinal Mixture Model For Stochastic Radiographic Lung Compositional Change Following Radiotherapy Of Lung Cancer, Viviana A. Rodríguez Romero

Theses and Dissertations

Compositional data (CD) is mostly analyzed as relative data, using ratios of components, and log-ratio transformations to be able to use known multivariable statistical methods. Therefore, CD where some components equal zero represent a problem. Furthermore, when the data is measured longitudinally, observations are spatially related and appear to come from a mixture population, the analysis becomes highly complex. For this matter, a two-part model was proposed to deal with structural zeros in longitudinal CD using a mixed-effects model. Furthermore, the model has been extended to the case where the non-zero components of the vector might a two component mixture …


Tropical Cyclone Hazards In Relation To Propagation Speed, Jiehao Huang Jan 2020

Tropical Cyclone Hazards In Relation To Propagation Speed, Jiehao Huang

Dissertations and Theses

As the population and infrastructure along the US East Coast increase, it becomes increasingly important to study the characteristics of tropical cyclones that can impact the coast. A recent study shows that the propagation speed of tropical cyclones has slowed over the past 60 years, which can lead to greater accumulation of precipitation and greater storm surge impacts. The study presented herein is meant to examine and analyze the relationships that exist between the propagation speed of tropical cyclones, their surface wind strength, displacement angles, and cyclone averaged winds. This analysis is focused on tropical cyclones spanning from 1950-2015 in …


Home Sales As A Time Series Model, Noah R. Hellenthal Jan 2020

Home Sales As A Time Series Model, Noah R. Hellenthal

Williams Honors College, Honors Research Projects

Rational Expectations Hypothesis is an economic theorem that states that our best way to predict the future is by looking at the past. While this theory is typically used to address inflation, the same concept can be used when predicting future home sales. With the failure of subprime mortgages and the burst of the housing market bubble in 2008, home sales are proven to be an appropriate indication of how the U.S. economy is performing. Through time series analysis, I will be able to construct a model with monthly home sales data from the U.S. Census Bureau. Due to seasonality …