Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability

PDF

Time series

Institution
Publication Year
Publication
Publication Type

Articles 1 - 30 of 58

Full-Text Articles in Entire DC Network

A Symbolic Approach To Nonlinear Time Series Analysis, Ranjan Karki, Nibhrat Lohia, Michael B. Schulte May 2024

A Symbolic Approach To Nonlinear Time Series Analysis, Ranjan Karki, Nibhrat Lohia, Michael B. Schulte

SMU Data Science Review

Current nonlinear time series methods such as neural networks forecast well. However, they act as a black box and are difficult to interpret, leaving the researchers and the audience with little insight into why the forecasts are the way they are. There is a need for a method that forecasts accurately while also being easy to interpret. This paper aims to develop a method to build an interpretable model for univariate and multivariate nonlinear time series data using wavelets and symbolic regression. The final method relies on multilayer perceptron (MLP) neural networks as a form of dimensionality reduction and the …


A Novel Correction For The Multivariate Ljung-Box Test, Minhao Huang May 2024

A Novel Correction For The Multivariate Ljung-Box Test, Minhao Huang

Computational and Data Sciences (PhD) Dissertations

This research introduces an analytical improvement to the Multivariate Ljung-Box test that addresses significant deviations of the original test from the nominal Type I error rates under almost all scenarios. Prior attempts to mitigate this issue have been directed at modification of the test statistics or correction of the test distribution to achieve precise results in finite samples. In previous studies, focused on designing corrections to the univariate Ljung-Box, a method that specifically adjusts the test rejection region has been the most successful of attaining the best Type I error rates. We adopt the same approach for the more complex, …


Good Practices And Common Pitfalls In Climate Time Series Changepoint Techniques: A Review, Robert B. Lund, Claudie Beaulieu, Rebecca Killick, Qiqi Lu, Xueheng Shi Dec 2023

Good Practices And Common Pitfalls In Climate Time Series Changepoint Techniques: A Review, Robert B. Lund, Claudie Beaulieu, Rebecca Killick, Qiqi Lu, Xueheng Shi

Department of Statistics: Faculty Publications

Climate changepoint (homogenization) methods abound today, with a myriad of techniques existing in both the climate and statistics literature. Unfortunately, the appropriate changepoint technique to use remains unclear to many. Further complicating issues, changepoint conclusions are not robust to perturbations in assumptions; for example, allowing for a trend or correlation in the series can drastically change changepoint conclusions. This paper is a review of the topic, with an emphasis on illuminating the models and techniques that allow the scientist to make reliable conclusions. Pitfalls to avoid are demonstrated via actual applications. The discourse begins by narrating the salient statistical features …


Gpu Utilization: Predictive Sarimax Time Series Analysis, Dorothy Dorie Parry Apr 2023

Gpu Utilization: Predictive Sarimax Time Series Analysis, Dorothy Dorie Parry

Modeling, Simulation and Visualization Student Capstone Conference

This work explores collecting performance metrics and leveraging the output for prediction on a memory-intensive parallel image classification algorithm - Inception v3 (or "Inception3"). Experimental results were collected by nvidia-smi on a computational node DGX-1, equipped with eight Tesla V100 Graphic Processing Units (GPUs). Time series analysis was performed on the GPU utilization data taken, for multiple runs, of Inception3’s image classification algorithm (see Figure 1). The time series model applied was Seasonal Autoregressive Integrated Moving Average Exogenous (SARIMAX).


Extending The M3-Competition: Category And Interval-Specific Time Series Forecasting, Will Sherman, Kati Schuerger, Randy Kim, Bivin Sadler Apr 2023

Extending The M3-Competition: Category And Interval-Specific Time Series Forecasting, Will Sherman, Kati Schuerger, Randy Kim, Bivin Sadler

SMU Data Science Review

The M3-Competition found that simple models outperform more complex ones for time series forecasting. As part of these competitions, several claims were made that statistical models exceeded machine learning (ML) techniques, such as recurrent neural networks (RNN), in prediction performance. These findings may over-generalize the capabilities of statistical models since the analysis measured the total forecasting accuracy across a wide range of industries and fields and with different interval lengths. This investigation aimed to assess how statistical and ML methods compared when individuating series by category and time interval. Utilizing the M3 data and building individual models using Facebook© Prophet …


A Change-Point Analysis Of Air Pollution Levels In Silao, Mexico And Fresno, California, Rachael Goodwin Apr 2023

A Change-Point Analysis Of Air Pollution Levels In Silao, Mexico And Fresno, California, Rachael Goodwin

WWU Honors College Senior Projects

We analyzed PM10 levels in the city of Silao, Mexico, as well as PM2.5 and PM10 levels in Fresno, California to determine if there was a shift in air pollution levels in either location. A change point based analysis was used to determine if there was a shift in air pollution levels. In the city of Silao, there was a significant increase in PM10 levels, but there was no significant change in Fresno for either pollutant.


Assessing Spurious Correlations In Big Search Data, Jesse T. Richman, Ryan J. Roberts Jan 2023

Assessing Spurious Correlations In Big Search Data, Jesse T. Richman, Ryan J. Roberts

Political Science & Geography Faculty Publications

Big search data offers the opportunity to identify new and potentially real-time measures and predictors of important political, geographic, social, cultural, economic, and epidemiological phenomena, measures that might serve an important role as leading indicators in forecasts and nowcasts. However, it also presents vast new risks that scientists or the public will identify meaningless and totally spurious ‘relationships’ between variables. This study is the first to quantify that risk in the context of search data. We find that spurious correlations arise at exceptionally high frequencies among probability distributions examined for random variables based upon gamma (1, 1) and Gaussian random …


Fitting Time Series Models To Fisheries Data To Ascertain Age, Kathleen S. Kirch, Norou Diawara, Cynthia M. Jones Jan 2023

Fitting Time Series Models To Fisheries Data To Ascertain Age, Kathleen S. Kirch, Norou Diawara, Cynthia M. Jones

OES Faculty Publications

The ability of government agencies to assign accurate ages of fish is important to fisheries management. Accurate ageing allows for most reliable age-based models to be used to support sustainability and maximize economic benefit. Assigning age relies on validating putative annual marks by evaluating accretional material laid down in patterns in fish ear bones, typically by marginal increment analysis. These patterns often take the shape of a sawtooth wave with an abrupt drop in accretion yearly to form an annual band and are typically validated qualitatively. Researchers have shown key interest in modeling marginal increments to verify the marks do, …


Stock Forecasts With Lstm And Web Sentiment, Michael Burgess, Faizan Javed, Nnenna Okpara, Chance Robinson Sep 2022

Stock Forecasts With Lstm And Web Sentiment, Michael Burgess, Faizan Javed, Nnenna Okpara, Chance Robinson

SMU Data Science Review

Traditional time-series techniques, such as auto-regressive and moving average models, can have difficulties when applied to stock data due to the randomness inherent to the markets. In this study, Long Short-Term Memory Recurrent Neural Networks, or LSTMs, have been applied to pricing data along with sentiment scores derived from web sources such as Twitter and other financial media outlets. The project team utilized this approach to complement the technical indicators observed at the end of each trading day for three stocks from the NASDAQ stock exchange over a 12-year span. A common benchmark to assess model performance on time series …


Realtime Event Detection In Sports Sensor Data With Machine Learning, Mallory Cashman Jan 2022

Realtime Event Detection In Sports Sensor Data With Machine Learning, Mallory Cashman

Honors Theses and Capstones

Machine learning models can be trained to classify time series based sports motion data, without reliance on assumptions about the capabilities of the users or sensors. This can be applied to predict the count of occurrences of an event in a time period. The experiment for this research uses lacrosse data, collected in partnership with SPAITR - a UNH undergraduate startup developing motion tracking devices for lacrosse. Decision Tree and Support Vector Machine (SVM) models are trained and perform with high success rates. These models improve upon previous work in human motion event detection and can be used a reference …


Estimating The Statistics Of Operational Loss Through The Analyzation Of A Time Series, Maurice L. Brown Jan 2022

Estimating The Statistics Of Operational Loss Through The Analyzation Of A Time Series, Maurice L. Brown

Theses and Dissertations

In the world of finance, appropriately understanding risk is key to success or failure because it is a fundamental driver for institutional behavior. Here we focus on risk as it relates to the operations of financial institutions, namely operational risk. Quantifying operational risk begins with data in the form of a time series of realized losses, which can occur for a number of reasons, can vary over different time intervals, and can pose a challenge that is exacerbated by having to account for both frequency and severity of losses. We introduce a stochastic point process model for the frequency distribution …


Institutional Context Drives Mobility: A Comprehensive Analysis Of Academic And Economic Factors That Influence International Student Enrollment At United States Higher Education Institutions, Natalie Cruz Apr 2021

Institutional Context Drives Mobility: A Comprehensive Analysis Of Academic And Economic Factors That Influence International Student Enrollment At United States Higher Education Institutions, Natalie Cruz

College of Education & Professional Studies (Darden) Posters

International student enrollment (ISE) has become a hallmark of world-class higher education institutions (HEIs). Although the U.S. has welcomed the largest numbers of international students since the 1950s, ISE shrunk by 10% in the previous three years from an all-time high of 903,127 students in 2016/2017 (IIE, 2019). Research studies about international student mobility and enrollment highlights the significant role that academic and economic rationales play for international students. This quantitative, ex post facto study focused on the influence of ranking, tuition, Optional Practical Training, Gross Domestic Product, and the unemployment rate on ISE at 2,884 U.S. HEIs from 2008 …


Satellite-Based Phenology Analysis In Evaluating The Response Of Puerto Rico And The United States Virgin Islands' Tropical Forests To The 2017 Hurricanes, Melissa Collin Jan 2021

Satellite-Based Phenology Analysis In Evaluating The Response Of Puerto Rico And The United States Virgin Islands' Tropical Forests To The 2017 Hurricanes, Melissa Collin

Cal Poly Humboldt theses and projects

The functionality of tropical forest ecosystems and their productivity is highly related to the timing of phenological events. Understanding forest responses to major climate events is crucial for predicting the potential impacts of climate change. This research utilized Landsat satellite data and ground-based Forest Inventory and Analysis (FIA) plot data to investigate the dynamics of Puerto Rico and the U.S. Virgin Islands’ (PRVI) tropical forests after two major hurricanes in 2017. Analyzing these two datasets allowed for validation of the remote sensing methodology with field data and for the investigation of whether this is an appropriate approach for estimating forest …


Assessing And Forecasting Chlorophyll Abundances In Minnesota Lake Using Remote Sensing And Statistical Approaches, Ben Von Korff Jan 2021

Assessing And Forecasting Chlorophyll Abundances In Minnesota Lake Using Remote Sensing And Statistical Approaches, Ben Von Korff

All Graduate Theses, Dissertations, and Other Capstone Projects

Harmful algae blooms (HABs) can negatively impact water quality, lake aesthetics, and can harm human and animal health. However, monitoring for HABs is rare in Minnesota. Detecting blooms which can vary spatially and may only be present briefly is challenging, so expanding monitoring in Minnesota would require the use of new and cost efficient technologies. Unmanned aerial vehicles (UAVs) were used for bloom mapping using RGB and near-infrared imagery. Real time monitoring was conducted in Bass Lake, in Faribault County, MN using trail cameras. Time series forecasting was conducted with high frequency chlorophyll-a data from a water quality sonde. Normalized …


Statistical Methods With A Focus On Joint Outcome Modeling And On Methods For Fire Science, Da Zhong Xi Nov 2020

Statistical Methods With A Focus On Joint Outcome Modeling And On Methods For Fire Science, Da Zhong Xi

Electronic Thesis and Dissertation Repository

Understanding the dynamics of wildfires contributes significantly to the development of fire science. Challenges in the analysis of historical fire data include defining fire dynamics within existing statistical frameworks, modeling the duration and size of fires as joint outcomes, identifying the how fires are grouped into clusters of subpopulations, and assessing the effect of environmental variables in different modeling frameworks. We develop novel statistical methods to consider outcomes related to fire science jointly. These methods address these challenges by linking univariate models for separate outcomes through shared random effects, an approach referred to as joint modeling. Comparisons with existing …


Time Series Analysis Of Offshore Buoy Light Detection And Ranging (Lidar) Windspeed Data, Aditya Garapati, Charles J. Henderson, Carl Walenciak, Brian T. Waite Sep 2020

Time Series Analysis Of Offshore Buoy Light Detection And Ranging (Lidar) Windspeed Data, Aditya Garapati, Charles J. Henderson, Carl Walenciak, Brian T. Waite

SMU Data Science Review

In this paper, modeling techniques for the forecasting of wind speed using historical values observed by Light Detection and Ranging (LIDAR) sensors in an offshore context are described. Both univariate time series and multivariate time series modeling techniques leveraging meteorological data collected simultaneously with the LIDAR data are evaluated for potential contributions to predictive ability. Accurate and timely ability to predict wind values is essential to the effective integration of wind power into existing power grid systems. It allows for both the management of rapid ramp-up / down of base production capacity due to highly variable wind power inputs and …


A Review Study Of Functional Autoregressive Models With Application To Energy Forecasting, Ying Chen, Thorsten Koch, Kian Guan Lim, Xiaofei Xu, Nazgul Zakiyeva Jul 2020

A Review Study Of Functional Autoregressive Models With Application To Energy Forecasting, Ying Chen, Thorsten Koch, Kian Guan Lim, Xiaofei Xu, Nazgul Zakiyeva

Research Collection Lee Kong Chian School Of Business

In this data‐rich era, it is essential to develop advanced techniques to analyze and understand large amounts of data and extract the underlying information in a flexible way. We provide a review study on the state‐of‐the‐art statistical time series models for univariate and multivariate functional data with serial dependence. In particular, we review functional autoregressive (FAR) models and their variations under different scenarios. The models include the classic FAR model under stationarity; the FARX and pFAR model dealing with multiple exogenous functional variables and large‐scale mixed‐type exogenous variables; the vector FAR model and common functional principal component technique to handle …


Essays On Modeling And Analysis Of Dynamic Sociotechnical Systems, David Rushing Dewhurst Jan 2020

Essays On Modeling And Analysis Of Dynamic Sociotechnical Systems, David Rushing Dewhurst

Graduate College Dissertations and Theses

A sociotechnical system is a collection of humans and algorithms that interact under the partial supervision of a decentralized controller. These systems often display in- tricate dynamics and can be characterized by their unique emergent behavior. In this work, we describe, analyze, and model aspects of three distinct classes of sociotech- nical systems: financial markets, social media platforms, and elections. Though our work is diverse in subject matter content, it is unified though the study of evolution- and adaptation-driven change in social systems and the development of methods used to infer this change.

We first analyze evolutionary financial market microstructure …


Phenotype Extraction: Estimation And Biometrical Genetic Analysis Of Individual Dynamics, Kevin L. Mckee Jan 2020

Phenotype Extraction: Estimation And Biometrical Genetic Analysis Of Individual Dynamics, Kevin L. Mckee

Theses and Dissertations

Within-person data can exhibit a virtually limitless variety of statistical patterns, but it can be difficult to distinguish meaningful features from statistical artifacts. Studies of complex traits have previously used genetic signals like twin-based heritability to distinguish between the two. This dissertation is a collection of studies applying state-space modeling to conceptualize and estimate novel phenotypic constructs for use in psychiatric research and further biometrical genetic analysis. The aims are to: (1) relate control theoretic concepts to health-related phenotypes; (2) design statistical models that formally define those phenotypes; (3) estimate individual phenotypic values from time series data; (4) consider hierarchical …


Time Series Analysis Of Weather Data In South Carolina, Geophrey Odero Oct 2019

Time Series Analysis Of Weather Data In South Carolina, Geophrey Odero

Theses and Dissertations

This thesis discusses time series analysis of weather data in South Carolina for the last fifteen years (January 2003 to December 2017) for Columbia, Greenville and North Myrtle Beach. The first part presents a brief overview of different variables that are used in the analysis. That is, temperature, dew point, humidity and sea level pressure. A short discussion of time series data is also introduced. The second part is about modeling the variables. The models of choice are presented, fitted and model diagnostics is carried out. In the third part, we discuss background on climates of the cities and model …


A First Look At Sublimation Rates In Toss Island Region, Antarctica, Rebecca Baiman, Scott Landolt Jul 2019

A First Look At Sublimation Rates In Toss Island Region, Antarctica, Rebecca Baiman, Scott Landolt

STAR Program Research Presentations

70% of Earth’s fresh water is held in Antarctica ice sheet. If the sheet melts, it has the potential to raise global sea levels by 190 feet (Klekociuk and Wiennecke, 2016). As the climate changes, it is imperative that to understand precipitation systems of Antarctica in order to measure and predict weather around the world. One aspect of precipitation events that we do not understand fully in Antarctica is sublimation. Data was collected from four Ott Pluvio Precipitation Gauges with Belfort Double Alter Shields placed in and around the Ross Ice Shelf from November of 2017 to present. An R …


Feasibility Of Multi-Year Forecast For The Colorado River Water Supply: Time Series Modeling, Brian Plucinski May 2019

Feasibility Of Multi-Year Forecast For The Colorado River Water Supply: Time Series Modeling, Brian Plucinski

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

The Colorado River is one of the largest resources for water in the United States, as well as being an important asset to the economy. Previous studies have shown a connection between the Great Salt Lake and the Colorado River. This study used time series analysis to build models to predict the water supply of the Colorado River ten years out. These models used data from the Colorado River in addition to Great Salt Lake water elevation. Several models suggest a decline in water supply from 2013 – 2020, before starting to increase. These predictions differ from predictions published by …


An Analysis Of Accuracy Using Logistic Regression And Time Series, Edwin Baidoo, Jennifer L. Priestley Mar 2019

An Analysis Of Accuracy Using Logistic Regression And Time Series, Edwin Baidoo, Jennifer L. Priestley

Jennifer L. Priestley

This paper analyzes the accuracy rates for logistic regression and time series models. It also examines a relatively new performance index that takes into consideration the business assumptions of credit markets. Although prior research has focused on evaluation metrics, such as AUC and Gini index, this new measure has a more intuitive interpretation for various managers and decision makers and can be applied to both Logistic and Time Series models.


Forecasting Crashes, Credit Card Default, And Imputation Analysis On Missing Values By The Use Of Neural Networks, Jazmin Quezada Jan 2019

Forecasting Crashes, Credit Card Default, And Imputation Analysis On Missing Values By The Use Of Neural Networks, Jazmin Quezada

Open Access Theses & Dissertations

A neural network is a system of hardware and/or software patterned after the operation of neurons in the human brain. Neural networks,- also called Artificial Neural Networks - are a variety of deep learning technology, which also falls under the umbrella of artificial intelligence, or AI. Recent studies shows that Artificial Neural Network has the highest coefficient of determination (i.e. measure to assess how well a model explains and predicts future outcomes.) in comparison to the K-nearest neighbor classifiers, logistic regression, discriminant analysis, naive Bayesian classifier, and classification trees. In this work, the theoretical description of the neural network methodology …


Trend And Acceleration: A Multi-Model Approach To Key West Sea Level Rise, John Tenenholtz Nov 2017

Trend And Acceleration: A Multi-Model Approach To Key West Sea Level Rise, John Tenenholtz

FIU Electronic Theses and Dissertations

Sea level rise (SLR) varies depending on location. It is therefore important to local residents, businesses and government to analyze SLR locally. Further, because of increasing ice melt and other effects of climate change, rates of SLR may change. It is therefore also important to evaluate rates of change of SLR, which we call sea level acceleration (SLA) or deceleration.

The present thesis will review the annual average sea level data compiled at the Key West tidal gauge in Key West, Florida. We use a multi-model approach that compares the results of various models on that data set. The goal …


Regime Switching In Cointegrated Time Series, Bradley David Zynda Ii Apr 2017

Regime Switching In Cointegrated Time Series, Bradley David Zynda Ii

Undergraduate Honors Capstone Projects

Volatile commodities and markets can often be difficult to model and forecast given significant breaks in trends through time. To account such breaks, regime switching methods allow for models to accommodate abrupt changes in behavior of the data. However, the difficulty often arises in beginning the process of choosing a model and its associated parameters with which to represent the data and the objects of interest. To improve model selection for these volatile markets, this research examines time series with regime switching components and argues that a synthesis of vector error correction models with regime switching models with ameliorate financial …


Analysis Of Break-Points In Financial Time Series, Jean Remy Habimana Dec 2016

Analysis Of Break-Points In Financial Time Series, Jean Remy Habimana

Graduate Theses and Dissertations

A time series is a set of random values collected at equal time intervals; this randomness makes these types of series not easy to predict because the structure of the series may change at any time. As discussed in previous research, the structure of time series may change at any time due to the change in mean and/or variance of the series. Consequently, based on this structure, it is wise not to assume that these series are stationary. This paper, discusses, a method of analyzing time series by considering the entire series non-stationary, assuming there is random change in unconditional …


A Multi-Indexed Logistic Model For Time Series, Xiang Liu Dec 2016

A Multi-Indexed Logistic Model For Time Series, Xiang Liu

Electronic Theses and Dissertations

In this thesis, we explore a multi-indexed logistic regression (MILR) model, with particular emphasis given to its application to time series. MILR includes simple logistic regression (SLR) as a special case, and the hope is that it will in some instances also produce significantly better results. To motivate the development of MILR, we consider its application to the analysis of both simulated sine wave data and stock data. We looked at well-studied SLR and its application in the analysis of time series data. Using a more sophisticated representation of sequential data, we then detail the implementation of MILR. We compare …


An Analysis Of Accuracy Using Logistic Regression And Time Series, Edwin Baidoo, Jennifer L. Priestley Jan 2016

An Analysis Of Accuracy Using Logistic Regression And Time Series, Edwin Baidoo, Jennifer L. Priestley

Published and Grey Literature from PhD Candidates

This paper analyzes the accuracy rates for logistic regression and time series models. It also examines a relatively new performance index that takes into consideration the business assumptions of credit markets. Although prior research has focused on evaluation metrics, such as AUC and Gini index, this new measure has a more intuitive interpretation for various managers and decision makers and can be applied to both Logistic and Time Series models.


Key Factors Driving Personnel Downsizing In Multinational Military Organizations, Ilksen Gorkem, Resit Unal, Pilar Pazos Jan 2015

Key Factors Driving Personnel Downsizing In Multinational Military Organizations, Ilksen Gorkem, Resit Unal, Pilar Pazos

Engineering Management & Systems Engineering Faculty Publications

Although downsizing has long been a topic of research in traditional organizations, there are very few studies of this phenomenon in military contexts. As a result, we have little understanding of the key factors that drive personnel downsizing in military setting. This study contributes to our understanding of key factors that drive personnel downsizing in military organizations and whether those factors may differ across NATO nations’ cultural clusters. The theoretical framework for this study was built from studies in non-military contexts and adapted to fit the military environment.

This research relies on historical data from one of the largest multinational …