Other Statistics and Probability Commons™

All Articles in Other Statistics and Probability

119 full-text articles. Page 1 of 5.

Probabilistic Methods In Information Theory, 2016 Cal State University-San Bernardino

Probabilistic Methods In Information Theory, Erik W. Pachas

Electronic Theses, Projects, and Dissertations

Given a probability space, we analyze the uncertainty, that is, the amount of information of a finite system, by studying the entropy of the system. We also extend the concept of entropy to a dynamical system by introducing a measure preserving transformation on a probability space. After showing some theorems and applications of entropy theory, we study the concept of ergodicity, which helps us to further analyze the information of the system.
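As a rough illustration of the entropy of a finite system mentioned in this abstract, the sketch below computes the Shannon entropy H = -Σ p_i log2(p_i) of a discrete distribution. The function name `shannon_entropy` and the example distributions are assumptions made for illustration; they are not taken from the thesis.

```python
import math

def shannon_entropy(probs):
    """Shannon entropy, in bits, of a finite probability distribution."""
    if any(p < 0 for p in probs):
        raise ValueError("probabilities must be non-negative")
    if not math.isclose(sum(probs), 1.0, abs_tol=1e-9):
        raise ValueError("probabilities must sum to 1")
    # Outcomes with p == 0 are skipped: p * log2(p) tends to 0 as p -> 0.
    return sum(-p * math.log2(p) for p in probs if p > 0)

# Hypothetical two-outcome systems with decreasing uncertainty.
fair_coin = shannon_entropy([0.5, 0.5])
biased_coin = shannon_entropy([0.9, 0.1])
certain = shannon_entropy([1.0, 0.0])
```

The fair coin is the most uncertain of the three systems, and a system whose outcome is certain carries no information.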

Passive Visual Analytics Of Social Media Data For Detection Of Unusual Events, 2016 Purdue University

Passive Visual Analytics Of Social Media Data For Detection Of Unusual Events, Kush Rustagi, Junghoon Chae

The Summer Undergraduate Research Fellowship (SURF) Symposium

Now that social media sites have gained substantial traction, huge amounts of un-analyzed valuable data are being generated. Posts containing images and text have spatiotemporal data attached as well, having immense value for increasing situational awareness of local events, providing insights for investigations and understanding the extent of incidents, their severity, and consequences, as well as their time-evolving nature. However, the large volume of unstructured social media data hinders exploration and examination. To analyze such social media data, the S.M.A.R.T system provides the analyst with an interactive visual spatiotemporal analysis and spatial decision support environment that ...

Well I'll Be Damned - Insights Into Predictive Value Of Pedigree Information In Horse Racing, 2016 University of Southampton

Well I'll Be Damned - Insights Into Predictive Value Of Pedigree Information In Horse Racing, Timothy Baker, Ming-Chien Sung, Johnnie Johnson, Tiejun Ma

International Conference on Gambling and Risk Taking

Fundamental form characteristics, such as how fast a horse ran at its last start, are widely used to help predict the outcome of horse racing events. The exception is races in which the horses have not previously competed, such as Maiden races, where there is little or no publicly available past performance information. In these events bettors need only consider a simplified suite of factors; however, this is offset by a higher level of uncertainty. This paper examines the inherent information content embedded within a horse’s ancestry and the extent to which this information is discounted in the United ...

2016 East Tennessee State University

Takens Theorem With Singular Spectrum Analysis Applied To Noisy Time Series, Thomas K. Torku

Electronic Theses and Dissertations

The evolution of big data has led to financial time series becoming increasingly complex, noisy, non-stationary and nonlinear. Takens theorem can be used to analyze and forecast nonlinear time series, but even small amounts of noise can hopelessly corrupt a Takens approach. In contrast, Singular Spectrum Analysis (SSA) is an excellent tool for both forecasting and noise reduction. Fortunately, it is possible to combine the Takens approach with SSA, and in fact, estimation of key parameters in Takens theorem is performed with SSA. In this thesis, we combine the denoising abilities of SSA with the Takens ...
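The Takens approach rests on delay-coordinate embedding: a scalar series is turned into vectors of lagged values. The sketch below shows only that construction, not the thesis's method. The function name `delay_embed`, the parameter names `dim` and `tau`, and the toy series are assumptions for illustration.

```python
def delay_embed(series, dim, tau):
    """Return delay vectors [x[t], x[t + tau], ..., x[t + (dim - 1) * tau]]."""
    if dim < 1 or tau < 1:
        raise ValueError("dim and tau must be positive integers")
    n_vectors = len(series) - (dim - 1) * tau
    if n_vectors < 1:
        raise ValueError("series is too short for this dim and tau")
    # One vector per admissible start index t.
    return [[series[t + k * tau] for k in range(dim)] for t in range(n_vectors)]

# Toy series, embedded in 3 dimensions with a delay of 2 samples.
vectors = delay_embed([0, 1, 2, 3, 4, 5], dim=3, tau=2)
```

With `tau=1` the same construction yields the lagged vectors that make up the trajectory matrix used in SSA, which is one reason the two techniques combine naturally.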

Flesch-Kincaid Reading Grade Level Re-Examined: Creating A Uniform Method For Calculating Readability On A Certification Exam, 2016 Southern Illinois University Carbondale

Flesch-Kincaid Reading Grade Level Re-Examined: Creating A Uniform Method For Calculating Readability On A Certification Exam, Emily Neuhoff, Kristiana M. Feeser, Kayla Sutherland, Thomas Hovatter

Online Journal for Workforce Education and Development

Objective: This study attempted to establish a consistent measurement technique of the readability of a state-wide Certified Nursing Assistant’s (CNA) certification exam. Background: Monitoring the readability level of an exam helps ensure all test versions do not exceed the maximum reading level of the exam, and that knowledge of the subject matter, rather than reading ability, is being assessed. Method: A two-part approach was used to specify and evaluate readability. First, two methods (Microsoft Word® (MSW) software and published readability formulae) were used to calculate Flesch Reading Ease (FRE) and Flesch-Kincaid Reading Grade Level (FKRGL) for multiple ...
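For reference, the two scores named in this abstract can be computed from word, sentence, and syllable counts using the standard published formulae. The sketch below assumes the counts are already available; how words, sentences, and syllables are counted is exactly where tools can disagree, and the study's own counting procedure is not shown here. The function names and the sample counts are hypothetical.

```python
def flesch_reading_ease(words, sentences, syllables):
    """Flesch Reading Ease: higher scores indicate easier text."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)

def flesch_kincaid_grade(words, sentences, syllables):
    """Flesch-Kincaid Reading Grade Level: approximate U.S. school grade."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59

# Hypothetical passage: 100 words, 5 sentences, 150 syllables.
fre = flesch_reading_ease(words=100, sentences=5, syllables=150)
fkgl = flesch_kincaid_grade(words=100, sentences=5, syllables=150)
```

Both formulae depend only on the average sentence length and the average syllables per word, so two tools that count syllables differently will report different grades for the same passage.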

2016 University of Connecticut

Spatiotemporal Meta-Analysis: Reviewing Health Psychology Phenomena Over Space And Time., Blair T. Johnson

CHIP Documents

Johnson, B. T., Crowley, E., & Marrouch, N. Spatiotemporal meta-analysis: Reviewing health psychology phenomena over space and time. Health Psychology Review.

This record is a database of GDP per capita for the world's nations between 1800 and 2015. It is archived here to support an online supplement to the article cited above.

Species Discovery And Diversity In Lobocriconema (Criconematidae: Nematoda) And Related Plant-Parasitic Nematodes From North American Ecoregions, 2016 University of Nebraska-Lincoln

Species Discovery And Diversity In Lobocriconema (Criconematidae: Nematoda) And Related Plant-Parasitic Nematodes From North American Ecoregions, Tom Powers, Ernest C. Bernard, T. Harris, Robert Higgins, M. Olson, S. Olson, M. Lodema, Julianne N. Matczyszyn, P. Mullin, L. Sutton, K.S. Powers

Faculty Publications, Department of Statistics

There are many nematode species that, following formal description, are seldom mentioned again in the scientific literature. Lobocriconema thornei and L. incrassatum are two such species, described from North American forests, respectively 37 and 49 years ago. In the course of a 3-year nematode biodiversity survey of North American ecoregions, specimens resembling Lobocriconema species appeared in soil samples from both grassland and forested sites. Using a combination of molecular and morphological analyses, together with a set of species delimitation approaches, we have expanded the known range of these species, added to the species descriptions, and discovered a related group of ...

How Often Are Antibiotic-Resistant Bacteria Said To “Evolve” In The News?, 2016 University of California Los Angeles

How Often Are Antibiotic-Resistant Bacteria Said To “Evolve” In The News?, Nina Singh, Matthew T. Sit, Deanna M. Chung, Ana A. Lopez, Ranil Weerackoon, Pamela J. Yeh

Faculty Publications, Department of Statistics

Media plays an important role in informing the general public about scientific ideas. We examine whether the word “evolve,” sometimes considered controversial by the general public, is frequently used in the popular press. Specifically, we ask how often articles discussing antibiotic resistance use the word “evolve” (or its lexemes) as opposed to alternative terms such as “emerge” or “develop.” We chose the topic of antibiotic resistance because it is a medically important issue; bacterial evolution is a central player in human morbidity and mortality. We focused on the most widely-distributed newspapers written in English in the United States, United Kingdom ...

Systematic Evaluation Of The Impact Of Chip-Seq Read Designs On Genome Coverage, Peak Identification, And Allele-Specific Binding Detection, 2016 University of Nebraska-Lincoln

Systematic Evaluation Of The Impact Of Chip-Seq Read Designs On Genome Coverage, Peak Identification, And Allele-Specific Binding Detection, Qi Zhang, Xin Zeng, Sam Younkin, Trupti Kawli, Michael P. Snyder, Sündüz Keleş

Faculty Publications, Department of Statistics

Background: Chromatin immunoprecipitation followed by sequencing (ChIP-seq) experiments revolutionized genome-wide profiling of transcription factors and histone modifications. Although maturing sequencing technologies allow these experiments to be carried out with short (36–50 bps), long (75–100 bps), single-end, or paired-end reads, the impact of these read parameters on the downstream data analysis is not well understood. In this paper, we evaluate the effects of different read parameters on genome sequence alignment, coverage of different classes of genomic features, peak identification, and allele-specific binding detection.

Results: We generated 101 bps paired-end ChIP-seq data for many transcription factors from human GM12878 and ...

2016 Iowa State University

Keeping Up With Which Joneses: Spatial Diffusion Of Rule Of Law Through Economic International Organizations, Olga Chyzh

Political Science Publications

I develop a theory of spatial diffusion of the rule of law, where “space” is conceptualized as shared memberships in economic international organizations (IOs). I argue that the rule of law diffuses as a result of economic competition and socialization. Outside evaluators, such as international leaders, activists, and most importantly, international firms and investors, often assess states’ attractiveness as a business venue by comparing them to similar states. The natural reference group for such comparisons is not just geographical neighbors, but also states with shared memberships in economic IOs. Responding to this evaluation, states identify members of their own reference ...

2016 Iowa State University

Dangerous Liaisons: An Endogenous Model Of International Trade And Human Rights, Olga Chyzh

Political Science Publications

This article applies recent advances in network analysis to highlight a central tension faced by policymakers – balancing the benefits of engaging with the international system and the associated domestic policy costs. International trade rewards certain domestic practices, such as respect for human rights. Enforcing such practices, however, is politically costly and sometimes prohibitive to state leaders who rely on political repression to stay in power. In such cases, domestic elites often resort to an alternative strategy of securing the benefits of international trade – setting up indirect trade channels through intermediary states. These competing incentives are modeled within a single framework ...

Enscat: Clustering Of Categorical Data Via Ensembling, 2016 University of Nebraska-Lincoln

Enscat: Clustering Of Categorical Data Via Ensembling, Bertrand S. Clarke, Saeid Amiri, Jennifer Clarke

Faculty Publications, Department of Statistics

Background: Clustering is a widely used collection of unsupervised learning techniques for identifying natural classes within a data set. It is often used in bioinformatics to infer population substructure. Genomic data are often categorical and high dimensional, e.g., long sequences of nucleotides. This makes inference challenging: The distance metric is often not well-defined on categorical data; running time for computations using high dimensional data can be considerable; and the Curse of Dimensionality often impedes the interpretation of the results. Up to the present, however, the literature and software addressing clustering for categorical data has not yet led to a ...

Addressing Endogeneity In Actor-Specific Network Measures, 2016 University of Iowa

Addressing Endogeneity In Actor-Specific Network Measures, Frederick J. Boehmke, Olga Chyzh, Cameron G. Thies

Political Science Publications

The study of international relations (IR), and political science more broadly, has derived great benefits from the recent growth of conceptualizing and modeling political phenomena within their broader network contexts. More than just a novel approach to evaluating the old puzzles, network analysis provides a whole new way of theoretical thinking. Challenging the traditional dyad-driven approach to the study of IR, networks highlight actor interdependence that goes beyond dyads and emphasize that many traditional IR variables, such as conflict, trade, alliances, or international organization memberships, must be treated and studied as networks. Properties of these networks (e.g., polarization, density ...

A Nonlinear Filter For Markov Chains And Its Effect On Diffusion Maps, 2015 Yale University

A Nonlinear Filter For Markov Chains And Its Effect On Diffusion Maps, Stefan Steinerberger

Yale Day of Data

Diffusion maps are a modern mathematical tool that helps to find structure in large data sets. We present a new filtering technique, based on the assumption that errors in the data are intrinsically random, that isolates and filters errors and thus boosts the efficiency of diffusion maps. Applications include data sets from medicine (the Cleveland Heart Disease Data set and the Wisconsin Breast Cancer Data set) and engineering (the Ionosphere data set).

2015 The University of Western Ontario

Completely Monotone And Bernstein Functions With Convexity Properties On Their Measures, Shen Shan

Electronic Thesis and Dissertation Repository

The concepts of completely monotone and Bernstein functions were introduced nearly one hundred years ago. They find wide applications in areas ranging from stochastic Lévy processes and complex analysis to monotone operator theory. They have well-known Bernstein and Lévy-Khintchine integral representations through which there are one-to-one correspondences between them and Radon measures on $[0,\infty)$ or $(0,\infty)$, respectively. In this thesis, we investigate subclasses of completely monotone and Bernstein functions with various convexity properties on their measures. These subclasses have intriguing applications in probability theory and convex analysis.

The convexity properties we investigate include ...

On The Dual Risk Models, 2015 The University of Western Ontario

On The Dual Risk Models, Chen Yang

Electronic Thesis and Dissertation Repository

This thesis focuses on developing and computing ruin-related quantities that are potential measurements for the dual risk models, which were proposed in the 1950s to describe annuity-type businesses from the perspective of collective risk theory. In recent years, the dual risk models have been revisited by many researchers to quantify the risk of businesses similar to annuity-type businesses. The major extensions included in this thesis consist of two aspects: the first is to search for new ruin-related quantities that are potential indices of the risk for well-established dual models; the other aspect is to generalize the ...

2015 California Polytechnic State University, San Luis Obispo

A Study Of The Parametric And Nonparametric Linear-Circular Correlation Coefficient, Robin Tu

Statistics

Circular statistics are specialized statistical methods that deal specifically with directional data. Angular data require specialized techniques due to the modulo 2π (in radians) or modulo 360 (in degrees) nature of angles.

Correlation, typically in terms of Pearson’s correlation coefficient, is a measure of association between two linear random variables x and y. In this paper, the specific circular technique of the parametric and nonparametric linear-circular correlation coefficient will be explored where correlation is no longer between two linear variables x and y, but between a linear random variable x and circular random variable θ.

A simulation ...
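One commonly used parametric form of the linear-circular correlation, often attributed to Mardia, is R² = (r_xc² + r_xs² − 2·r_xc·r_xs·r_cs) / (1 − r_cs²), where r_xc = corr(x, cos θ), r_xs = corr(x, sin θ), and r_cs = corr(cos θ, sin θ). The abstract does not state which formula the paper uses, so the sketch below is an assumption for illustration only; the function names and the sample data are hypothetical.

```python
import math

def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((u - mean_a) * (v - mean_b) for u, v in zip(a, b))
    var_a = sum((u - mean_a) ** 2 for u in a)
    var_b = sum((v - mean_b) ** 2 for v in b)
    return cov / math.sqrt(var_a * var_b)

def linear_circular_r2(x, theta):
    """Squared linear-circular correlation between x and angles theta (radians)."""
    c = [math.cos(t) for t in theta]
    s = [math.sin(t) for t in theta]
    r_xc, r_xs, r_cs = pearson(x, c), pearson(x, s), pearson(c, s)
    return (r_xc**2 + r_xs**2 - 2 * r_xc * r_xs * r_cs) / (1 - r_cs**2)

# Hypothetical data: eight evenly spaced angles and a linear variable that
# is an exact sinusoidal function of the angle with an arbitrary phase shift.
theta = [2 * math.pi * k / 8 for k in range(8)]
x = [2.0 + 3.0 * math.cos(t - 1.0) for t in theta]
r2 = linear_circular_r2(x, theta)

# Shifting every angle by a full turn leaves the coefficient unchanged,
# reflecting the modulo 2π nature of angles.
r2_wrapped = linear_circular_r2(x, [t + 2 * math.pi for t in theta])
```

Because the angle enters only through its sine and cosine, the coefficient does not depend on where the zero direction is placed, which a plain Pearson correlation between x and θ would not guarantee.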

2015 California Polytechnic State University - San Luis Obispo

#Twittercritic: Sentiment Analysis Of Tweets To Predict TV Ratings, Isabel Litton

Statistics

Twitter has rapidly become one of the most popular sites on the Internet. It functions not just as a microblogging service, but as a crowdsourcing tool for listening, promotion, insight and much more. From the perspective of TV networks, tweets capture the real-time reactions of viewers, making them an ideal indicator of a show’s ratings. This paper predicts Internet Movie Database (IMDB) television ratings by text mining Twitter data.

Tweets for five television shows were downloaded over a period of several months utilizing a SAS macro. Television show data, such as rating, show title, episode title, and more ...

2015 California State University - San Bernardino

The Relationship Between Self-Determination And Client Outcomes Among The Homeless, Samuel M. Hanna

Electronic Theses, Projects, and Dissertations

This paper attempts to determine whether there is a significant relationship between self-determination and client outcomes among the homeless. The study is based on the conceptual framework set forth in Self-Determination Theory. Using a data collection instrument based on empirically validated instrumentation, clients from several homeless service providers in the City of San Bernardino were assessed for the level of self-determination and autonomy support they experience within these agencies. Outcome measures included such things as whether the client was going ...

Geographical Analysis Of Hub City Transit, 2015 University of Southern Mississippi

Geographical Analysis Of Hub City Transit, Joshua Adam Watts

Master's Theses

This study assesses Hub City Transit, the public bus system of Hattiesburg, MS. Statistical analysis is used to determine how well the transit system serves low-income areas of the city. A 0.5 mile buffer was applied to the bus routes to determine the coverage of the transit system. Areas of disorder along the routes were also assessed to analyze the landscape the routes pass through. Lastly, an analysis of ridership on each route was performed to determine the most heavily used areas, as well as to assess where riders are going on each route.

The findings show that Hub ...