Open Access. Powered by Scholars. Published by Universities.®

Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 14 of 14

Full-Text Articles in Mathematics

Sentiment Analysis Before And During The Covid-19 Pandemic, Emily Musgrove Jul 2023

Sentiment Analysis Before And During The Covid-19 Pandemic, Emily Musgrove

Mathematics Summer Fellows

This study examines the change in connotative language use before and during the Covid-19 pandemic. By analyzing news articles from several major US newspapers, we found that there is a statistically significant correlation between the sentiment of the text and the publication period. Specifically, we document a large, systematic, and statistically significant decline in the overall sentiment of articles published in major news outlets. While our results do not directly gauge the sentiment of the population, our findings have important implications regarding the social responsibility of journalists and media outlets especially in times of crisis.


Compare And Contrast Maximum Likelihood Method And Inverse Probability Weighting Method In Missing Data Analysis, Scott Sun May 2021

Compare And Contrast Maximum Likelihood Method And Inverse Probability Weighting Method In Missing Data Analysis, Scott Sun

Mathematical Sciences Technical Reports (MSTR)

Data can be lost for different reasons, but sometimes the missingness is a part of the data collection process. Unbiased and efficient estimation of the parameters governing the response mean model requires the missing data to be appropriately addressed. This paper compares and contrasts the Maximum Likelihood and Inverse Probability Weighting estimators in an Outcome-Dependendent Sampling design that deliberately generates incomplete observations. WE demonstrate the comparison through numerical simulations under varied conditions: different coefficient of determination, and whether or not the mean model is misspecified.


We’Re Here To Get You There: A Statistical Analysis Of Bridgewater State University’S Transit System, Abigail Adams May 2021

We’Re Here To Get You There: A Statistical Analysis Of Bridgewater State University’S Transit System, Abigail Adams

Honors Program Theses and Projects

Bridgewater State University first established its on-campus transportation service in January of 1984. While it began only running as an on-campus service for students throughout the day, the service grew to expand by offering an off-campus connection to the neighboring city of Brockton and absorbed the night service system from the campus safety team. As BSU Transit continues to grow, the organization is seeking ways to improve their overall service and better prepare their fleet and driver pool to accommodate this growth. The purpose of this research is to analyze trends among the data collected by BSU Transit and assist …


“Playing The Whole Game”: A Data Collection And Analysis Exercise With Google Calendar, Albert Y. Kim, Johanna Hardin Aug 2020

“Playing The Whole Game”: A Data Collection And Analysis Exercise With Google Calendar, Albert Y. Kim, Johanna Hardin

Statistical and Data Sciences: Faculty Publications

We provide a computational exercise suitable for early introduction in an undergraduate statistics or data science course that allows students to “play the whole game” of data science: performing both data collection and data analysis. While many teaching resources exist for data analysis, such resources are not as abundant for data collection given the inherent difficulty of the task. Our proposed exercise centers around student use of Google Calendar to collect data with the goal of answering the question “How do I spend my time?” On the one hand, the exercise involves answering a question with near universal appeal, but …


9th Annual Postdoctoral Science Symposium, University Of Texas Md Anderson Cancer Center Postdoctoral Association Sep 2019

9th Annual Postdoctoral Science Symposium, University Of Texas Md Anderson Cancer Center Postdoctoral Association

Annual Postdoctoral Science Symposium Abstracts

The mission of the Annual Postdoctoral Science Symposium (APSS) is to provide a platform for talented postdoctoral fellows throughout the Texas Medical Center to present their work to a wider audience. The MD Anderson Postdoctoral Association convened its inaugural Annual Postdoctoral Science Symposium (APSS) on August 4, 2011.

The APSS provides a professional venue for postdoctoral scientists to develop, clarify, and refine their research as a result of formal reviews and critiques of faculty and other postdoctoral scientists. Additionally, attendees discuss current research on a broad range of subjects while promoting academic interactions and enrichment and developing new collaborations.


Quantification Of The Statistical Effects Of Spatiotemporal Processing Of Nontask Fmri Data, M. Muge Karaman, Andrew S. Nencka, Iain P. Bruce, Daniel B. Rowe Nov 2014

Quantification Of The Statistical Effects Of Spatiotemporal Processing Of Nontask Fmri Data, M. Muge Karaman, Andrew S. Nencka, Iain P. Bruce, Daniel B. Rowe

Mathematics, Statistics and Computer Science Faculty Research and Publications

Nontask functional magnetic resonance imaging (fMRI) has become one of the most popular noninvasive areas of brain mapping research for neuroscientists. In nontask fMRI, various sources of “noise” corrupt the measured blood oxygenation level-dependent signal. Many studies have aimed to attenuate the noise in reconstructed voxel measurements through spatial and temporal processing operations. While these solutions make the data more “appealing,” many commonly used processing operations induce artificial correlations in the acquired data. As such, it becomes increasingly more difficult to derive the true underlying covariance structure once the data have been processed. As the goal of nontask fMRI studies …


A Method For Generating Realistic Correlation Matrices, Johanna S. Hardin, Stephan Ramon Garcia, David Golan Jan 2013

A Method For Generating Realistic Correlation Matrices, Johanna S. Hardin, Stephan Ramon Garcia, David Golan

Pomona Faculty Publications and Research

Simulating sample correlation matrices is important in many areas of statistics. Approaches such as generating Gaussian data and finding their sample correlation matrix or generating random uniform $[-1,1]$ deviates as pairwise correlations both have drawbacks. We develop an algorithm for adding noise, in a highly controlled manner, to general correlation matrices. In many instances, our method yields results which are superior to those obtained by simply simulating Gaussian data. Moreover, we demonstrate how our general algorithm can be tailored to a number of different correlation models. Using our results with a few different applications, we show that simulating correlation matrices …


From Unbiased Numerical Estimates To Unbiased Interval Estimates, Baokun Li, Gang Xiang, Vladik Kreinovich, Panagios Moscopoulos Aug 2012

From Unbiased Numerical Estimates To Unbiased Interval Estimates, Baokun Li, Gang Xiang, Vladik Kreinovich, Panagios Moscopoulos

Departmental Technical Reports (CS)

One of the main objectives of statistics is to estimate the parameters of a probability distribution based on a sample taken from this distribution. Of course, since the sample is finite, the estimate X is, in general, different from the actual value x of the corresponding parameter. What we can require is that the corresponding estimate is unbiased, i.e., that the mean value of the difference X - x is equal to 0: E[X] = x. In some problems, unbiased estimates are not possible. We show that in some such problems, it is possible to have interval unbiased estimates, i.e., …


Changes Across 25 Years Of Statistics In Medicine, Johanna S. Hardin Jan 2012

Changes Across 25 Years Of Statistics In Medicine, Johanna S. Hardin

Pomona Faculty Publications and Research

[This piece is a series of interviews with giants in the field of medicine on their views of how statistics is changing medicine. I interviewed the editor of the New England Journal of Medicine, a preeminent doctor/researcher of lung cancer, the director of the LA County Department of Public Health, and a Harvard statistician who sits on the editorial board of the New England Journal of Medicine.]


Medicine, Statistics, And Education: The Inextricable Link, Katharine K. Brieger '11, Johanna S. Hardin Jan 2012

Medicine, Statistics, And Education: The Inextricable Link, Katharine K. Brieger '11, Johanna S. Hardin

Pomona Faculty Publications and Research

No abstract provided.


Incorporating Genome Annotation In The Statistical Analysis Of Genomic And Epigenomic Tiling Array Data, Gayla R. Olbricht Jan 2010

Incorporating Genome Annotation In The Statistical Analysis Of Genomic And Epigenomic Tiling Array Data, Gayla R. Olbricht

Mathematics and Statistics Faculty Research & Creative Works

"A wealth of information and technologies are currently available for the genomewide investigation of many types of biological phenomena. Genomic annotation databases provide information about the DNA sequence of a particular organism and give locations of different types of genomic elements, such as the exons and introns of genes. Microarrays are a powerful type of technology that make use of DNA sequence information to investigate different types of biological phenomena on a genome-wide level. Tiling arrays are a unique type of microarray that provide unbiased, highdensity coverage of a genomic region, making them well suited for many applications, such as …


Analyzing Dna Microarrays With Undergraduate Statisticians, Johanna S. Hardin, Laura Hoopes, Ryan Murphy '06 Jan 2006

Analyzing Dna Microarrays With Undergraduate Statisticians, Johanna S. Hardin, Laura Hoopes, Ryan Murphy '06

Pomona Faculty Publications and Research

With advances in technology, biologists have been saddled with high dimensional data that need modern statistical methodology for analysis. DNA microarrays are able to simultaneously measure thousands of genes (and the activity of those genes) in a single sample. Biologists use microarrays to trace connections between pathways or to identify all genes that respond to a signal. The statistical tools we usually teach our undergraduates are inadequate for analyzing thousands of measurements on tens of samples. The project materials include readings on microarrays as well as computer lab activities. The topics covered include image analysis, filtering and normalization techniques, and …


Heckman's Methodology For Correcting Selectivity Bias : An Application To Road Crash Costs, Margaret Giles Jan 2001

Heckman's Methodology For Correcting Selectivity Bias : An Application To Road Crash Costs, Margaret Giles

Research outputs pre 2011

Aggregate road crash costs are traditionally determined using average costs applied to incidence figures found in Police-notified crash data. Such data only comprise a non-random sample of the true population of road crashes, the bias being due to the existence of crashes that are not notified to the Police. The traditional approach is to label the Police-notified sample as 'non-random' thereby casting a cloud over data analyses using this sample. Heckman however viewed similar problems as 'omitted variables' problems in that the exclusion of some observations in a systematic manner (so-called selectivity bias) has inadvertently introduced the need for an …


Detecting Trends And Patterns In Reliability Data Over Time Using Exponentially Weighted Moving-Averages, Harry F. Martz, Paul H. Kvam Jan 1996

Detecting Trends And Patterns In Reliability Data Over Time Using Exponentially Weighted Moving-Averages, Harry F. Martz, Paul H. Kvam

Department of Math & Statistics Faculty Publications

A simple, easy-to-use graphical method is presented for use in determining if there is any statistically significant trend or pattern over time in an underlying Poisson event rate of occurrence or binomial failure on demand probability. The method is based on the combined use of both an exponentially weighted moving-average (EWMA) and a Shewhart chart. Two nuclear power plant examples are introduced and used to illustrate the method. The false alarm probability and power when using the combined procedure are also determined for both cases using Monte Carlo simulation. The results indicate that the combined procedure is quite effective in …