Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Data analysis

Discipline
Institution
Publication Year
Publication
Publication Type
File Type

Articles 151 - 173 of 173

Full-Text Articles in Physical Sciences and Mathematics

Reducing False Alarms In Searches For Gravitational Waves From Coalescing Binary Systems, Andres Rodri­Guez Jan 2007

Reducing False Alarms In Searches For Gravitational Waves From Coalescing Binary Systems, Andres Rodri­Guez

LSU Master's Theses

LIGO observatories in Livingston, LA and Hanford, WA may detect gravitational waves emitted from coalescing binary systems composed of two compact objects. In order to detect compact binary coalescence (CBC) events, LIGO searches utilize matched filtering techniques. Matched filtering is the optimal detection strategy for stationary, Gaussian noise, however, LIGO noise is often non-stationary, non-Gaussian. Non-stationary noise result in an excess of false candidate events, commonly known as false alarms. This thesis develops the r2 test to reduce the false alarm rate for LIGO CBC searches. Results of the search for primordial black hole binary systems (where each object …


Faithful Estimation Of Dynamics Parameters From Cpmg Relaxation Dispersion Measurements, Evgueni Kovriguine, James G. Kempf, Michael J. Grey, J. Patrick Loria May 2006

Faithful Estimation Of Dynamics Parameters From Cpmg Relaxation Dispersion Measurements, Evgueni Kovriguine, James G. Kempf, Michael J. Grey, J. Patrick Loria

Chemistry Faculty Research and Publications

This work examines the robustness of fitting of parameters describing conformational exchange (kex, pa/b, and Δω) processes from CPMG relaxation dispersion data. We have analyzed the equations describing conformational exchange processes for the intrinsic inter-dependence of their parameters that leads to the existence of multiple equivalent solutions, which equally satisfy the experimental data. We have used Monte-Carlo simulations and fitting to the synthetic data sets as well as the direct 3-D mapping of the parameter space of kex, pa/b, and Δω to quantitatively assess …


Fisa: Feature-Based Instance Selection For Imbalanced Text Classification, Aixin Sun, Ee Peng Lim, Boualem Benatallah, Mahbub Hassan Apr 2006

Fisa: Feature-Based Instance Selection For Imbalanced Text Classification, Aixin Sun, Ee Peng Lim, Boualem Benatallah, Mahbub Hassan

Research Collection School Of Computing and Information Systems

Support Vector Machines (SVM) classifiers are widely used in text classification tasks and these tasks often involve imbalanced training. In this paper, we specifically address the cases where negative training documents significantly outnumber the positive ones. A generic algorithm known as FISA (Feature-based Instance Selection Algorithm), is proposed to select only a subset of negative training documents for training a SVM classifier. With a smaller carefully selected training set, a SVM classifier can be more efficiently trained while delivering comparable or better classification accuracy. In our experiments on the 20-Newsgroups dataset, using only 35% negative training examples and 60% learning …


Sgpm: Static Group Pattern Mining Using Apriori-Like Sliding Window, John Goh, David Taniar, Ee Peng Lim Apr 2006

Sgpm: Static Group Pattern Mining Using Apriori-Like Sliding Window, John Goh, David Taniar, Ee Peng Lim

Research Collection School Of Computing and Information Systems

Mobile user data mining is a field that focuses on extracting interesting pattern and knowledge out from data generated by mobile users. Group pattern is a type of mobile user data mining method. In group pattern mining, group patterns from a given user movement database is found based on spatio-temporal distances. In this paper, we propose an improvement of efficiency using area method for locating mobile users and using sliding window for static group pattern mining. This reduces the complexity of valid group pattern mining problem. We support the use of static method, which uses areas and sliding windows instead …


Quantifying The Effects Of Correlated Covariates On Variable Importance Estimates From Random Forests, Ryan Vincent Kimes Jan 2006

Quantifying The Effects Of Correlated Covariates On Variable Importance Estimates From Random Forests, Ryan Vincent Kimes

Theses and Dissertations

Recent advances in computing technology have lead to the development of algorithmic modeling techniques. These methods can be used to analyze data which are difficult to analyze using traditional statistical models. This study examined the effectiveness of variable importance estimates from the random forest algorithm in identifying the true predictor among a large number of candidate predictors. A simulation study was conducted using twenty different levels of association among the independent variables and seven different levels of association between the true predictor and the response. We conclude that the random forest method is an effective classification tool when the goals …


Keynote: The Use Of Meta-Heuristic Algorithms For Data Mining, Dr. Beatrize De La Iglesia, A. Reynolds Aug 2005

Keynote: The Use Of Meta-Heuristic Algorithms For Data Mining, Dr. Beatrize De La Iglesia, A. Reynolds

International Conference on Information and Communication Technologies

In this paper we explore the application of powerful optimisers known as metaheuristic algorithms to problems within the data mining domain. We introduce some well-known data mining problems, and show how they can be formulated as optimisation problems. We then review the use of metaheuristics in this context. In particular, we focus on the task of partial classification and show how multi-objective metaheuristics have produced results that are comparable to the best known techniques but more scalable to large databases. We conclude by reinforcing the importance of research on the areas of metaheuristics for optimisation and data mining. The combination …


Optimal Reconstruction Of Magnetopause Structures From Cluster Data, H Hasegawa, B U. Ö Sonnerup, B Klecker, G Paschmann Mar 2005

Optimal Reconstruction Of Magnetopause Structures From Cluster Data, H Hasegawa, B U. Ö Sonnerup, B Klecker, G Paschmann

Dartmouth Scholarship

The Grad-Shafranov (GS) reconstruction tech- nique, a single-spacecraft based data analysis method for recovering approximately two-dimensional (2-D) magneto- hydrostatic plasma/field structures in space, is improved to become a multi-spacecraft technique that produces a single field map by ingesting data from all four Cluster spacecraft into the calculation. The plasma pressure, required for the technique, is measured in high time resolution by only two of the spacecraft, C1 and C3, but, with the help of spacecraft po- tential measurements available from all four spacecraft, the pressure can be estimated at the other spacecraft as well via a relationship, established from C1 …


The Morphology Of Steve, Eugenie C. Scott, Nicholas J. Matzke, Glenn Branch, Steven Mccullagh Jul 2004

The Morphology Of Steve, Eugenie C. Scott, Nicholas J. Matzke, Glenn Branch, Steven Mccullagh

Faculty and Research Publications

This report is part of Project Steve. Project Steve is, among other things, the first scientific analysis of the sex, geographic location, and body size of scientists named Steve. We performed this research for the best of all reasons: we discovered that we had lots of data. No scientist can resist the opportunity to analyze data, regardless of where that data came from or why it was gathered.


New Methods For Exafs Analysis In Structural Genomics, Grant Bunker, N. Dimakis, Gocha Kelashvili Jan 2004

New Methods For Exafs Analysis In Structural Genomics, Grant Bunker, N. Dimakis, Gocha Kelashvili

Physics and Astronomy Faculty Publications and Presentations

Data analysis is one of the remaining bottlenecks in high-throughput EXAFS for structural genomics. Here some recent developments in methodology are described that offer the potential for rapid and automated XAS analysis of metalloproteins.


Mim And Nonlinear Least-Squares Inversions Of Aem Data In Barataria Basin, Louisiana, Melissa Whitten Bryan, Kenneth W. Holladay, Clyde J. Bergeron Jr., Juliette W. Ioup, George E. Ioup Jan 2003

Mim And Nonlinear Least-Squares Inversions Of Aem Data In Barataria Basin, Louisiana, Melissa Whitten Bryan, Kenneth W. Holladay, Clyde J. Bergeron Jr., Juliette W. Ioup, George E. Ioup

Physics Faculty Publications

An airborne electromagnetic survey was performed over the marsh and estuarine waters of the Barataria basin of Louisiana. Two inversion methods were applied to the measured data to calculate layer thicknesses and conductivities: the modified image method (MIM) and a nonlinear least-squares method of inversion using two two-layer forward models and one three-layer forward model, with results generally in good agreement. Uniform horizontal water layers in the near-shore Gulf of Mexico with the fresher (less saline, less conductive) water above the saltier (more saline, more conductive) water can be seen clearly. More complex near-surface layering showing decreasing salinity/conductivity with depth …


Lisa Data Analysis: Doppler Demodulation, Neil J. Cornish, Shane L. Larson Jan 2003

Lisa Data Analysis: Doppler Demodulation, Neil J. Cornish, Shane L. Larson

All Physics Faculty Publications

The orbital motion of the Laser Interferometer Space Antenna (LISA) produces amplitude, phaseand frequency modulation of a gravitational wave signal. The modulations have the effect of spreading a monochromatic gravitational wave signal across a range of frequencies. The modulations encode useful information about the source location and orientation, but they also have the deleteriousaffect of spreading a signal across a wide bandwidth, thereby reducing the strength of the signalrelative to the instrument noise. We describe a simple method for removing the dominant, Doppler,component of the signal modulation. The demodulation reassembles the power from a monochromatic source into a narrow spike, …


Gravitational Radiation Detectability Of Supernova 1987a'S Remnant Fully Matched Filter For Double Resonant Gravitational Detector, Giovanni Santostasi Jan 2003

Gravitational Radiation Detectability Of Supernova 1987a'S Remnant Fully Matched Filter For Double Resonant Gravitational Detector, Giovanni Santostasi

LSU Doctoral Dissertations

Part I There is some observational evidence of the presence of a pulsating light source in the remnant of the supernova (SN) 1987A [1]. This source is considered to be a rotating neutron star. Fourier analysis of the light intensity of this source reveals a main narrow frequency peak and side bands that are understood as a modulation of the main sinusoidal signal. A particular model of the neutron star invokes a precessing object to explain the modulation. From the Fourier spectrum of the source and changes in the frequency value, we can determine important parameters of the spinning neutron …


Estimation Of Depth-Area Relationships Using Radar-Rainfall Data, S. Rocky Durrans, Lesley T. Julian, Michael Yekta Jan 2002

Estimation Of Depth-Area Relationships Using Radar-Rainfall Data, S. Rocky Durrans, Lesley T. Julian, Michael Yekta

United States Department of Commerce: Staff Publications

Depth-area relationships, such as those published by the National Weather Service in TP 40 and the NOAA Atlas 2, enable conversion of point rainfall depths to areal average depths for the same storm duration and recurrence interval. This problem of conversion is most germane to hydrologic analyses for moderate to large drainage basins, where point rainfall depths are not representative of the spatial distribution of a storm event. Historically, depth-area relationships have been developed on the basis of data from dense networks of recording gauges. However, with the ongoing accumulation of radar-rainfall records, radar-rainfall data represent an alternative to gauging …


Emacs Speaks Statistics: A Universal Interface For Statistical Analysis, Anthony Rossini, Martin Maechler, Kurt Hornik, Richard M. Heiberger, Rodney Sparapani Jul 2001

Emacs Speaks Statistics: A Universal Interface For Statistical Analysis, Anthony Rossini, Martin Maechler, Kurt Hornik, Richard M. Heiberger, Rodney Sparapani

UW Biostatistics Working Paper Series

Emacs Speaks Statistics (ESS) is a user interface for developing statistical applications and performing data analysis using any of several common statistical programming languages. ESS falls in the programming tools category of Integrated Development Environments (IDEs), which are approaches for developing and visualizing computer programs. We discuss how it works, the advantages of using it, and extensions for increasing statistical programming efficiency.


The Warps Survey - Iv. The X-Ray Luminosity-Temperature Relation Of High-Redshift Galaxy Clusters, B. W. Fairley, L. R. Jones, C. Scharf, H. Ebeling, E. Perlman, D. Horner, G. Wegner, M. Malkan Jul 2000

The Warps Survey - Iv. The X-Ray Luminosity-Temperature Relation Of High-Redshift Galaxy Clusters, B. W. Fairley, L. R. Jones, C. Scharf, H. Ebeling, E. Perlman, D. Horner, G. Wegner, M. Malkan

Dartmouth Scholarship

We present a measurement of the cluster X-ray luminosity-temperature (L-T) relation out to high redshift (z∼0.8). Combined ROSAT PSPC spectra of 91 galaxy clusters detected in the Wide Angle ROSAT Pointed Survey (WARPS) are simultaneously fitted in redshift and luminosity bins. The resulting temperature and luminosity measurements of these bins, which occupy a region of the high-redshift L-T relation not previously sampled, are compared with existing measurements at low redshift in order to constrain the evolution of the L-T relation. We find the best fit to low-redshift (z<0.2) cluster data, at T …


Automated Classification Of Stellar Spectra. Ii: Two-Dimensional Classification With Neural Networks And Principal Components Analysis, Ted Von Hippel, Coryn A.L. Bailer-Jones, Mike Irwin Oct 1997

Automated Classification Of Stellar Spectra. Ii: Two-Dimensional Classification With Neural Networks And Principal Components Analysis, Ted Von Hippel, Coryn A.L. Bailer-Jones, Mike Irwin

Publications

We investigate the application of neural networks to the automation of MK spec- tral classification. The data set for this project consists of a set of over 5000 optical (3800–5200°A) spectra obtained from objective prism plates from the Michigan Spec- tral Survey. These spectra, along with their two-dimensional MK classifications listed in the Michigan Henry Draper Catalogue, were used to develop supervised neural network classifiers. We show that neural networks can give accurate spectral type classifications (68 = 0.82 subtypes, rms= 1.09 subtypes) across the full range of spectral types present in the data set (B2–M7). We show also that …


Fast Discrete Polynomial Transforms With Applications To Data Analysis For Distance Transitive Graphs, J. R. Driscoll, D. M. Healy, D. N. Rockmore Aug 1997

Fast Discrete Polynomial Transforms With Applications To Data Analysis For Distance Transitive Graphs, J. R. Driscoll, D. M. Healy, D. N. Rockmore

Dartmouth Scholarship

Let $\poly = \{P_0,\dots,P_{n-1}\}$ denote a set of polynomials with complex coefficients. Let $\pts = \{z_0,\dots,z_{n-1}\}\subset \cplx$ denote any set of {\it sample points}. For any $f = (f_0,\dots,f_{n-1}) \in \cplx^n$, the {\it discrete polynomial transform} of f (with respect to $\poly$ and $\pts$) is defined as the collection of sums, $\{\fhat(P_0),\dots,\fhat(P_{n-1})\}$, where $\fhat(P_j) = \langle f,P_j \rangle = \sum_{i=0}^{n-1} f_iP_j(z_i)w(i)$ for some associated weight function w. These sorts of transforms find important applications in areas such as medical imaging and signal processing.

In this paper, we present fast algorithms for computing discrete orthogonal polynomial transforms. For a system …


Presentation Of Verified Algal Taxa As Reference Sources - Phase Ii, Richard L. Meyer Jun 1990

Presentation Of Verified Algal Taxa As Reference Sources - Phase Ii, Richard L. Meyer

Technical Reports

The focus of this research project was to continue the development of a photographic system which would record living organisms using various forms of light microscopy with correct color and with arrested movement. These demands dictate the use of an electronic flash source with metering and control system located in a position following the passage of the light through the optical train. The system developed uses off-the-shelf components with a modified flashtube holder which positions the tube in the axis of the light beam between the field and iris diaphragm. The light is measured off-the-film so that light from the …


Diminishing Views: Air Quality In Western National Parks, Christine L. Shaver Nov 1989

Diminishing Views: Air Quality In Western National Parks, Christine L. Shaver

Air Quality Protection in the West (November 27-28)

17 pages.

Contains references.


Presentation Of Verified Algal Taxa As Reference Sources, Richard L. Meyer Jun 1989

Presentation Of Verified Algal Taxa As Reference Sources, Richard L. Meyer

Technical Reports

A data base of the algae of Arkansas ecoregions has been established to describe the numerous taxa that occur within the aquatic ecosystems included in these regions. The organisms were identified with the aid of diverse literature from throughout the world. These sources are written in multiple languages and the living organisms had to be compared with outline or silhouette drawings. These illustrations may include shading, but none present the true color of the organism but only the characteristics of the descriptive source. Primary characteristics used to identify algae is based upon pigmentation of the plastid and the number and …


Molecular Dynamics In Hydrogen‐Bonded Interactions: A Preliminary Experimentally Determined Harmonic Stretching Force Field For Hcn‐‐‐Hf, B. A. Wofford, Shannon Lieb, J. W. Bevan Jan 1987

Molecular Dynamics In Hydrogen‐Bonded Interactions: A Preliminary Experimentally Determined Harmonic Stretching Force Field For Hcn‐‐‐Hf, B. A. Wofford, Shannon Lieb, J. W. Bevan

Scholarship and Professional Work - LAS

Observation of the 2ν1 overtone band in the hydrogen‐bonded complex HCN‐‐‐HF permits evaluation of the anharmonicity constant X 1 1=−116.9(1) cm 1 and determination of the anharmonicity corrected fundamental frequency ω1. This information, and available data from previous rovibrational analyses in the common and perdeuterated isotopic species of HCN‐‐‐HF, offer an opportunity for calculation of an approximate stretching harmonic force field. With the assumptions f 1 2=f 2 4=0.0, the remaining force constants (in mdyn/Å) are evaluated as: f 1 1=8.600(20), f 2 2=6.228(9), f 3 3=19.115(40), f 4 …


Data Analysis Using Experimental Design Model Factorial Analysis Of Variance/Covariance (Dmaovc.Bas), Wesley E. Newton May 1985

Data Analysis Using Experimental Design Model Factorial Analysis Of Variance/Covariance (Dmaovc.Bas), Wesley E. Newton

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

DMAOVC.BAS is a computer program written in the compiler version of microsoft basic which performs factorial analysis of variance/covariance with expected mean squares. The program accommodates factorial and other hierarchical experimental designs with balanced sets of data. The program is writ ten for use on most modest sized microprocessors, in which the compiler is available. The program is parameter file driven where the parameter file consists of the response variable structure, the experimental design model expressed in a similar structure as seen in most textbooks, information concerning the factors (i.e. fixed or random, and the number of levels), and necessary …


Research For The Development Of Guidelines For Conducting And Analyzing An Environmental Water Quality Study To Determine Statistically Meaningful Results, Melvin D. Springer Mar 1976

Research For The Development Of Guidelines For Conducting And Analyzing An Environmental Water Quality Study To Determine Statistically Meaningful Results, Melvin D. Springer

Technical Reports

This report presents and discusses the basic statistical models and methods which are useful to researchers in the field of water resources research, as well as in other fields. These models and methods are presented from the standpoint of type (parametric and nonparametric - or distribution free) and purpose (e.g., simultaneous comparison of several means, comparison of two or more variances, establishment of a difference between two means with a specified confidence, etc.). The material is presented with emphasis primarily upon methodology, including the necessary assumptions upon which each model is based. No derivations or proofs are given, since these …