Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Series

PDF

Computer Engineering

Privacy

University of Texas at El Paso

Publication Year

Articles 1 - 2 of 2

Full-Text Articles in Engineering

Interval Approach To Preserving Privacy In Statistical Databases: Related Challenges And Algorithms Of Computational Statistics, Luc Longpre, Gang Xiang, Vladik Kreinovich, Eric Freudenthal Mar 2007

Interval Approach To Preserving Privacy In Statistical Databases: Related Challenges And Algorithms Of Computational Statistics, Luc Longpre, Gang Xiang, Vladik Kreinovich, Eric Freudenthal

Departmental Technical Reports (CS)

In many practical situations, it is important to store large amounts of data and to be able to statistically process the data. A large part of the data is confidential, so while we welcome statistical data processing, we do not want to reveal sensitive individual data. If we allow researchers to ask all kinds of statistical queries, this can lead to violation of people's privacy. A sure-proof way to avoid these privacy violations is to store ranges of values (e.g., between 40 and 50 for age) instead of the actual values. This idea solves the privacy problem, but it leads …


Interval Versions Of Statistical Techniques With Applications To Environmental Analysis, Bioinformatics, And Privacy In Statistical Databases, Vladik Kreinovich, Luc Longpre, Scott A. Starks, Gang Xiang, Jan Beck, Raj Kandathi, Asis Nayak, Scott Ferson, Janos Hajagos Jul 2005

Interval Versions Of Statistical Techniques With Applications To Environmental Analysis, Bioinformatics, And Privacy In Statistical Databases, Vladik Kreinovich, Luc Longpre, Scott A. Starks, Gang Xiang, Jan Beck, Raj Kandathi, Asis Nayak, Scott Ferson, Janos Hajagos

Departmental Technical Reports (CS)

In many areas of science and engineering, it is desirable to estimate statistical characteristics (mean, variance, covariance, etc.) under interval uncertainty. For example, we may want to use the measured values x(t) of a pollution level in a lake at different moments of time to estimate the average pollution level; however, we do not know the exact values x(t) -- e.g., if one of the measurement results is 0, this simply means that the actual (unknown) value of x(t) can be anywhere between 0 and the detection limit DL. We must therefore modify the existing statistical algorithms to process such …