Open Access. Powered by Scholars. Published by Universities.®

Computer Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Electronic Theses and Dissertations

2009

Categorical Datasets

Articles 1 - 1 of 1

Full-Text Articles in Computer Engineering

Scalable And Efficient Outlier Detection In Large Distributed Data Sets With Mixed-Type Attributes, Anna Koufakou Jan 2009

Scalable And Efficient Outlier Detection In Large Distributed Data Sets With Mixed-Type Attributes, Anna Koufakou

Electronic Theses and Dissertations

An important problem that appears often when analyzing data involves identifying irregular or abnormal data points called outliers. This problem broadly arises under two scenarios: when outliers are to be removed from the data before analysis, and when useful information or knowledge can be extracted by the outliers themselves. Outlier Detection in the context of the second scenario is a research field that has attracted significant attention in a broad range of useful applications. For example, in credit card transaction data, outliers might indicate potential fraud; in network traffic data, outliers might represent potential intrusion attempts. The basis of deciding …