Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Electrical Engineering and Computer Science - All Scholarship

Classification

Publication Year

Articles 1 - 2 of 2

Full-Text Articles in Computer Sciences

Building Decision Tree Classifier On Private Data, Wenliang Du, Zhijun Zhan Jan 2002

Building Decision Tree Classifier On Private Data, Wenliang Du, Zhijun Zhan

Electrical Engineering and Computer Science - All Scholarship

This paper studies how to build a decision tree classifier under the following scenario: a database is vertically partitioned into two pieces, with one piece owned by Alice and the other piece owned by Bob. Alice and Bob want to build a decision tree classifier based on such a database, but due to the privacy constraints, neither of them wants to disclose their private pieces to the other party or to any third party. We present a protocol that allows Alice and Bob to conduct such a classifier building without having to compromise their privacy. Our protocol uses an untrusted …


Clouds: A Decision Tree Classifier For Large Datasets, Khaled Alsabti, Sanjay Ranka, Vineet Singh Jan 1998

Clouds: A Decision Tree Classifier For Large Datasets, Khaled Alsabti, Sanjay Ranka, Vineet Singh

Electrical Engineering and Computer Science - All Scholarship

Classification for very large datasets has many practical applications in data mining. Techniques such as discretization and dataset sampling can be used to scale up decision tree classifiers to large datasets. Unfortunately, both of these techniques can cause a significant loss in accuracy. We present a novel decision tree classifier called CLOUDS, which samples the splitting points for numeric attributes followed by an estimation step to narrow the search space of the best split. CLOUDS reduces computation and I/O complexity substantially compared to state of the art classifiers, while maintaining the quality of the generated trees in terms of accuracy …