Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences

PDF

City University of New York (CUNY)

Series

2014

Chebyshev inequality; Big Data; Adaptive Sampling; Bootstrap Sampling; Learning Curve

Articles 1 - 1 of 1

Full-Text Articles in Physical Sciences and Mathematics

Intelligent Sampling For Big Data Using Bootstrap Sampling And Chebyshev Inequality, Ashwin Satyanarayana May 2014

Intelligent Sampling For Big Data Using Bootstrap Sampling And Chebyshev Inequality, Ashwin Satyanarayana

Publications and Research

The amount of data being generated and stored is growing exponentially, owed in part to the continuing advances in computer technology. These data present tremendous opportunities in data mining, a burgeoning field in computer science that focuses on the development of methods that can extract knowledge from data. In many real world problems, these data mining algorithms have access to massive amounts of data. Mining all the available data is prohibitive due to computational (time and memory) constraints. Much of the current research is concerned with scaling up data mining algorithms (i.e. improving on existing data mining algorithms for larger …