Open Access. Powered by Scholars. Published by Universities.®
![Digital Commons Network](http://assets.bepress.com/20200205/img/dcn/DCsunburst.png)
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
City University of New York (CUNY)
Chebyshev inequality; Big Data; Adaptive Sampling; Bootstrap Sampling; Learning Curve
Articles 1 - 1 of 1
Full-Text Articles in Physical Sciences and Mathematics
Intelligent Sampling For Big Data Using Bootstrap Sampling And Chebyshev Inequality, Ashwin Satyanarayana
Intelligent Sampling For Big Data Using Bootstrap Sampling And Chebyshev Inequality, Ashwin Satyanarayana
Publications and Research
The amount of data being generated and stored is growing exponentially, owed in part to the continuing advances in computer technology. These data present tremendous opportunities in data mining, a burgeoning field in computer science that focuses on the development of methods that can extract knowledge from data. In many real world problems, these data mining algorithms have access to massive amounts of data. Mining all the available data is prohibitive due to computational (time and memory) constraints. Much of the current research is concerned with scaling up data mining algorithms (i.e. improving on existing data mining algorithms for larger …