Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Computer Sciences

University of Windsor

2013

Full-Text Articles in Entire DC Network

Bias Correction In Small Sample From Big Data, Jianguo Lu, Dingding Li Jan 2013

Computer Science Publications

This paper discusses the bias problem when estimating the population size of big data, such as online social networks (OSN), using a simple random walk. Unlike the traditional estimation problem, where the sample size is not very small relative to the data size, in big data even a sample that is small relative to the data size is already very large and costly to obtain. When such small samples are used, the bias is no longer negligible. This paper shows analytically that the relative bias can be approximated by the reciprocal of the number of collisions, thereby a bias correction estimator is …
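
The link between collisions and bias can be illustrated with a minimal sketch. It makes a simplifying assumption that is not the setting of the paper: the sample is drawn uniformly at random with replacement, rather than by a simple random walk. Under that assumption, the classic collision (birthday-paradox) estimator is n(n-1)/(2C), where n is the sample size and C is the number of colliding pairs. The function names and the correction factor 1/(1 + 1/C) are illustrative choices that follow from the "relative bias ≈ 1/C" approximation stated in the abstract; they are not taken from the paper, whose estimator is not shown here.

```python
from collections import Counter


def count_collisions(sample):
    """Count colliding pairs: an item seen k times contributes k*(k-1)/2 pairs."""
    counts = Counter(sample)
    return sum(k * (k - 1) // 2 for k in counts.values())


def naive_estimate(sample):
    """Collision-based size estimate n(n-1)/(2C), assuming uniform sampling."""
    n = len(sample)
    c = count_collisions(sample)
    if c == 0:
        raise ValueError("no collisions observed; the estimate is undefined")
    return n * (n - 1) / (2 * c)


def corrected_estimate(sample):
    """Illustrative correction (an assumption, not the paper's estimator).

    If the naive estimate overshoots by a relative bias of about 1/C,
    dividing by (1 + 1/C) removes that first-order term.
    """
    c = count_collisions(sample)
    return naive_estimate(sample) / (1 + 1 / c)


if __name__ == "__main__":
    # Hypothetical sample of node IDs: node 1 appears 3 times, node 2 twice.
    sample = [1, 2, 3, 1, 4, 2, 1]
    print(count_collisions(sample))
    print(naive_estimate(sample))
    print(corrected_estimate(sample))
```

With few collisions the two estimates differ noticeably, and the gap shrinks as C grows, which matches the claim that the bias matters mainly for small samples.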