Open Access. Powered by Scholars. Published by Universities.®

Medicine and Health Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Epidemiology

PDF

Georgia Southern University

Department of Biostatistics, Epidemiology, and Environmental Health Sciences Faculty Publications

2021

Relative risk

Articles 1 - 1 of 1

Full-Text Articles in Medicine and Health Sciences

Aggregating Twitter Text Through Generalized Linear Regression Models For Tweet Popularity Prediction And Automatic Topic Classification, Chen Mo, Jingjing Yin, Isaac Chun-Hai Fung, Zion Tse Nov 2021

Aggregating Twitter Text Through Generalized Linear Regression Models For Tweet Popularity Prediction And Automatic Topic Classification, Chen Mo, Jingjing Yin, Isaac Chun-Hai Fung, Zion Tse

Department of Biostatistics, Epidemiology, and Environmental Health Sciences Faculty Publications

Social media platforms have become accessible resources for health data analysis. However, the advanced computational techniques involved in big data text mining and analysis are challenging for public health data analysts to apply. This study proposes and explores the feasibility of a novel yet straightforward method by regressing the outcome of interest on the aggregated influence scores for association and/or classification analyses based on generalized linear models. The method reduces the document term matrix by transforming text data into a continuous summary score, thereby reducing the data dimension substantially and easing the data sparsity issue of the term matrix. To …