Open Access. Powered by Scholars. Published by Universities.®

Data Science Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 3 of 3

Full-Text Articles in Data Science

A Deep Topical N-Gram Model And Topic Discovery On Covid-19 News And Research Manuscripts, Yuan Du Mar 2021

A Deep Topical N-Gram Model And Topic Discovery On Covid-19 News And Research Manuscripts, Yuan Du

Electronic Thesis and Dissertation Repository

Topic modeling with the latent semantic analysis (LSA), the latent Dirichlet allocation (LDA) and the biterm topic model (BTM) has been successfully implemented and used in many areas, including movie reviews, recommender systems, and text summarization, etc. However, these models may become computationally intensive if tested on a humongous corpus. Considering the wide acceptance of machine learning based on deep neural networks, this research proposes two deep neural network (NN) variants, 2-layer NN and 3-layer NN of the LDA modeling techniques. The primary goal is to deal with problems with a large corpus using manageable computational resources.

This thesis analyze …


Topic Modeling And Cultural Nature Of Citations, Marie Coraline Dumaz Jan 2021

Topic Modeling And Cultural Nature Of Citations, Marie Coraline Dumaz

Graduate Theses, Dissertations, and Problem Reports

Ever since the beginning of research journals, the number of academic publications has been increasing steadily. Nowadays, especially, with the new importance of online open-access journals and databases, research papers are more easily available to read and share. It also becomes harder to keep up with novelties and grasp an idea of the general impact of a given researcher, institution, journal, or field. For this reason, different bibliometric indicators are now routinely used to classify and evaluate the impact or significance of individual researchers, conferences, journals, or entire scientific communities. In this thesis, we provide tools to study trends in …


A Data Exploration Of Jeopardy! From 1984 To The Present, Brian S. Hamilton Sep 2020

A Data Exploration Of Jeopardy! From 1984 To The Present, Brian S. Hamilton

Dissertations, Theses, and Capstone Projects

The gameshow Jeopardy! has been around in its current iteration—hosted by Alex Trebek—since 1984. During this time, it has accumulated data on clues, contestants, and possible strategies on how to win. Using a crowd-sourced archive called J! Archive, this project seeks to find trends in the topics that the game covers and take a deeper look into the performance of its contestants. It employs topic modeling, a text-analysis method, to organize the hundreds of thousands of archived clues and statistical analysis to rate the performance of contestants by gender. Using web-based visualization tools, the data is shown in an …