Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

University of Vermont

Theses/Dissertations

Clustering

Full-Text Articles in Physical Sciences and Mathematics

Cluster Analysis Of Time Series Data With Application To Hydrological Events And Serious Illness Conversations, Ali Javed, Jan 2021

Graduate College Dissertations and Theses

Cluster analysis explores the underlying structure of data and organizes it into groups (i.e., clusters) such that observations within the same group are more similar to one another than to those in different groups. Quantifying the "similarity" between observations, choosing the optimal number of clusters, and interpreting the results all require careful consideration of the research question at hand, the model parameters, and the amount of data and their attributes. In this dissertation, the first manuscript explores the impact of design choices and the variability in clustering performance on different datasets. This is demonstrated through a benchmark study consisting of 128 datasets from the University …


A Hybrid Approach To Semantic Hashtag Clustering In Social Media, Ali Javed, Jan 2016

Graduate College Dissertations and Theses

The uncontrolled use of hashtags in social media makes them vary widely in semantic quality and frequency of use. Such variations pose a challenge to current approaches, which capitalize on either the lexical semantics of a hashtag, by using metadata, or its contextual semantics, by using the texts associated with it. This thesis presents a hybrid approach to clustering hashtags based on their semantics, designed in two phases. The first phase is a sense-level, metadata-based semantic clustering algorithm that can differentiate among distinct senses of a hashtag as …