Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 4 of 4

Full-Text Articles in Entire DC Network

Inside The Selection Box: Visualising Active Learning Selection Strategies, Brian Mac Namee, Rong Hu, Sarah Jane Delany Jan 2010

Inside The Selection Box: Visualising Active Learning Selection Strategies, Brian Mac Namee, Rong Hu, Sarah Jane Delany

Conference papers

Visualisations can be used to provide developers with insights into the inner workings of interactive machine learning techniques. In active learning, an inherently interactive machine learning technique, the design of selection strategies is the key research question and this paper demonstrates how spring model based visualisations can be used to provide insight into the precise operation of various selection strategies. Using sample datasets, this paper provides detailed examples of the differences between a range of selection strategies.


Egal: Exploration Guided Active Learning For Tcbr, Rong Hu, Sarah Jane Delany, Brian Mac Namee Jan 2010

Egal: Exploration Guided Active Learning For Tcbr, Rong Hu, Sarah Jane Delany, Brian Mac Namee

Conference papers

The task of building labelled case bases can be approached using active learning (AL), a process which facilitates the labelling of large collections of examples with minimal manual labelling effort. The main challenge in designing AL systems is the development of a selection strategy to choose the most informative examples to manually label. Typical selection strategies use exploitation techniques which attempt to refine uncertain areas of the decision space based on the output of a classifier. Other approaches tend to balance exploitation with exploration, selecting examples from dense and interesting regions of the domain space. In this paper we present …


Exploring The Frontier Of Uncertainty Space, Rong Hu, Patrick Lindstrom, Sarah Jane Delany, Brian Mac Namee Jan 2010

Exploring The Frontier Of Uncertainty Space, Rong Hu, Patrick Lindstrom, Sarah Jane Delany, Brian Mac Namee

Conference papers

We aim to investigate methods balancing exploitation with exploration in active learning to improve the performance of uncertainty sampling. Two exploration guided sampling methods are compared to uncertainty sampling on various real-life datasets from the 2010 Active Learning Challenge. Our initial experiments seems to indicate that combining exploration with uncertainty sampling improves performance on certain datasets but not all.


Off To A Good Start: Using Clustering To Select The Initial Training Set In Active Learning, Rong Hu, Brian Mac Namee, Sarah Jane Delany Jan 2010

Off To A Good Start: Using Clustering To Select The Initial Training Set In Active Learning, Rong Hu, Brian Mac Namee, Sarah Jane Delany

Conference papers

Active learning (AL) is used in textual classification to alleviate the cost of labelling documents for training. An important issue in AL is the selection of a representative sample of documents to label for the initial training set that seeds the process, and clustering techniques have been successfully used in this regard. However, the clustering techniques used are nondeterministic which causes inconsistent behaviour in the AL process. In this paper we first illustrate the problems associated with using non-deterministic clustering for initial training set selection in AL. We then examine the performance of three deterministic clustering techniques for this task …