Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences

Computer Science and Software Engineering

2016

Probability-based semantic similarity and distances

Articles 1 - 1 of 1

Full-Text Articles in Physical Sciences and Mathematics

Creating A Probabilistic Model For Wordnet, Lubomir Stanchev Sep 2016

Creating A Probabilistic Model For Wordnet, Lubomir Stanchev

Computer Science and Software Engineering

We present a probabilistic model for extracting and storing information from WordNet and the British National Corpus. We map the data into a directed probabilistic graph that can be used to compute the conditional probability between a pair of words from the English language. For example, the graph can be used to deduce that there is a 10% probability that someone who is interested in dogs is also interested in the word “canine”. We propose three ways for computing this probability, where the best results are achieved when performing multiple random walks in the graph. Unlike existing approaches that only …