Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Computer Sciences

Wayne State University Dissertations

Theses/Dissertations

2012

Databases, Machine Learning, Record Linkage, Semantic Similarity, Similarity Join

Full-Text Articles in Entire DC Network

A New Semantic Similarity Join Method Using Diffusion Maps And Long String Table Attributes, Bilal Hani Hawashin Jan 2012

Wayne State University Dissertations

With the rapid increase in distributed data sources, information integration requires combining the information that refers to the same entity across different sources. However, no global conventions control the format of the data, and imposing such conventions is impractical. The data may also contain spelling errors, since in most cases it is entered manually. For these reasons, integrating the data requires finding and joining similar records rather than only exactly matching ones. Most of …