Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Portland State University

Theses/Dissertations

2014

Big data

Articles 1 - 1 of 1

Full-Text Articles in Engineering

Ranked Similarity Search Of Scientific Datasets: An Information Retrieval Approach, Veronika Margaret Megler Jun 2014

Ranked Similarity Search Of Scientific Datasets: An Information Retrieval Approach, Veronika Margaret Megler

Dissertations and Theses

In the past decade, the amount of scientific data collected and generated by scientists has grown dramatically. This growth has intensified an existing problem: in large archives consisting of datasets stored in many files, formats and locations, how can scientists find data relevant to their research interests? We approach this problem in a new way: by adapting Information Retrieval techniques, developed for searching text documents, into the world of (primarily numeric) scientific data. We propose an approach that uses a blend of automated and curated methods to extract metadata from large repositories of scientific data. We then perform searches over …