Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

LSU Doctoral Dissertations

2014

Computer Sciences

Top-k

Articles 1 - 2 of 2

Full-Text Articles in Physical Sciences and Mathematics

Space-Efficient Data Structures For String Searching And Retrieval, Sharma Valliyil Thankachan Jan 2014

Space-Efficient Data Structures For String Searching And Retrieval, Sharma Valliyil Thankachan

LSU Doctoral Dissertations

Let D = {d_1, d_2, ...} be a collection of string documents of n characters in total, which are drawn from an alphabet set Sigma =[sigma] ={1,2,3,...sigma}. The top-k document retrieval problem is to maintain D as a data structure, such that when ever a query Q=(P, k) comes, we can report (the identifiers of) those k documents that are most relevant to the pattern P (of p characters). The relevance of a document d_r with respect to a pattern P is captured by score(P, d_r), which can be any function of the set of locations where P occurs in …


Efficient Indexing For Structured And Unstructured Data, Manish Madhukar Patil Jan 2014

Efficient Indexing For Structured And Unstructured Data, Manish Madhukar Patil

LSU Doctoral Dissertations

The collection of digital data is growing at an exponential rate. Data originates from wide range of data sources such as text feeds, biological sequencers, internet traffic over routers, through sensors and many other sources. To mine intelligent information from these sources, users have to query the data. Indexing techniques aim to reduce the query time by preprocessing the data. Diversity of data sources in real world makes it imperative to develop application specific indexing solutions based on the data to be queried. Data can be structured i.e., relational tables or unstructured i.e., free text. Moreover, increasingly many applications need …