Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Keyword
-
- Database management (3)
- Big data (1)
- Cloud computing (1)
- Data encryption (Computer science) (1)
- Data fusion (1)
-
- Database searching (1)
- Distributed databases (1)
- Distributed representation (1)
- Event processing (Computer science) (1)
- Hybrid systems -- Design and construction (1)
- Hybrid systems -- Evaluation (1)
- Image processing (1)
- Indoor Assisted Navigation Database (1)
- Information retrieval (1)
- Natural language processing (1)
- Querying (Computer science) (1)
- Remote sensing (1)
- Sentence completion (1)
Articles 1 - 9 of 9
Full-Text Articles in Physical Sciences and Mathematics
Balancing Security, Performance And Deployability In Encrypted Search, David Joel Pouliot
Balancing Security, Performance And Deployability In Encrypted Search, David Joel Pouliot
Dissertations and Theses
Encryption is an important tool for protecting data, especially data stored in the cloud. However, standard encryption techniques prevent efficient search. Searchable encryption attempts to solve this issue, protecting the data while still providing search functionality. Retaining the ability to search comes at a cost of security, performance and/or utility.
An important practical aspect of utility is compatibility with legacy systems. Unfortunately, the efficient searchable encryption constructions that are compatible with these systems have been proven vulnerable to attack, even against weaker adversary models.
The goal of this work is to address this security problem inherent with efficient, legacy compatible …
2d Vector Map And Database Design For Indoor Assisted Navigation, Luciano Caraciolo Albuquerque
2d Vector Map And Database Design For Indoor Assisted Navigation, Luciano Caraciolo Albuquerque
Dissertations and Theses
In this paper we implemented a 2D Vector Map, map editor and Database design intended to provide an efficient way to convert cad files from indoor environments to a set of vectors representing hallways, doors, exits, elevators, and other entities embedded in a floor plan, and save them in a database for use by other applications, such as assisted navigation for blind people.
A graphical application as developed in C++ to allow the user to input a CAD DXF file, process the file to automatically obtain nodes and edges, and save the nodes and edges to a database for posterior …
Identifying Relationships Between Scientific Datasets, Abdussalam Alawini
Identifying Relationships Between Scientific Datasets, Abdussalam Alawini
Dissertations and Theses
Scientific datasets associated with a research project can proliferate over time as a result of activities such as sharing datasets among collaborators, extending existing datasets with new measurements, and extracting subsets of data for analysis. As such datasets begin to accumulate, it becomes increasingly difficult for a scientist to keep track of their derivation history, which complicates data sharing, provenance tracking, and scientific reproducibility. Understanding what relationships exist between datasets can help scientists recall their original derivation history. For instance, if dataset A is contained in dataset B, then the connection between A and B could be that A was …
Evaluating Distributed Word Representations For Predicting Missing Words In Sentences, Saniya Saifee
Evaluating Distributed Word Representations For Predicting Missing Words In Sentences, Saniya Saifee
Dissertations and Theses
In recent years, the distributed representation of words in vector space or word embeddings have become very popular as they have shown significant improvements in many statistical natural language processing (NLP) tasks as compared to traditional language models like Ngram. In this thesis, we explored various state-of-the-art methods like Latent Semantic Analysis, word2vec, and GloVe to learn the distributed representation of words. Their performance was compared based on the accuracy achieved when tasked with selecting the right missing word in the sentence, given five possible options. For this NLP task we trained each of these methods using a training corpus …
Optimizing Data Movement In Hybrid Analytic Systems, Patrick Michael Leyshock
Optimizing Data Movement In Hybrid Analytic Systems, Patrick Michael Leyshock
Dissertations and Theses
Hybrid systems for analyzing big data integrate an analytic tool and a dedicated data-management platform, storing data and operating on the data at both components. While hybrid systems have benefits over alternative architectures, in order to be effective, data movement between the two hybrid components must be minimized. Extant hybrid systems either fail to address performance problems stemming from inter-component data movement, or else require the user to explicitly reason about and manage data movement. My work presents the design, implementation, and evaluation of a hybrid analytic system for array-structured data that automatically minimizes data movement between the hybrid components. …
Assessing Satellite Image Data Fusion With Information Theory Metrics, James Cross
Assessing Satellite Image Data Fusion With Information Theory Metrics, James Cross
Dissertations and Theses
A common problem in remote sensing is estimating an image with high spatial and high spectral resolution given separate sources of measurements from satellite instruments, one having each of these desirable properties. This thesis presents a survey of seven families of algorithms which have been developed to provide this common pattern of satellite image data fusion. They are all tested on artificially degraded sets of satellite data from the Moderate Resolution Imaging Spectroradiometer (“MODIS”) with known ideal results, and evaluated using the commonly accepted data fusion assessment metrics spectral angle mapper (“SAM”) and Erreur Relative Globale Adimensionelle de Synth`ese (“ERGAS”). …
Neuroevolution And An Application Of An Agent Based Model For Financial Market, Anil Yaman
Neuroevolution And An Application Of An Agent Based Model For Financial Market, Anil Yaman
Dissertations and Theses
Market prediction is one of the most difficult problems for the machine learning community. Even though, successful trading strategies can be found for the training data using various optimization methods, these strategies usually do not perform well on the test data as expected. Therefore, selection of the correct strategy becomes problematic. In this study, we propose an evolutionary algorithm that produces a variation of trader agents ensuring that the trading strategies they use are different. We discuss that because the selection of the correct strategy is difficult, a variety of agents can be used simultaneously in order to reduce risk. …
A Data-Descriptive Feedback Framework For Data Stream Management Systems, Rafael J. Fernández Moctezuma
A Data-Descriptive Feedback Framework For Data Stream Management Systems, Rafael J. Fernández Moctezuma
Dissertations and Theses
Data Stream Management Systems (DSMSs) provide support for continuous query evaluation over data streams. Data streams provide processing challenges due to their unbounded nature and varying characteristics, such as rate and density fluctuations. DSMSs need to adapt stream processing to these changes within certain constraints, such as available computational resources and minimum latency requirements in producing results. The proposed research develops an inter-operator feedback framework, where opportunities for run-time adaptation of stream processing are expressed in terms of descriptions of substreams and actions applicable to the substreams, called feedback punctuations. Both the discovery of adaptation opportunities and the exploitation of …
Performance Analysis Of A Distributed File System, Meenakshi Mukhopadhyay
Performance Analysis Of A Distributed File System, Meenakshi Mukhopadhyay
Dissertations and Theses
An important design goal of a distributed file system, a component of many distributed systems, is to provide UNIX file access semantics, e.g., the result of any write system call is visible by all processes as soon as the call completes. In a distributed environment, these semantics are difficult to implement because processes on different machines do not share kernel cache and data structures. Strong data consistency guarantees may be provided only at the expense of performance.
This work investigates the time costs paid by AFS 3.0, which uses a callback mechanism to provide consistency guarantees, and those paid by …