Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 13 of 13

Full-Text Articles in Computer Sciences

Structural Analysis And Auditing Of Snomed Hierarchies Using Abstraction Networks, Yue Wang May 2012

Structural Analysis And Auditing Of Snomed Hierarchies Using Abstraction Networks, Yue Wang

Dissertations

SNOMED is one of the leading healthcare terminologies being used worldwide. Due to its sheer volume and continuing expansion, it is inevitable that errors will make their way into SNOMED. Thus, quality assurance is an important part of its maintenance cycle.

A structural approach is presented in this dissertation, aiming at developing automated techniques that can aid auditors in the discovery of terminology errors more effectively and efficiently. Large SNOMED hierarchies are partitioned, based primarily on their relationships patterns, into concept groups of more manageable sizes. Three related abstraction networks with respect to a SNOMED hierarchy, namely the area taxonomy …


Effects Of Information Importance And Distribution On Information Exchange In Team Decision Making, Babajide James Osatuyi May 2012

Effects Of Information Importance And Distribution On Information Exchange In Team Decision Making, Babajide James Osatuyi

Dissertations

Teams in organizations are strategically built with members from domains and experiences so that a wider range of information and options can be pooled. This strategic team structure is based on the assumption that when team members share the information they have, the team as a whole can access a larger pool of information than any one member acting alone, potentially enabling them to make better decisions. However, studies have shown that teams, unlike individuals, sometimes do not effectively share and use the unique information available to them, leading to poorer decisions. Research on information sharing in team decision making …


Registration And Categorization Of Camera Captured Documents, Venkata Gopal Edupuganti May 2012

Registration And Categorization Of Camera Captured Documents, Venkata Gopal Edupuganti

Dissertations

Camera captured document image analysis concerns with processing of documents captured with hand-held sensors, smart phones, or other capturing devices using advanced image processing, computer vision, pattern recognition, and machine learning techniques. As there is no constrained capturing in the real world, the captured documents suffer from illumination variation, viewpoint variation, highly variable scale/resolution, background clutter, occlusion, and non-rigid deformations e.g., folds and crumples. Document registration is a problem where the image of a template document whose layout is known is registered with a test document image. Literature in camera captured document mosaicing addressed the registration of captured documents with …


Example Based Texture Synthesis And Quantification Of Texture Quality, Chandralekha De May 2012

Example Based Texture Synthesis And Quantification Of Texture Quality, Chandralekha De

Dissertations

Textures have been used effectively to create realistic environments for virtual worlds by reproducing the surface appearances. One of the widely-used methods for creating textures is the example based texture synthesis method. In this method of generating a texture of arbitrary size, an input image from the real world is provided. This input image is used for the basis of generating large textures. Various methods based on the underlying pattern of the image have been used to create these textures; however, the problem of finding an algorithm which provides a good output is still an open research issue. Moreover, the …


A Comparative Analysis Of Machine Learning Algorithms For Genome Wide Association Studies, Neha Singh May 2012

A Comparative Analysis Of Machine Learning Algorithms For Genome Wide Association Studies, Neha Singh

Theses

Variations present in human genome play a vital role in the emergence of genetic disorders and abnormal traits. Single Nucleotide Polymorphism (SNP) is considered as the most common source of genetic variations. Genome Wide Association Studies (GWAS) probe these variations present in human population and find their association with complex genetic disorders. Now these days, recent advances in technology and drastic reduction in costs of Genome Wide Association Studies provide the opportunity to have a plethora of genomic data that delivers huge information of these variations to analyze. In fact, there is significant difference in pace of data generation and …


Heterogeneity-Aware And Energy-Aware Scheduling And Routing In Wireless Sensor Networks, Mahesh Kumar Vasanthu Somashekar May 2012

Heterogeneity-Aware And Energy-Aware Scheduling And Routing In Wireless Sensor Networks, Mahesh Kumar Vasanthu Somashekar

Theses

A Wireless Sensor Network (WSN) is a group of specialized transducers, called sensor nodes, with a communication infrastructure intended to monitor and record conditions at diverse locations. Since WSN applications are usually deployed in an open environment, the network is exposed to rough weather conditions, such as rain and snow. Another problem that WSN applications need to deal with is the energy constraints of sensor nodes. Both problems adversely affect the lifetime of WSN applications. A lot of research has been conducted to prolong the lifetime of WSN applications considering energy constraints of sensor nodes, but not much research has …


Reducing The Risk Of Software Cost Estimation, Shixian Yang May 2012

Reducing The Risk Of Software Cost Estimation, Shixian Yang

Theses

Inaccurate cost estimation is a well-known problem in software development. The common cost estimation models are point estimates that are unable to quantify uncertainties. Furthermore, it is difficult to calibrate the uncertainties in cost estimation due to the lack of information. The purpose of this thesis is to prove that probability techniques could be synthesized into COCOMO (Constructive Cost Model) to quantify uncertainties. Another aim is to find out how to get more insight on reducing the risk of cost estimation. In this thesis, some historical data is presented to show the variance in factors of COCOMO. Monte Carlo simulation …


Data Mining Of Tetraloop-Tetraloop Receptors In Rna Xml Files, Sinan Ramazanoglu May 2012

Data Mining Of Tetraloop-Tetraloop Receptors In Rna Xml Files, Sinan Ramazanoglu

Theses

RNA (Ribonucleic acid) Motifs are tertiary structures that play an important role in the folding mechanism of the RNA molecule. The overall function of a RNA Motif depends on its specific bp (base pairs) sequence that constitutes the secondary structure. Data mining is a novel method in both discovering potential tertiary structures within DNA (Deoxyribonucleic acid), RNA, and protein molecules and storing the information in databases. The RNA Motif of interest is the tetraloop-tetraloop receptor, which is composed of a highly conserved 11 nt (nucleotide) sequence and a tetraloop with the generic form of GNRA (where N = any base …


Phenotype Prediction And Feature Selection In Genome-Wide Association Studies, Andrew Roberts May 2012

Phenotype Prediction And Feature Selection In Genome-Wide Association Studies, Andrew Roberts

Theses

Genome wide association studies (GWAS) search for correlations between single nucleotide polymorphisms (SNPs) in a subject genome and an observed phenotype. GWAS can be used to generate models for predicting phenotype based on genotype, as well as aiding in identification of specific genes affecting the biological mechanism underlying the phenotype.

In this investigation, phenotype prediction models are constructed from GWAS training data and are evaluated for performance on test data. Three methods are used to rank SNPs by their correlation with the phenotype: the univariate Wald test, a multivariate, support vector machine (SVM) based technique, and a hybrid method where …


Using An Ontology To Improve The Web Search Experience, Tian Tian Jan 2012

Using An Ontology To Improve The Web Search Experience, Tian Tian

Dissertations

The search terms that a user passes to a search engine are often ambiguous, referring to homonyms. The results in these cases are a mixture of links to documents that contain different meanings of the search terms. Current search engines provide suggested query completions in a dropdown list. However, such lists are not well organized, mixing completions for different meanings. In addition, the suggested search phrases are not discriminating enough. Moreover, current search engines often return an unexpected number of results. Zero hits are naturally undesirable, while too many hits are likely to be overwhelming and of low precision.

This …


Design And Implementation Of A Cyberinfrastructure For Rna Motif Search, Prediction And Analysis, Dongrong Wen Jan 2012

Design And Implementation Of A Cyberinfrastructure For Rna Motif Search, Prediction And Analysis, Dongrong Wen

Dissertations

RNA secondary and tertiary structure motifs play important roles in cells. However, very few web servers are available for RNA motif search and prediction. In this dissertation, a cyberinfrastructure, named RNAcyber, capable of performing RNA motif search and prediction, is proposed, designed and implemented.

The first component of RNAcyber is a web-based search engine, named RmotifDB. This web-based tool integrates an RNA secondary structure comparison algorithm with the secondary structure motifs stored in the Rfam database. With a user-friendly interface, RmotifDB provides the ability to search for ncRNA structure motifs in both structural and sequential ways. The second component of …


Approximate String Matching Methods For Duplicate Detection And Clustering Tasks, Oleksandr Rudniy Jan 2012

Approximate String Matching Methods For Duplicate Detection And Clustering Tasks, Oleksandr Rudniy

Dissertations

Approximate string matching methods are utilized by a vast number of duplicate detection and clustering applications in various knowledge domains. The application area is expected to grow due to the recent significant increase in the amount of digital data and knowledge sources. Despite the large number of existing string similarity metrics, there is a need for more precise approximate string matching methods to improve the efficiency of computer-driven data processing, thus decreasing labor-intensive human involvement.

This work introduces a family of novel string similarity methods, which outperform a number of effective well-known and widely used string similarity functions. The new …


An Examination Of Coordination Among Friends And Strangers From A Coordination Theory Perspective, Christopher D. Wamble Jan 2012

An Examination Of Coordination Among Friends And Strangers From A Coordination Theory Perspective, Christopher D. Wamble

Theses

Within mobile social coordination, there is a field of study known as outeraction, the communicative processes used by people to manage future interactions. It is an important area of research because it identifies how informal interactions support complex collaboration between individuals and groups. Outeraction is primarily conducted through the interpersonal communication channels of texting, instant messaging (IM), face-to-face, and mobile phone or Skype conversations. Currently this area of research in mobile outeraction support systems is weak. It lacks a firm foundation in system building, has very few if any conceptual frameworks, and little empirical knowledge of user requirements and attitudes …