Open Access. Powered by Scholars. Published by Universities.®

Computer Engineering Commons

Data Storage Systems

2014

Articles 1 - 30 of 312

Full-Text Articles in Computer Engineering

New Challenges For The Archiving Of Digital Writing, Heiko Zimmermann Dec 2014

CLCWeb: Comparative Literature and Culture

In his article "New Challenges for the Archiving of Digital Writing" Heiko Zimmermann discusses the challenges of the preservation of digital texts. In addition to the problems already at the focus of attention of digital archivists, there are elements in digital literature which need to be taken into consideration when trying to archive them. Zimmermann analyses two works of digital literature, the collaborative writing project A Million Penguins (2006-2007) and Renée Turner's She… (2008) and shows how the ontology of these texts is bound to elements of performance, to direct social interaction of writers and readers, to the uniquely subjective …


Effects Of Training Datasets On Both The Extreme Learning Machine And Support Vector Machine For Target Audience Identification On Twitter, Siaw Ling Lo, David Cornforth, Raymond Chiong Dec 2014

Research Collection School Of Computing and Information Systems

The ability to identify or predict a target audience from the increasingly crowded social space will provide a company some competitive advantage over other companies. In this paper, we analyze various training datasets, which include Twitter contents of an account owner and its list of followers, using features generated in different ways for two machine learning approaches - the Extreme Learning Machine (ELM) and Support Vector Machine (SVM). Various configurations of the ELM and SVM have been evaluated. The results indicate that training datasets using features generated from the owner tweets achieve the best performance, relative to other feature sets. …


Badge Web Application, Ryan Green Dec 2014

Computer Engineering

This project includes the imagining, design, build, and test of a web application that creates and tracks a user’s progress on completing tasks that an administrator has created for the user. The goal of this project is to have a functioning webpage that is robust and scalable to support many users and many tasks. The application will be developed for use on all modern web browsers, and will have a persistent server to access from any platform. This project was designed to be an exercise in building a modern web application, and as such is written using many different languages, …


Wingtip Dynamics Simulator, Eugene Fox, Nick Rodriguez, Steven Rieber Dec 2014

Mechanical Engineering

Raytheon is a defense contracting company with an electronic warfare division that is developing a radio frequency signal triangulation system. Part of the focus in improving this technology is the need for accurate, real-time locational knowledge of the signal receivers, which are located at the tips of aircraft wings. Due to turbulence during flight, the fluttering motion of the wings alters the distance and angle relationships of the two receivers and adds noise to the received signal data, which negatively affects the triangulation estimates. To mitigate this error caused by the wing flutter, Raytheon is developing a software …


Identifying The High-Value Social Audience From Twitter Through Text-Mining Methods, Siaw Ling Lo, David Cornforth, Raymond Chiong Nov 2014

Research Collection School Of Computing and Information Systems

Doing business on social media has become a common practice for many companies these days. While the contents shared on Twitter and Facebook offer plenty of opportunities to uncover business insights, it remains a challenge to sift through the huge amount of social media data and identify the potential social audience who is highly likely to be interested in a particular company. In this paper, we analyze the Twitter content of an account owner and its list of followers through various text mining methods, which include fuzzy keyword matching, statistical topic modeling and machine learning approaches. We use tweets of …


Organizing Video Search Results To Adapted Semantic Hierarchies For Topic-Based Browsing, Jiajun Wang, Yu-Gang Jiang, Qiang Wang, Kuiyuan Yang, Chong-Wah Ngo Nov 2014

Research Collection School Of Computing and Information Systems

Organizing video search results into semantically structured hierarchies can greatly improve the efficiency of browsing complex query topics. Traditional hierarchical clustering techniques are inadequate since they lack the ability to generate semantically interpretable structures. In this paper, we introduce an approach to organize video search results to an adapted semantic hierarchy. As many hot search topics such as celebrities and famous cities have Wikipedia pages where hierarchical topic structures are available, we start from the Wikipedia hierarchies and adjust the structures according to the characteristics of the returned videos from a search engine. Ordinary clustering based on textual information of …


VIREO @ TRECVID 2014: Instance Search And Semantic Indexing, Wei Zhang, Hao Zhang, Ting Yao, Yijie Lu, Jingjing Chen, Chong-Wah Ngo Nov 2014

Research Collection School Of Computing and Information Systems

This paper summarizes the two tasks in which the VIREO group participated: instance search and semantic indexing. We present our approaches and analyze the results obtained in the TRECVID 2014 benchmark evaluation.


Click-Through-Based Subspace Learning For Image Search, Yingwei Pan, Ting Yao, Xinmei Tian, Houqiang Li, Chong-Wah Ngo Nov 2014

Research Collection School Of Computing and Information Systems

One of the fundamental problems in image search is to rank image documents according to a given textual query. We address two limitations of the existing image search engines in this paper. First, there is no straightforward way of comparing textual keywords with visual image content. Image search engines therefore depend heavily on the surrounding texts, which are often noisy or too few to accurately describe the image content. Second, ranking functions are trained on query-image pairs labeled by human labelers, which makes the annotation intellectually expensive and prevents it from being scaled up. We demonstrate that the above two fundamental challenges …


DC: Small: Energy-Aware Coordinated Caching In Cluster-Based Storage Systems, Yifeng Zhu Oct 2014

University of Maine Office of Research Administration: Grant Reports

The main goal of this project is to improve the performance and energy efficiency of I/O (Input/Output) operations of large-scale cluster computing platforms.

The major activities include:

1) characterize the memory access workloads;
2) investigate the effect of new and emerging storage and memory devices, such as SSD and PCM, on I/O performance;
3) study energy-efficient buffer and cache replacement algorithms;
4) leverage SSD as a new caching device to improve the energy efficiency and performance of I/O operations.


Name-Face Association In Web Videos: A Large-Scale Dataset, Baselines, And Open Issues, Zhi-Neng Chen, Chong-Wah Ngo, Wei Zhang, Juan Cao, Yu-Gang Jiang Sep 2014

Research Collection School Of Computing and Information Systems

Associating faces appearing in Web videos with names presented in the surrounding context is an important task in many applications. However, the problem has not been well investigated, particularly under large-scale realistic scenarios, mainly due to the scarcity of datasets constructed under such circumstances. In this paper, we introduce a Web video dataset of celebrities, named WebV-Cele, for name-face association. The dataset consists of 75 073 Internet videos of over 4 000 hours, covering 2 427 celebrities and 649 001 faces. This is, to our knowledge, the most comprehensive dataset for this problem. We describe the details of dataset construction, discuss …


Designing An Articulation-Agreement Database For The College Of Science And Engineering And Technology Advising Center, Stephanie Fasen, Susan Hendley, Tim Pham, Danish Zaman Aug 2014

Journal of Undergraduate Research at Minnesota State University, Mankato

During their academic careers, some college students transfer to different universities. To allow students to transfer seamlessly to other colleges, advisors at Minnesota universities create articulation agreements that list the classes that transfer between two universities. To use these documents, students and advisors must search through binders to find the correct articulation agreement and then manually review it. This is a time-consuming process for both students and advisors. To make this information more accessible, we created a web-based database that instantly produces a list of equivalent classes for majors offered at Minnesota State University, Mankato (MSU) and other Minnesota …


Verification Of Costless Merge Pairing Heaps, Joshua Vander Hook Aug 2014

Journal of Undergraduate Research at Minnesota State University, Mankato

Most algorithms’ performance is limited by the data structures they use. Internal algorithms then decide the performance of the data structure. This cycle continues until fundamental results, verified by analysis and experiment, prevent further improvement. In this paper I examine one specific example of this. The focus of this work is primarily on a new variant of the pairing heap. I will review the new implementation, compare its theoretical performance, and discuss my original contribution: the first preliminary data on its experimental performance. It is instructive to provide some background information, followed by a formal definition of heaps in 1.1. …
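
As background for the data structure named in this abstract, the following is a minimal sketch of a standard two-pass pairing heap (a min-heap). It is not the "costless merge" variant the paper studies, whose details are not given here; the class and method names are hypothetical and chosen only for illustration.

```python
class PairingHeap:
    """Textbook two-pass pairing heap (min-heap); illustrative sketch only.

    Each node is a two-element list: [key, list_of_child_nodes].
    """

    def __init__(self):
        self.root = None
        self.size = 0

    @staticmethod
    def _meld(a, b):
        # Link two heap-ordered trees: the root with the larger key
        # becomes a child of the root with the smaller key.
        if a is None:
            return b
        if b is None:
            return a
        if b[0] < a[0]:
            a, b = b, a
        a[1].append(b)
        return a

    def insert(self, key):
        self.root = self._meld(self.root, [key, []])
        self.size += 1

    def find_min(self):
        if self.root is None:
            raise IndexError("find_min on empty heap")
        return self.root[0]

    def merge(self, other):
        # Absorb all elements of `other`, leaving it empty.
        self.root = self._meld(self.root, other.root)
        self.size += other.size
        other.root, other.size = None, 0

    def delete_min(self):
        if self.root is None:
            raise IndexError("delete_min on empty heap")
        key, children = self.root
        # Pass 1: meld the children in pairs, left to right.
        paired = []
        for i in range(0, len(children), 2):
            second = children[i + 1] if i + 1 < len(children) else None
            paired.append(self._meld(children[i], second))
        # Pass 2: meld the resulting trees right to left.
        new_root = None
        for node in reversed(paired):
            new_root = self._meld(node, new_root)
        self.root = new_root
        self.size -= 1
        return key
```

Insert, find-min and merge each perform at most one link, while delete-min does the restructuring work in its two passes.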


Diseño De Un Sistema De Adquisición Y Visualización De Los Resultados, Con Base En El Diagnóstico De Los Requisitos Técnicos De La Norma ISO IEC 17025 2005, En La Medición De Las Variables Humedad Y Temperatura En El IDEAM, Victoria Andrea Leal Saavedra, Jonathan Jesús Cuéllar Guzmán Aug 2014

Ingeniería en Automatización

The calibration process for an instrument consists mainly of a series of established procedures that begin with mechanical and cosmetic repair and end with the instrument fully tuned. With this in mind, it should be noted that the laboratories responsible for carrying out these processes hold an internationally recognized high-quality rating, obtained through certification of compliance with the ISO/IEC 17025:2005 standard in the case of sampling and calibration laboratories. Complying with standards of this kind requires following a series of guidelines that are set out in …


FOSS Big Data Storage Solution, Gary L. Jaffe Aug 2014

STAR Program Research Presentations

Utilizing the AERO Institute as an IT test bed or “sandbox”, a small, agile development team will design, build, and test a data management storage system to support post processing of archived and in-flight data collected with the Piccolo flight control system and Compact Fiber Optic Sensing System (C-FOSS). Both systems are integrated on the APV3 aircraft, a small remote-operated vehicle. Due to the amount of data collected from C-FOSS, a system will be designed to sort and organize large data sets. An open-source database will be explored as a viable solution to manage large data loads and provide multi-cluster system …


FOSS Big Data Storage Solution, Nurdeen Salami Aug 2014

STAR Program Research Presentations

NASA projects require a reliable approach to store large volumes of data. Accordingly, it is crucial to adopt a lightweight, reliable, and scalable database. Current NASA databases bear costly license fees with undesirable speed and flexibility. The purpose of utilizing the AERO Institute as an IT test bed, or “Sandbox,” is to design, build, test, and implement software solutions prior to transfer to NASA projects. Cassandra coupled with the Astyanax API is a viable solution for storing big data. Store a minimum of 2GB of C-FOSS data in multiple file formats (.csv, .log, .xml, and .jpg). Use benchmark tests to …


Ultimate Codes: Near-Optimal MDS Array Codes For RAID-6, Zhijie Huang, Hong Jiang, Chong Wang, Ke Zhou, Yuhong Zhao Jul 2014

CSE Technical Reports

As modern storage systems have grown in size and complexity, RAID-6 is poised to replace RAID-5 as the dominant form of RAID architectures due to its ability to protect against double disk failures. Many excellent erasure codes specially designed for RAID-6 have emerged in recent years. However, all of them have limitations. In this paper, we present a class of near perfect erasure codes for RAID-6, called the Ultimate codes. These codes encode, update and decode either optimally or nearly optimally, regardless of what the code length is. This implies that utilizing these codes we can build highly efficient and …


S-Code: Lowest Density MDS Array Codes For RAID-6, Zhijie Huang, Hong Jiang, Ke Zhou, Yuhong Zhao, Chong Wang Jul 2014

CSE Technical Reports

RAID, a storage architecture designed to exploit I/O parallelism and provide data reliability, has been deployed widely in computing systems as a storage building block. In large scale storage systems, in particular, RAID-6 is gradually replacing RAID-5 as the dominant form of disk arrays due to its capability of tolerating concurrent failures of any two disks. MDS (maximum distance separable) array codes are the most popular erasure codes that can be used for implementing RAID-6, since they enable optimal storage efficiency and efficient encoding and decoding algorithms. In this paper, we propose a new class of MDS array codes called …
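
To illustrate what "tolerating concurrent failures of any two disks" involves, here is a minimal sketch of the classic Reed-Solomon style P+Q parity scheme over GF(2^8). This is a swapped-in, well-known technique used only for illustration; it is not S-Code or any other array code from these reports. The function names are hypothetical, and the field polynomial 0x11d with generator 2 is an assumption of this sketch. Only the case of two lost data disks is shown.

```python
# GF(2^8) log/antilog tables, assuming polynomial 0x11d and generator 2.
EXP = [0] * 512
LOG = [0] * 256
_v = 1
for _i in range(255):
    EXP[_i] = _v
    LOG[_v] = _i
    _v <<= 1
    if _v & 0x100:
        _v ^= 0x11D
for _i in range(255, 512):
    EXP[_i] = EXP[_i - 255]


def gf_mul(a, b):
    """Multiply two field elements."""
    if a == 0 or b == 0:
        return 0
    return EXP[LOG[a] + LOG[b]]


def gf_div(a, b):
    """Divide a by a non-zero field element b."""
    if b == 0:
        raise ZeroDivisionError("division by zero in GF(2^8)")
    if a == 0:
        return 0
    return EXP[(LOG[a] - LOG[b]) % 255]


def encode(disks):
    """Return (P, Q) parity for equally sized data blocks.

    P is the XOR of all blocks; Q is the XOR of g^i * D_i with g = 2.
    """
    n = len(disks[0])
    p, q = bytearray(n), bytearray(n)
    for i, block in enumerate(disks):
        coeff = EXP[i]  # g^i for data disk i
        for j, byte in enumerate(block):
            p[j] ^= byte
            q[j] ^= gf_mul(coeff, byte)
    return bytes(p), bytes(q)


def recover_two(disks, x, y, p, q):
    """Rebuild data disks x and y (x != y) from the survivors plus P and Q.

    Entries of `disks` at positions x and y are ignored.
    """
    n = len(p)
    pxy, qxy = bytearray(p), bytearray(q)
    for i, block in enumerate(disks):
        if i in (x, y):
            continue
        coeff = EXP[i]
        for j, byte in enumerate(block):
            pxy[j] ^= byte                 # leaves D_x ^ D_y
            qxy[j] ^= gf_mul(coeff, byte)  # leaves g^x D_x ^ g^y D_y
    gx, gy = EXP[x], EXP[y]
    denom = gx ^ gy
    dx = bytes(gf_div(qxy[j] ^ gf_mul(gy, pxy[j]), denom) for j in range(n))
    dy = bytes(pxy[j] ^ dx[j] for j in range(n))
    return dx, dy
```

For example, after `p, q = encode(blocks)`, calling `recover_two(damaged, 1, 3, p, q)` should return the original contents of disks 1 and 3. Array codes such as those described in these reports typically aim for the same two-failure guarantee using only XOR operations, which avoids the field multiplications above.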


An Examination Of The Factors Determining Successful Implementation Of An Electronic Medical Record (EMR) System In A Regional Hospital, Mehrdad Motamed Jul 2014

Morehead State Theses and Dissertations

A Thesis Presented to the Faculty of the College of Business and Public Affairs Morehead State University in Partial Fulfillment of the Requirements for the Degree Master of Science by Mehrdad Motamed in July of 2014.


Click-Through-Based Cross-View Learning For Image Search, Yingwei Pan, Ting Yao, Tao Mei, Houqiang Li, Chong-Wah Ngo, Yong Rui Jul 2014

Research Collection School Of Computing and Information Systems

One of the fundamental problems in image search is to rank image documents according to a given textual query. Existing search engines depend heavily on surrounding texts for ranking images, or leverage the query-image pairs annotated by human labelers to train a series of ranking functions. However, there are two major limitations: 1) the surrounding texts are often noisy or too few to accurately describe the image content, and 2) the human annotations are resource-intensive and thus cannot be scaled up. We demonstrate in this paper that the above two fundamental challenges can be mitigated by jointly exploring the …


A Best-Worst Scaling Model Of Climate Change Abatement By Australian Farmers, Marit E. Kragt, Nikki Dumbrell, Fiona Gibson Jun 2014

International Congress on Environmental Modelling and Software

Storing carbon from the atmosphere in terrestrial sinks has been proposed as an important way to mitigate climate change and is a major focus in Australia's climate change policies. Mitigation by changing agricultural practices is seen as a promising way to achieve significant reductions in CO2 concentrations. Several policies therefore aim to stimulate farmers to adopt so-called 'carbon farming' practices. However, there is little information about farmers' ability and willingness to adopt carbon farming. We present a best-worst scaling model to analyse farmers' decisions about adopting climate change mitigating practices. Best-worst scaling data was collected through a survey amongst mixed …


Assessing The Transition To A Low-Carbon Economy Using Actor-Based System-Dynamic Models, Dmitry V. Kovalevsky, Klaus Hasselmann Jun 2014

International Congress on Environmental Modelling and Software

For a comprehensive analysis of climate mitigation policies, Integrated Assessment models (IAMs) of the coupled climate-socioeconomic system are needed. However, while there is general agreement on the physics of the climate system, the dynamics of the socioeconomic system is still the subject of considerable controversy. This has become particularly apparent since the recent global financial crisis. To explore the dynamics of the socio-economic system, a family of socio-economic models is proposed that incorporates the various alternative assumptions regarding the behaviour of the different economic actors that govern the evolution of the socio-economic system. The model family needs to be developed …


Integration Of Models For Low Carbon Economy, Getachew F. Belete, Alexey Voinov Jun 2014

International Congress on Environmental Modelling and Software

Designing the transition to a low-carbon economy is a very complex task that touches upon a wide variety of climate-energy-economic systems. We need to explore the various possible climate mitigation scenarios at different temporal and spatial scales. However, due to the diversity of the disciplines involved, it is difficult to find one complete and unified modeling approach that works equally well in all those different domains. As a result, we have to select 'appropriate' models, which represent only specific aspects of the scenarios, and assemble them 'coherently'. In this research we have identified some challenges in integrating multidisciplinary models; and …


Enhancing The Policy Relevance Of Scenarios Through A Dynamic Analytical Approach, Céline Guivarch, Vanessa Schweizer, Julia Rozenberg Jun 2014

International Congress on Environmental Modelling and Software

We present a new dynamic analytical approach for studying scenarios produced by an integrated assessment (IA) model. Our approach involves the analysis of a large number of scenarios, which can better address three principal shortcomings of how uncertainty is traditionally handled in IA scenario studies. The shortcomings are all a result of the prevailing practice of investigating a small number of scenarios and include (1) the ad hoc nature of exploring vast socioeconomic uncertainties with only a small number of scenarios; (2) the conventional representation of alternative scenario typologies as "parallel universes" or "diverging universes", which provide little insight on …


Global Sensitivity Analysis Of Key Parameters In A Process-Based Sugarcane Growth Model - A Bayesian Approach, Justin Sexton, Yvette Everingham Jun 2014

International Congress on Environmental Modelling and Software

While several statistical methods are available to analyse model sensitivity, their application to complex process-based models is often impractical due to the large number of simulation runs required. A Bayesian approach to global sensitivity analysis can greatly reduce the number of simulation runs required by building an emulator of the model which is less computationally demanding. A Gaussian Emulation Machine (GEM) was used to efficiently assess the sensitivity of key agronomic outputs from the APSIM-Sugar crop model to influential input parameters. The sensitivity of simulated biomass and sucrose at harvest was assessed on 14 parameters representing varietal differences and growth …


Hypothesis Testing For Management: Evolving And Answering Closed Questions Using Multiobjective Visualization, Joseph Kasprzyk, Joseph Guillaume, Joshua Kollat, Chris Danilo Jun 2014

International Congress on Environmental Modelling and Software

In order to use models to understand deeply uncertain future conditions, managers must be able to pose and test hypotheses about their management problems. In Iterative Closed Question Methodology (ICQM), a series of closed questions is used to structure thinking about hypotheses while looking beyond a problem's existing modeling representation. Our research is exploring how ICQM can contribute to a framework called Many Objective Robust Decision Making (MORDM), which uses multiobjective optimization and ensembles of uncertain future states of the world to create and evaluate robust solutions for environmental management. A visualization software tool, AeroVis, has greatly aided implementation of …


Ontology Mapping In Semantic Time Series Processing In Climate Change Prediction, Bojan Božić, Jan Peters-Anders, Gerald Schimak Jun 2014

International Congress on Environmental Modelling and Software

Time series processing increasingly needs to serve diverse user groups interested in a specific domain with appropriately tailored time series data. The complexity of time series (e.g. data from different sources and/or domains, visualization and representation, etc.) is growing rapidly. As a consequence, users need to find a path through the jungle of time series data. Having presented our concepts for semantic time series filtering and enrichment of time series with meta-information and annotations (Božić et al., 2012), we are now going to present a …


Metadata Extraction Using Semantic And Natural Language Processing Techniques, Rob Knapen, Thomas Hüsing, Klaus Jacob, Yke Van Randen, Stefan Reis, Onno Roosenschoon, Sander Janssen Jun 2014

International Congress on Environmental Modelling and Software

The World Wide Web and related technologies are playing an increasing role in the field of Integrated Environmental Modelling (IEM). Model integration software frameworks are more and more becoming web-enabled. The technologies and standards of the Web are used to access and run simulation models remotely (known as the Web of models) and are considered for enabling interoperability across model integration frameworks. Furthermore there is a growing number of local and global initiatives to provide open access to environmental data (Web of data) that can potentially be used as input for the scientific models. The availability of descriptive information of …


Improved Implicit Stochastic Optimization Technique For Multireservoir Water Systems Under Drought Conditions, Andrea Sulis Jun 2014

International Congress on Environmental Modelling and Software

Drought is a creeping phenomenon, making its onset and end difficult to determine. Damages from droughts can exceed those resulting from any other natural hazard, although it is difficult to assign a monetary value to them. In the Mediterranean area a severe drought period occurred over the years 2000-2002 and economic losses from that drought exceeded 250 million euros in Sardinia (Italy) (source: ENAS Regional Water Authority). Currently, technological developments and environmental modelling tools are improving our ability to more effectively manage water supply systems. Models can provide decision makers with better and more timely data and information. In this …


Modelling Biofilm Based Technologies With Activated Sludge Unit Processes: A Short Cut To Performance Simulation?, Noella Jones, Maebh Grace, Eoghan Clifford Jun 2014

International Congress on Environmental Modelling and Software

Biofilm-based passive aeration systems (PAS) have attracted recent attention as alternative, energy efficient and low maintenance technologies in the wastewater sector. However, the modelling of biofilm-based PAS poses unique challenges for modellers, particularly where new technologies are not easily simulated using existing commercial modelling software. Nevertheless, if the modeller is concerned only with simulating "macro" plant performance (e.g. key effluent concentrations and cycle analysis), it may be possible to efficiently model these technologies using "surrogate" unit process systems (e.g. using an activated sludge process to model a biofilm process). The pumped flow biofilm reactor (PFBR), a batch biofilm technology, is …


Development Of A Decision Support Tool To Allocate Irrigation Water On Competitive Basis: Application To Kathiraveli Village, Sri Lanka, Tom Le Cerf, Muhammed A. Bhuiyan Jun 2014

International Congress on Environmental Modelling and Software

This paper focuses specifically on the irrigation field and water supply tanks, looking for ways to improve the efficiency of water use and increase resilience to climatic variability. The purpose of this paper is to develop an Excel-based water balance model that will allow the maximum cropping area to be planted in the upcoming agricultural season. The model consists of three modules: a crop water requirement calculator that allows the water requirements of specific crops to be compared, a water tank balance model, and a model which simulates the storage in the permanent wetland attached to the irrigation tank. The hydrological computation …
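
The idea of a tank water balance can be sketched generically as follows. This is not the authors' spreadsheet model: the function name, the parameters, and the simple proportional loss term are all assumptions made for illustration, and units are arbitrary but must be consistent.

```python
def simulate_tank(initial, capacity, inflows, demands, loss_rate=0.0):
    """Step a single storage tank through successive periods.

    Hypothetical sketch: per period, storage gains the inflow, loses a
    fraction `loss_rate` of the starting storage (standing in for
    evaporation and seepage), supplies as much of the demand as possible,
    and spills anything above `capacity`.
    Returns a list of (end_storage, supplied) tuples, one per period.
    """
    storage = initial
    history = []
    for inflow, demand in zip(inflows, demands):
        losses = loss_rate * storage
        available = max(storage + inflow - losses, 0)
        supplied = min(demand, available)
        storage = min(available - supplied, capacity)  # excess spills
        history.append((storage, supplied))
    return history
```

Comparing the `supplied` values against the demand implied by a candidate cropping area would indicate whether that area can be irrigated for the whole season, which is the kind of question the abstract describes.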