Open Access. Powered by Scholars. Published by Universities.®

Data Storage Systems Commons

Open Access. Powered by Scholars. Published by Universities.®

1,465 Full-Text Articles 3,568 Authors 520,855 Downloads 119 Institutions

All Articles in Data Storage Systems

Faceted Search

1,465 full-text articles. Page 47 of 69.

Sheep Updates 2015 - Merredin, Bruce Mullan, Kate Pritchett, Kimbal Curtis, Chris Wilcox, Lynne Bradshaw, Geoff Lindon, Katherine Davies, Joe Young, Stephen Lee, Dawson Bradford, Khama Kelman, Lucy Anderton, Jaq Pearson, Jackie Jarvis, Ben Patrick 2015 Department of Agriculture and Food, Western Australia

Sheep Updates 2015 - Merredin, Bruce Mullan, Kate Pritchett, Kimbal Curtis, Chris Wilcox, Lynne Bradshaw, Geoff Lindon, Katherine Davies, Joe Young, Stephen Lee, Dawson Bradford, Khama Kelman, Lucy Anderton, Jaq Pearson, Jackie Jarvis, Ben Patrick

Sheep Updates

This session covers fourteen papers from different authors:

1. The Sheep Industry Business Innovation project, Bruce Mullan, Sheep Industry Development Director, Department of Agriculture and Food, Western Australia

2. Western Australian sheep stocktake, Kate Pritchett and Kimbal Curtis, Research Officers, Department of Agriculture and Food, Western Australia

3. Wool demand and supply - short term volatility, long term opportunities, Chris Wilcox, Principal of Poimena Analysis

4. Myths, Facts and the role of animal welfare in farming, Lynne Bradshaw, president, RSPCA WA

5. Latest research and development on breech strike prevention, Geoff Lindon, Manager Productivity and Animal Welfare, AWI

6. …


Evolution And Usage Of The Portal Data Archive: 10-Year Retrospective, Kristin A. Tufte, Robert Bertini, Morgan Harvey 2015 Portland State University

Evolution And Usage Of The Portal Data Archive: 10-Year Retrospective, Kristin A. Tufte, Robert Bertini, Morgan Harvey

Civil and Environmental Engineering Faculty Publications and Presentations

The Portal transportation data archive (http://portal.its.pdx.edu/) was begun in June 2004 in collaboration with the Oregon Department of Transportation, with a single data source: freeway loop detector data. In 10 years, Portal has grown to contain approximately 3 TB of transportation-related data from a wide variety of systems and sources, including freeway data, arterial signal data, travel times from Bluetooth detection systems, transit data, and bicycle count data. Over its 10-year existence, Portal has expanded both in the type of data that it receives and in the geographic regions from which it gets data. This paper discusses the …


Hadoop Based Data Intensive Computation On Iaas Cloud Platforms, Sruthi Vijayakumar 2015 University of North Florida

Hadoop Based Data Intensive Computation On Iaas Cloud Platforms, Sruthi Vijayakumar

UNF Graduate Theses and Dissertations

Cloud computing is a relatively new form of computing which uses virtualized resources. It is dynamically scalable and is often provided as pay for use service over the Internet or Intranet or both. With increasing demand for data storage in the cloud, the study of data-intensive applications is becoming a primary focus. Data intensive applications are those which involve high CPU usage, processing large volumes of data typically in size of hundreds of gigabytes, terabytes or petabytes. The research in this thesis is focused on the Amazon’s Elastic Cloud Compute (EC2) and Amazon Elastic Map Reduce (EMR) using HiBench Hadoop …


Hair-Oriented Data Model For Spatio-Temporal Data Mining, Abbas Madraky, Zulaiha Ali Othman, Razak Hamdan 2014 Universiti Kebangsaan Malaysia

Hair-Oriented Data Model For Spatio-Temporal Data Mining, Abbas Madraky, Zulaiha Ali Othman, Razak Hamdan

Abbas Madraky

Spatio-temporal data are complex in terms of number of attributes for spatial and temporal values, and the data are changing towards time. Traditional method to mining the spatio-temporal data is the fact that the data is stored in data warehouse in un-normalization form as union of spatial and temporal data know as tabular data warehouse. A Hair-Oriented Data Model (HODM) has been proved as a suitable data model for spatio-temporal data. It has reduced the file size and decreased query execution time. The spatio-temporal data stored using the HODM known as Hair-Oriented Data warehouse. However, this paper aims to presents …


New Challenges For The Archiving Of Digital Writing, Heiko Zimmermann 2014 University of Trier

New Challenges For The Archiving Of Digital Writing, Heiko Zimmermann

CLCWeb: Comparative Literature and Culture

In his article "New Challenges for the Archiving of Digital Writing" Heiko Zimmermann discusses the challenges of the preservation of digital texts. In addition to the problems already at the focus of attention of digital archivists, there are elements in digital literature which need to be taken into consideration when trying to archive them. Zimmermann analyses two works of digital literature, the collaborative writing project A Million Penguins (2006-2007) and Renée Tuner's She… (2008) and shows how the ontology of these texts is bound to elements of performance, to direct social interaction of writers and readers to the uniquely subjective …


Effects Of Training Datasets On Both The Extreme Learning Machine And Support Vector Machine For Target Audience Identification On Twitter, Siaw Ling LO, David CORNFORTH, Raymond CHIONG 2014 Singapore Management University

Effects Of Training Datasets On Both The Extreme Learning Machine And Support Vector Machine For Target Audience Identification On Twitter, Siaw Ling Lo, David Cornforth, Raymond Chiong

Research Collection School Of Computing and Information Systems

The ability to identify or predict a target audience from the increasingly crowded social space will provide a company some competitive advantage over other companies. In this paper, we analyze various training datasets, which include Twitter contents of an account owner and its list of followers, using features generated in different ways for two machine learning approaches - the Extreme Learning Machine (ELM) and Support Vector Machine (SVM). Various configurations of the ELM and SVM have been evaluated. The results indicate that training datasets using features generated from the owner tweets achieve the best performance, relative to other feature sets. …


Badge Web Application, Ryan Green 2014 California Polytechnic State University - San Luis Obispo

Badge Web Application, Ryan Green

Computer Engineering

This project includes the imagining, design, build, and test of a web application that creates and tracks a user’s progress on completing tasks that an administrator has created for the user. The goal of this project is to have a functioning webpage that is robust and scalable to support many users and many tasks. The application will be developed for use on all modern web browsers, and will have a persistent server to access from any platform. This project was designed to be an exercise in building a modern web application, and as such is written using many different languages, …


Wingtip Dynamics Simulator, Eugene Fox, Nick Rodriguez, Steven Rieber 2014 California Polytechnic State University - San Luis Obispo

Wingtip Dynamics Simulator, Eugene Fox, Nick Rodriguez, Steven Rieber

Mechanical Engineering

Raytheon is a defense contracting company with an electronic warfare division that is developing a radio frequency signal triangulation system. Part of the focus in improving this technology is the need for accurate and real time locational knowledge of the signal receivers, which are located at the tips of aircraft wings. Due to turbulence during flight, the fluttering motion of the wings alter the distance and angle relationships of the two receivers and add noise to the received signal data, which negatively affect the triangulation estimates. To mitigate this error caused by the wing flutter, Raytheon is developing a software …


Identifying The High-Value Social Audience From Twitter Through Text-Mining Methods, Siaw Ling LO, David CORNFORTH, Raymond CHIONG 2014 Singapore Management University

Identifying The High-Value Social Audience From Twitter Through Text-Mining Methods, Siaw Ling Lo, David Cornforth, Raymond Chiong

Research Collection School Of Computing and Information Systems

Doing business on social media has become a common practice for many companies these days. While the contents shared on Twitter and Facebook offer plenty of opportunities to uncover business insights, it remains a challenge to sift through the huge amount of social media data and identify the potential social audience who is highly likely to be interested in a particular company. In this paper, we analyze the Twitter content of an account owner and its list of followers through various text mining methods, which include fuzzy keyword matching, statistical topic modeling and machine learning approaches. We use tweets of …


Organizing Video Search Results To Adapted Semantic Hierarchies For Topic-Based Browsing, Jiajun WANG, Yu-Gang JIANG, Qiang WANG, Kuiyuan YANG, Chong-wah NGO 2014 Singapore Management University

Organizing Video Search Results To Adapted Semantic Hierarchies For Topic-Based Browsing, Jiajun Wang, Yu-Gang Jiang, Qiang Wang, Kuiyuan Yang, Chong-Wah Ngo

Research Collection School Of Computing and Information Systems

Organizing video search results into semantically structured hierarchies can greatly improve the efficiency of browsing complex query topics. Traditional hierarchical clustering techniques are inadequate since they lack the ability to generate semantically interpretable structures. In this paper, we introduce an approach to organize video search results to an adapted semantic hierarchy. As many hot search topics such as celebrities and famous cities have Wikipedia pages where hierarchical topic structures are available, we start from the Wikipedia hierarchies and adjust the structures according to the characteristics of the returned videos from a search engine. Ordinary clustering based on textual information of …


Vireo @ Trecvid 2014: Instance Search And Semantic Indexing, Wei ZHANG, Hao ZHANG, Ting YAO, Yijie LU, Jingjing CHEN, Chong-wah NGO 2014 Singapore Management University

Vireo @ Trecvid 2014: Instance Search And Semantic Indexing, Wei Zhang, Hao Zhang, Ting Yao, Yijie Lu, Jingjing Chen, Chong-Wah Ngo

Research Collection School Of Computing and Information Systems

This paper summarizes the following two tasks participated by VIREO group: instance search and semantic indexing. We will present our approaches and analyze the results obtained in TRECVID 2014 benchmark evaluation


Click-Through-Based Subspace Learning For Image Search, Yingwei PAN, Ting YAO, Xinmei TIAN, Houqiang LI, Chong-wah NGO 2014 Singapore Management University

Click-Through-Based Subspace Learning For Image Search, Yingwei Pan, Ting Yao, Xinmei Tian, Houqiang Li, Chong-Wah Ngo

Research Collection School Of Computing and Information Systems

One of the fundamental problems in image search is to rank image documents according to a given textual query. We address two limitations of the existing image search engines in this paper. First, there is no straightforward way of comparing textual keywords with visual image content. Image search engines therefore highly depend on the surrounding texts, which are often noisy or too few to accurately describe the image content. Second, ranking functions are trained on query-image pairs labeled by human labelers, making the annotation intellectually expensive and thus cannot be scaled up. We demonstrate that the above two fundamental challenges …


Dc: Small: Energy-Aware Coordinated Caching In Cluster-Based Storage Systems, Yifeng Zhu 2014 Principal Investigator; University of Maine, Orono

Dc: Small: Energy-Aware Coordinated Caching In Cluster-Based Storage Systems, Yifeng Zhu

University of Maine Office of Research Administration: Grant Reports

The main goal of this project is to improve the performance and energy efficiency of I/O (Input/Output) operations of large-scale cluster computing platforms.

The major activities include:

1) characterize the memory access workloads;
2) investigate the new and emerging new storage and memory devices, such as SSD and PCM, on I/O performance.
(3) study energy-efficient buffer and cache replacement algorithms,
(4) leveraging SSD as a new caching device to improve the energy efficiency and performance of I/O performance


Name-Face Association In Web Videos: A Large-Scale Dataset, Baselines, And Open Issues, Zhi-Neng CHEN, Chong-wah NGO, Wei ZHANG, Juan CAO, Yu-Gang JIANG 2014 Singapore Management University

Name-Face Association In Web Videos: A Large-Scale Dataset, Baselines, And Open Issues, Zhi-Neng Chen, Chong-Wah Ngo, Wei Zhang, Juan Cao, Yu-Gang Jiang

Research Collection School Of Computing and Information Systems

Associating faces appearing in Web videos with names presented in the surrounding context is an important task in many applications. However, the problem is not well investigated particularly under large-scale realistic scenario, mainly due to the scarcity of dataset constructed in such circumstance. In this paper, we introduce a Web video dataset of celebrities, named WebV-Cele, for name-face association. The dataset consists of 75 073 Internet videos of over 4 000 hours, covering 2 427 celebrities and 649 001 faces. This is, to our knowledge, the most comprehensive dataset for this problem. We describe the details of dataset construction, discuss …


Designing An Articulation-Agreement Database For The College Of Science And Engineering And Technology Advising Center, Stephanie Fasen, Susan Hendley, Tim Pham, Danish Zaman 2014 Minnesota State University, Mankato

Designing An Articulation-Agreement Database For The College Of Science And Engineering And Technology Advising Center, Stephanie Fasen, Susan Hendley, Tim Pham, Danish Zaman

Journal of Undergraduate Research at Minnesota State University, Mankato

During their academic careers, some college students transfer to different universities. To allow students to transfer seamlessly to other colleges, advisors at Minnesota universities create articulation agreements that list the classes that transfer between two universities. To use these documents, students and advisors must search through binders to find the correct articulation agreement and then manually review it. This is a time-consuming process for both students and advisors. To make this information more accessible, we created a web-based database that instantly produces a list of equivalent classes for majors offered at Minnesota St ate University, Mankato (MSU) and other Minnesota …


Verification Of Costless Merge Pairing Heaps, Joshua Vander Hook 2014 MInnesota State University, Mankato

Verification Of Costless Merge Pairing Heaps, Joshua Vander Hook

Journal of Undergraduate Research at Minnesota State University, Mankato

Most algorithms’ performance is limited by the data structures they use. Internal algorithms then decide the performance of the data structure. This cycle continues until fundamental results, verified by analysis and experiment, prevent further improvement. In this paper I examine one specific example of this. The focus of this work is primarily on a new variant of the pairing heap. I will review the new implementation, compare its theoretical performance, and discuss my original contribution: the first preliminary data on its experimental performance. It is instructive to provide some background information, followed by a formal definition of heaps in 1.1. …


Diseño De Un Sistema De Adquisición Y Visualización De Los Resultados, Con Base En El Diagnóstico De Los Requisitos Técnicos De La Norma Iso Iec 17025 2005, En La Medición De Las Variables Humedad Y Temperatura En El Ideam, Victoria Andrea Leal Saavedra, Jonathan Jesús Cuéllar Guzmán 2014 Universidad de La Salle, Bogotá

Diseño De Un Sistema De Adquisición Y Visualización De Los Resultados, Con Base En El Diagnóstico De Los Requisitos Técnicos De La Norma Iso Iec 17025 2005, En La Medición De Las Variables Humedad Y Temperatura En El Ideam, Victoria Andrea Leal Saavedra, Jonathan Jesús Cuéllar Guzmán

Ingeniería en Automatización

El proceso de calibración de un instrumento se basa principalmente en una serie de procedimientos establecidos que parten desde la reparación mecánica y estética y finalizan dejando a punto dicho instrumento. Teniendo en cuenta lo anterior, cabe aclarar que los laboratorios encargados de realizar estos procesos, son laboratorios que cuentan con una calificación de alta calidad a nivel internacional, esto por medio de una certificación de cumplimiento de la norma ISO/IEC 17025:2005 en el caso de los laboratorios de muestreo y calibración. Para cumplir con este tipo de normas se deben seguir una serie de indicaciones que están propuestas en …


Foss Big Data Storage Solution, Gary L. Jaffe 2014 CSUS

Foss Big Data Storage Solution, Gary L. Jaffe

STAR Program Research Presentations

Utilizing the AERO Institute as an IT test bed or “sandbox”, a small-agile development team will design, build, and test a data management storage system to support post processing of archived and in-flight data collected with the Piccolo flight control system and Compact Fiber Optic Sensing System (C-FOSS). Both systems are integrated on the APV3 aircraft, a small remote-operated vehicle. Due to the amount of data collected from C-FOSS, a system will be designed to sort and organize large data sets. An open-source database will be explored as a viable solution to manage large data loads and provide multi-cluster system …


Foss Big Data Storage Solution, nurdeen salami 2014 NASA AFRC

Foss Big Data Storage Solution, Nurdeen Salami

STAR Program Research Presentations

NASA projects require a reliable approach to store large volumes of data. Accordingly, it is crucial to adopt a lightweight, reliable, and scalable database. Current NASA databases bear costly license fees with undesirable speed and flexibility. The purpose of utilizing the AERO Institute as an IT test bed, or “Sandbox,” is to design, build, test, and implement software solutions prior to transfer to NASA projects. Cassandra coupled with the Astyanax API is a viable solution for storing big data. Store a minimum of 2GB of C-FOSS data in multiple file formats (.csv, .log, .xml, and .jpg). Use benchmark tests to …


Ultimate Codes: Near-Optimal Mds Array Codes For Raid-6, Zhijie Huang, Hong Jiang, Chong Wang, Ke Zhou, Yuhong Zhao 2014 Huazhong University of Science and Technology

Ultimate Codes: Near-Optimal Mds Array Codes For Raid-6, Zhijie Huang, Hong Jiang, Chong Wang, Ke Zhou, Yuhong Zhao

CSE Technical Reports

As modern storage systems have grown in size and complexity, RAID-6 is poised to replace RAID-5 as the dominant form of RAID architectures due to its ability to protect against double disk failures. Many excellent erasure codes specially designed for RAID-6 have emerged in recent years. However, all of them have limitations. In this paper, we present a class of near perfect erasure codes for RAID-6, called the Ultimate codes. These codes encode, update and decode either optimally or nearly optimally, regardless of what the code length is. This implies that utilizing these codes we can build highly efficient and …


Digital Commons powered by bepress