Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics

Data analysis

Institution
Publication Year
Publication
Publication Type
File Type

Articles 1 - 30 of 33

Full-Text Articles in Engineering

Supporting Text And Data Analysis Across Campus From The Academic Library, Amy Kirchhoff, Hejin Shin Phd Apr 2024

Supporting Text And Data Analysis Across Campus From The Academic Library, Amy Kirchhoff, Hejin Shin Phd

Digital Initiatives Symposium

The ability to comprehend and communicate with text-based data is essential to future success in academics and employment, as evidenced in a recent survey from Bloomberg Research Services which shows that nearly 97% of survey respondents now use data analytics in their companies and 58% consider data and text mining a business analytics tool (https://www.sas.com/content/dam/SAS/bp_de/doc/studie/ba-st-the-current-state-of-business-analytics-2317022.pdf). This has fueled a substantial growth in text analysis research (involving the use of technology to analyze un- and semi-structured text data for valuable insights, trends, and patterns) across disciplines and a corresponding demand on academic libraries to support text analysis pedagogy and text analysis …


The Effectiveness Of Visualization Techniques For Supporting Decision-Making, Cansu Yalim, Holly A. H. Handley Apr 2023

The Effectiveness Of Visualization Techniques For Supporting Decision-Making, Cansu Yalim, Holly A. H. Handley

Modeling, Simulation and Visualization Student Capstone Conference

Although visualization is beneficial for evaluating and communicating data, the efficiency of various visualization approaches for different data types is not always evident. This research aims to address this issue by investigating the usefulness of several visualization techniques for various data kinds, including continuous, categorical, and time-series data. The qualitative appraisal of each technique's strengths, weaknesses, and interpretation of the dataset is investigated. The research questions include: which visualization approaches perform best for different data types, and what factors impact their usefulness? The absence of clear directions for both researchers and practitioners on how to identify the most effective visualization …


State Prediction Of Poverty Alleviation Objects Based On Hmm And Multidimensional Data, Jun He, Sunyan Hong, Yifang Zhou, Shikai Shen, Muquan Zou May 2022

State Prediction Of Poverty Alleviation Objects Based On Hmm And Multidimensional Data, Jun He, Sunyan Hong, Yifang Zhou, Shikai Shen, Muquan Zou

Journal of System Simulation

Abstract: In order to solve the problems of inaccurate prediction of poverty, poverty reduction and poverty returen, and the difficulty in identifying the key factors affecting the state transition, 8 key features and 22 observed states are extracted from the poverty reduction basic data and multi-industry data. The relationship between observed state and implied state is constructed, and the hidden markov model (HMM) of poverty alleviation is established. Data of a deep poverty county for three consecutive years are used as samples for parameter training, test experiment and result verification. The results show that the method has a strong …


How Blockchain Solutions Enable Better Decision Making Through Blockchain Analytics, Sammy Ter Haar May 2022

How Blockchain Solutions Enable Better Decision Making Through Blockchain Analytics, Sammy Ter Haar

Information Systems Undergraduate Honors Theses

Since the founding of computers, data scientists have been able to engineer devices that increase individuals’ opportunities to communicate with each other. In the 1990s, the internet took over with many people not understanding its utility. Flash forward 30 years, and we cannot live without our connection to the internet. The internet of information is what we called early adopters with individuals posting blogs for others to read, this was known as Web 1.0. As we progress, platforms became social allowing individuals in different areas to communicate and engage with each other, this was known as Web 2.0. As Dr. …


Big Data With Cloud Computing: Discussions And Challenges, Amanpreet Kaur Sandhu Mar 2022

Big Data With Cloud Computing: Discussions And Challenges, Amanpreet Kaur Sandhu

Big Data Mining and Analytics

With the recent advancements in computer technologies, the amount of data available is increasing day by day. However, excessive amounts of data create great challenges for users. Meanwhile, cloud computing services provide a powerful environment to store large volumes of data. They eliminate various requirements, such as dedicated space and maintenance of expensive computer hardware and software. Handling big data is a time-consuming task that requires large computational clusters to ensure successful data storage and processing. In this work, the definition, classification, and characteristics of big data are discussed, along with various cloud services, such as Microsoft Azure, Google Cloud, …


The State Of The Art Of Information Integration In Space Applications, Zhuming Bi, K. L. Yung, Andrew W.H. Ip., Yuk Ming Tang, Chris W.J. Zhang, Li Da Xu Jan 2022

The State Of The Art Of Information Integration In Space Applications, Zhuming Bi, K. L. Yung, Andrew W.H. Ip., Yuk Ming Tang, Chris W.J. Zhang, Li Da Xu

Information Technology & Decision Sciences Faculty Publications

This paper aims to present a comprehensive survey on information integration (II) in space informatics. With an ever-increasing scale and dynamics of complex space systems, II has become essential in dealing with the complexity, changes, dynamics, and uncertainties of space systems. The applications of space II (SII) require addressing some distinctive functional requirements (FRs) of heterogeneity, networking, communication, security, latency, and resilience; while limited works are available to examine recent advances of SII thoroughly. This survey helps to gain the understanding of the state of the art of SII in sense that (1) technical drivers for SII are discussed and …


A Longitudinal Analysis Of Pathways To Computing Careers: Defining Broadening Participation In Computing (Bpc) Success With A Rearview Lens, Mercy Jaiyeola Dec 2021

A Longitudinal Analysis Of Pathways To Computing Careers: Defining Broadening Participation In Computing (Bpc) Success With A Rearview Lens, Mercy Jaiyeola

Theses and Dissertations

Efforts to increase the participation of groups historically underrepresented in computing studies, and in the computing workforce, are well documented. It is a national effort with funding from a variety of sources being allocated to research in broadening participation in computing (BPC). Many of the BPC efforts are funded by the National Science Foundation (NSF) but as existing literature shows, the growth in representation of traditionally underrepresented minorities and women is not commensurate to the efforts and resources that have been directed toward this aim.

Instead of attempting to tackle the barriers to increasing representation, this dissertation research tackles the …


A Machine Learning Approach To Understanding Emerging Markets, Namita Balani Jul 2021

A Machine Learning Approach To Understanding Emerging Markets, Namita Balani

Graduate Theses and Dissertations

Logistic providers have learned to efficiently serve their existing customer bases with optimized routes and transportation resource allocation. The problem arises when there is potential for logistics growth in an emerging market with no previous data. The purpose of this work is to use industry data for previously known and well-documented markets to apply data analytic techniques such as machine learning to investigate the uncertainty in a new market. The thesis looks into machine learning techniques to predict miles per stop given historical data. It mainly focuses on Random Forest Regression Analysis, but concludes that additional techniques, such as Polynomial …


Intermittent Dynamics Of Dense Particulate Matter, Chao Cheng May 2021

Intermittent Dynamics Of Dense Particulate Matter, Chao Cheng

Dissertations

Granular particle systems are scattered around the universe, and they can behave like solids when there exist strong force-bearing networks, so that the granular system can resist certain stress without deformation. When such a network is not present, particles yield to small stress and behave like a fluid. A wide range of systems exhibit intermittent dynamics as they are slowly loaded, with different dynamical regimes governing many industrial and natural phenomena. While a significant amount of research on exploring intermittent dynamics of granular systems has been carried out, not much is known about the connection between particle-scale response and the …


A Deep Machine Learning Approach For Predicting Freeway Work Zone Delay Using Big Data, Abdullah Shabarek Dec 2020

A Deep Machine Learning Approach For Predicting Freeway Work Zone Delay Using Big Data, Abdullah Shabarek

Dissertations

The introduction of deep learning and big data analytics may significantly elevate the performance of traffic speed prediction. Work zones become one of the most critical factors causing congestion impact, which reduces the mobility as well as traffic safety. A comprehensive literature review on existing work zone delay prediction models (i.e., parametric, simulation and non-parametric models) is conducted in this research. The research shows the limitations of each model. Moreover, most previous modeling approaches did not consider user delay for connected freeways when predicting traffic speed under work zone conditions. This research proposes Deep Artificial Neural Network (Deep ANN) and …


Forecasting Of Short-Term Power Load Of Secrpso-Svm Based On Data-Driven, Hairong Sun, Bixia Xie, Tian Yao, Zhuoqun Li Jun 2020

Forecasting Of Short-Term Power Load Of Secrpso-Svm Based On Data-Driven, Hairong Sun, Bixia Xie, Tian Yao, Zhuoqun Li

Journal of System Simulation

Abstract: For the parameter selection of support vector machine in modeling, a particle swarm optimization algorithm based on second-order oscillation and repulsion factor was proposed to optimize the parameter of SVM. The algorithm employed the nonlinear decreasing weight to balance the global and local search ability. Second-order oscillation factor could maintain the population diversity. The repulsion factor was introduced to make the swarm even distribution in search space, which could avoid local optimum. For the complex characteristics of nonlinearity, time-varying and multifactorial of electric power load, a support vector machine forecasting model based on data was proposed, and the influence …


Securing The Emerging Technologies Of Autonomous And Connected Vehicles, Shahab Tayeb, Matin Pirouz Apr 2020

Securing The Emerging Technologies Of Autonomous And Connected Vehicles, Shahab Tayeb, Matin Pirouz

Mineta Transportation Institute

The Internet of Vehicles (IoV) aims to establish a network of autonomous and connected vehicles that communicate with one another through facilitation led by road-side units (RSUs) and a central trust authority (TA). Messages must be efficiently and securely disseminated to conserve resources and preserve network security. Currently, research in this area lacks consensus about security schemes and methods of disseminating messages. Furthermore, a current deficiency of information regarding resource optimization prevents further efficient development of this network. This paper takes an interdisciplinary approach to these issues by merging both cybersecurity and data science to optimize and secure the network. …


Algorithm Selection Framework: A Holistic Approach To The Algorithm Selection Problem, Marc W. Chalé Mar 2020

Algorithm Selection Framework: A Holistic Approach To The Algorithm Selection Problem, Marc W. Chalé

Theses and Dissertations

A holistic approach to the algorithm selection problem is presented. The “algorithm selection framework" uses a combination of user input and meta-data to streamline the algorithm selection for any data analysis task. The framework removes the conjecture of the common trial and error strategy and generates a preference ranked list of recommended analysis techniques. The framework is performed on nine analysis problems. Each of the recommended analysis techniques are implemented on the corresponding data sets. Algorithm performance is assessed using the primary metric of recall and the secondary metric of run time. In six of the problems, the recall of …


Social Media Based Algorithmic Clinical Decision Support Learning From Behavioral Predispositions, Radhika V. Medury Jan 2020

Social Media Based Algorithmic Clinical Decision Support Learning From Behavioral Predispositions, Radhika V. Medury

Doctoral Dissertations

Behavioral disorders are disabilities characterized by an individual’s mood, thinking, and social interactions. The commonality of behavioral disorders amongst the United States population has increased in the last few years, with an estimated 50% of all Americans diagnosed with a behavioral disorder at some point in their lifetime. AttentionDeficit/Hyperactivity Disorder is one such behavioral disorder that is a severe public health concern because of its high prevalence, incurable nature, significant impact on domestic life, and peer relationships. Symptomatically, in theory, ADHD is characterized by inattention, hyperactivity, and impulsivity. Access to providers who can offer diagnosis and treat the disorder varies …


Disaster Damage Categorization Applying Satellite Images And Machine Learning Algorithm, Farinaz Sabz Ali Pour, Adrian Gheorghe Jan 2020

Disaster Damage Categorization Applying Satellite Images And Machine Learning Algorithm, Farinaz Sabz Ali Pour, Adrian Gheorghe

Engineering Management & Systems Engineering Faculty Publications

Special information has a significant role in disaster management. Land cover mapping can detect short- and long-term changes and monitor the vulnerable habitats. It is an effective evaluation to be included in the disaster management system to protect the conservation areas. The critical visual and statistical information presented to the decision-makers can help in mitigation or adaption before crossing a threshold. This paper aims to contribute in the academic and the practice aspects by offering a potential solution to enhance the disaster data source effectiveness. The key research question that the authors try to answer in this paper is how …


Identifying Regional Trends In Avatar Customization, Peter Mawhorter, Sercan Sengun, Haewoon Kwak, D. Fox Harrell Dec 2019

Identifying Regional Trends In Avatar Customization, Peter Mawhorter, Sercan Sengun, Haewoon Kwak, D. Fox Harrell

Research Collection School Of Computing and Information Systems

Since virtual identities such as social media profiles and avatars have become a common venue for self-expression, it has become important to consider the ways in which existing systems embed the values of their designers. In order to design virtual identity systems that reflect the needs and preferences of diverse users, understanding how the virtual identity construction differs between groups is important. This paper presents a new methodology that leverages deep learning and differential clustering for comparative analysis of profile images, with a case study of almost 100 000 avatars from a large online community using a popular avatar creation …


Twitter And Disasters: A Social Resilience Fingerprint, Benjamin A. Rachunok, Jackson B. Bennett, Roshanak Nateghi May 2019

Twitter And Disasters: A Social Resilience Fingerprint, Benjamin A. Rachunok, Jackson B. Bennett, Roshanak Nateghi

Purdue University Libraries Open Access Publishing Fund

Understanding the resilience of a community facing a crisis event is critical to improving its adaptive capacity. Community resilience has been conceptualized as a function of the resilience of components of a community such as ecological, infrastructure, economic, and social systems, etc. In this paper, we introduce the concept of a “resilience fingerprint” and propose a multi-dimensional method for analyzing components of community resilience by leveraging existing definitions of community resilience with data from the social network Twitter. Twitter data from 14 events are analyzed and their resulting resilience fingerprints computed. We compare the fingerprints between events and show that …


A Bayesian Network Based Adaptability Design Of Product Structures For Function Evolution, Shaobo Li, Yongming Wu, Yan-Xia Xu, Jie Hu, Jianjun Hu Mar 2018

A Bayesian Network Based Adaptability Design Of Product Structures For Function Evolution, Shaobo Li, Yongming Wu, Yan-Xia Xu, Jie Hu, Jianjun Hu

Faculty Publications

Structure adaptability design is critical for function evolution in product families, in which many structural and functional design factors are intertwined together with manufacturing cost, customer satisfaction, and final market sales. How to achieve a delicate balance among all of these factors to maximize the market performance of the product is too complicated to address based on traditional domain experts’ knowledge or some ad hoc heuristics. Here, we propose a quantitative product evolution design model that is based on Bayesian networks to model the dynamic relationship between customer needs and product structure design. In our model, all of the structural …


Pythagorean Combinations For Lego Robot Building., Ronald I. Greenberg Jan 2018

Pythagorean Combinations For Lego Robot Building., Ronald I. Greenberg

Ronald Greenberg

This paper provides tips for LEGO robot construction involving bracing or gear meshing along a diagonal using standard Botball kits.


Pythagorean Approximations For Lego: Merging Educational Robot Construction With Programming And Data Analysis, Ronald I. Greenberg Jan 2018

Pythagorean Approximations For Lego: Merging Educational Robot Construction With Programming And Data Analysis, Ronald I. Greenberg

Ronald Greenberg

Abstract. This paper can be used in two ways. It can provide reference information for incorporating diagonal elements (for bracing or gear meshing) in educational robots built from standard LEGO kits. Alternatively, it can be used as the basis for an assignment for high school or college students to recreate this information; in the process, students will exercise skills in both computer programming and data analysis. Using the paper in the second way can be an excellent integrative experience to add to an existing course; for example, the Exploring Computer Science high school curriculum concludes with the units “Introduction to …


Smartphone Sensing Meets Transport Data: A Collaborative Framework For Transportation Service Analytics, Yu Lu, Archan Misra, Wen Sun, Huayu Wu Aug 2017

Smartphone Sensing Meets Transport Data: A Collaborative Framework For Transportation Service Analytics, Yu Lu, Archan Misra, Wen Sun, Huayu Wu

Research Collection School Of Computing and Information Systems

We advocate for and introduce TRANSense, a framework for urban transportation service analytics that combines participatory smartphone sensing data with city-scale transportation-related transactional data (taxis, trains etc.). Our work is driven by the observed limitations of using each data type in isolation: (a) commonly-used anonymous city-scale datasets (such as taxi bookings and GPS trajectories) provide insights into the aggregate behavior of transport infrastructure, but fail to reveal individual-specific transport experiences (e.g., wait times in taxi queues); while (b) mobile sensing data can capture individual-specific commuting-related activities, but suffers from accuracy and energy overhead challenges due to usage artefacts and lack …


Data Analysis And Processing Techniques For Remaining Useful Life Estimations, John Scott Bucknam Jun 2017

Data Analysis And Processing Techniques For Remaining Useful Life Estimations, John Scott Bucknam

Theses and Dissertations

In the field of engineering, it is important to understand different engineering systems and components, not only in how they currently perform, but also how their performance degrades over time. This extends to the field of prognostics, which attempts to predict the future of a system or component based on its past and present states. A common problem in this field is the estimation of remaining useful life, or how long a system or component functionality will last. The well-known datasets for this problem are the PHM and C-MAPSS datasets. These datasets contain simulated sensor data for different turbofan engines …


Pythagorean Approximations For Lego: Merging Educational Robot Construction With Programming And Data Analysis, Ronald I. Greenberg Apr 2017

Pythagorean Approximations For Lego: Merging Educational Robot Construction With Programming And Data Analysis, Ronald I. Greenberg

Computer Science: Faculty Publications and Other Works

Abstract. This paper can be used in two ways. It can provide reference information for incorporating diagonal elements (for bracing or gear meshing) in educational robots built from standard LEGO kits. Alternatively, it can be used as the basis for an assignment for high school or college students to recreate this information; in the process, students will exercise skills in both computer programming and data analysis. Using the paper in the second way can be an excellent integrative experience to add to an existing course; for example, the Exploring Computer Science high school curriculum concludes with the units “Introduction to …


Integrative Pathway Analysis Pipeline For Mirna And Mrna Data, Diana Mabel Diaz Herrera Jan 2017

Integrative Pathway Analysis Pipeline For Mirna And Mrna Data, Diana Mabel Diaz Herrera

Wayne State University Theses

The identification of pathways that are involved in a particular phenotype helps us understand the underlying biological processes. Traditional pathway analysis techniques aim to infer the impact on individual pathways using only mRNA levels. However, recent studies showed that gene expression alone is unable to capture the whole picture of biological phenomena. At the same time, MicroRNAs (miRNAs) are newly discovered gene regulators that have shown to play an important role in diagnosis, and prognosis for different types of diseases. Current pathway analysis techniques do not take miRNAs into consideration. In this project, we investigate the effect of integrating miRNA …


An Online Approach For Feature Selection For Classification In Big Data, Nasrin Banu Nazar, Radha Senthilkumar Jan 2017

An Online Approach For Feature Selection For Classification In Big Data, Nasrin Banu Nazar, Radha Senthilkumar

Turkish Journal of Electrical Engineering and Computer Sciences

Feature selection (FS), also known as attribute selection, is a process of selection of a subset of relevant features used in model construction. This process or method improves the classification accuracy by removing irrelevant and noisy features. FS is implemented using either batch learning or online learning. Currently, the FS methods are executed in batch learning. Nevertheless, these techniques take longer execution time and require larger storage space to process the entire dataset. Due to the lack of scalability, the batch learning process cannot be used for large data. In the present study, a scalable efficient Online Feature Selection (OFS) …


Pythagorean Combinations For Lego Robot Building., Ronald I. Greenberg Jul 2016

Pythagorean Combinations For Lego Robot Building., Ronald I. Greenberg

Computer Science: Faculty Publications and Other Works

This paper provides tips for LEGO robot construction involving bracing or gear meshing along a diagonal using standard Botball kits.


Feature Knowledge Based Fault Detection Of Induction Motors Through The Analysis Of Stator Current Data, Ting Yang, Haibo Pen, Zhaoxia Wang, Che Sau Chang Mar 2016

Feature Knowledge Based Fault Detection Of Induction Motors Through The Analysis Of Stator Current Data, Ting Yang, Haibo Pen, Zhaoxia Wang, Che Sau Chang

Research Collection School Of Computing and Information Systems

The fault detection of electrical or mechanical anomalies in induction motors has been a challenging problem for researchers over decades to ensure the safety and economic operations of industrial processes. To address this issue, this paper studies the stator current data obtained from inverter-fed laboratory induction motors and investigates the unique signatures of the healthy and faulty motors with the aim of developing knowledge based fault detection method for performing online detection of motor fault problems, such as broken-rotor-bar and bearing faults. Stator current data collected from induction motors were analyzed by leveraging fast Fourier transform (FFT), and the FFT …


Ad-Hoc Automated Teller Machine Failure Forecast And Field Service Optimization, Michelle L. F. Cheong, Ping Shung Koo, B. Chandra Babu Aug 2015

Ad-Hoc Automated Teller Machine Failure Forecast And Field Service Optimization, Michelle L. F. Cheong, Ping Shung Koo, B. Chandra Babu

Research Collection School Of Computing and Information Systems

As part of its overall effort to maintain good customer service while managing operational efficiency and reducing cost, a bank in Singapore has embarked on using data and decision analytics methodologies to perform better ad-hoc ATM failure forecasting and plan the field service engineers to repair the machines. We propose using a combined Data and Decision Analytics Framework which helps the analyst to first understand the business problem by collecting, preparing and exploring data to gain business insights, before proposing what objectives and solutions can and should be done to solve the problem. This paper reports the work in analyzing …


The Allen Telescope Array Search For Electrostatic Discharges On Mars, Marin M. Anderson, Andrew P.V. Siemion, William C. Barott, Geoffery C. Bower, Gregory T. Delory, Imke De Pater, Dan Werthimer Mar 2014

The Allen Telescope Array Search For Electrostatic Discharges On Mars, Marin M. Anderson, Andrew P.V. Siemion, William C. Barott, Geoffery C. Bower, Gregory T. Delory, Imke De Pater, Dan Werthimer

William Barott

The Allen Telescope Array was used to monitor Mars between 2010 March 9 and June 2, over a total of approximately 30 hr, for radio emission indicative of electrostatic discharge. The search was motivated by the report from Ruf et al. of the detection of non-thermal microwave radiation from Mars characterized by peaks in the power spectrum of the kurtosis, or kurtstrum, at 10 Hz, coinciding with a large dust storm event on 2006 June 8. For these observations, we developed a wideband signal processor at the Center for Astronomy Signal Processing and Electronics Research. This 1024 channel spectrometer calculates …


Characterization Of Samples For Optimization Of Infrared Stray Light Coatings, Carey L. Baxter, Rebecca Salvemini, Zaheer A. Ali, Patrick Waddell, Greg Perryman, Bob Thompson Aug 2013

Characterization Of Samples For Optimization Of Infrared Stray Light Coatings, Carey L. Baxter, Rebecca Salvemini, Zaheer A. Ali, Patrick Waddell, Greg Perryman, Bob Thompson

STAR Program Research Presentations

NASA’s Stratospheric Observatory for Infrared Astronomy (SOFIA) is a converted 747SP that houses a 2.5 m telescope that observes the sky through an opening in the side of the aircraft. Because it flies at altitudes up to 45,000 feet, SOFIA gets 99.99% transmission in the infrared. Multiple science instruments mount one at a time on the telescope to interpret infrared and visible light from target sources. Ball Infrared Black (BIRB) currently coats everything that the optics sees inside the telescope assembly (TA) cavity in order to eliminate noise from the glow of background sky, aircraft exhaust, and other sources. A …