Open Access. Powered by Scholars. Published by Universities.®

Data Science Commons

Open Access. Powered by Scholars. Published by Universities.®

86 Full-Text Articles 230 Authors 0 Downloads 39 Institutions

All Articles in Data Science

Faceted Search

86 full-text articles. Page 4 of 5.

An Analytical Examination On The Effects Of Vegetarian And Omnivorous Diets On C-Reactive Protein, Aletha Kleis 2020 Central Washington University

An Analytical Examination On The Effects Of Vegetarian And Omnivorous Diets On C-Reactive Protein, Aletha Kleis

Undergraduate Honors Theses

There is a lack of research regarding how following a vegetarian or omnivores diet effects C-Reactive Protein (CRP) levels of people as seen through results from an analysis of data gathered from the National Health and Nutrition Examination Survey (NHANES). The level of CRP is a reflection of how much inflammation there is in one’s body and is a popular indicator of risk for heart disease. Thus, in this research, I use the NHANES data to look at the relationship of CRP levels of people who identified themselves as vegetarian or not, while also considering the general healthiness of ...


Multimodal Fusion Strategies For Outcome Prediction In Stroke, Esra Zihni, John D. Kelleher, Vince I. Madai, Ahmed Khalil, Ivana Galinovic, Jochen Fiebach, Michelle Livne, Dietmar Frey 2020 Technological University Dublin

Multimodal Fusion Strategies For Outcome Prediction In Stroke, Esra Zihni, John D. Kelleher, Vince I. Madai, Ahmed Khalil, Ivana Galinovic, Jochen Fiebach, Michelle Livne, Dietmar Frey

Conference papers

Data driven methods are increasingly being adopted in the medical domain for clinical predictive modeling. Prediction of stroke outcome using machine learning could provide a decision support system for physicians to assist them in patient-oriented diagnosis and treatment. While patient-specific clinical parameters play an important role in outcome prediction, a multimodal fusion approach that integrates neuroimaging with clinical data has the potential to improve accuracy. This paper addresses two research questions: (a) does multimodal fusion aid in the prediction of stroke outcome, and (b) what fusion strategy is more suitable for the task at hand. The baselines for our experimental ...


Edge-Cloud Computing For Iot Data Analytics: Embedding Intelligence In The Edge With Deep Learning, Ananda Mohon M. Ghosh, Katarina Grolinger 2020 The University of Western Ontario

Edge-Cloud Computing For Iot Data Analytics: Embedding Intelligence In The Edge With Deep Learning, Ananda Mohon M. Ghosh, Katarina Grolinger

Electrical and Computer Engineering Publications

Rapid growth in numbers of connected devices including sensors, mobile, wearable, and other Internet of Things (IoT) devices, is creating an explosion of data that are moving across the network. To carry out machine learning (ML), IoT data are typically transferred to the cloud or another centralized system for storage and processing; however, this causes latencies and increases network traffic. Edge computing has the potential to remedy those issues by moving computation closer to the network edge and data sources. On the other hand, edge computing is limited in terms of computational power and thus is not well suited for ...


Image Features For Tuberculosis Classification In Digital Chest Radiographs, Brian Hooper 2020 Central Washington University

Image Features For Tuberculosis Classification In Digital Chest Radiographs, Brian Hooper

All Master's Theses

Tuberculosis (TB) is a respiratory disease which affects millions of people each year, accounting for the tenth leading cause of death worldwide, and is especially prevalent in underdeveloped regions where access to adequate medical care may be limited. Analysis of digital chest radiographs (CXRs) is a common and inexpensive method for the diagnosis of TB; however, a trained radiologist is required to interpret the results, and is subject to human error. Computer-Aided Detection (CAD) systems are a promising machine-learning based solution to automate the diagnosis of TB from CXR images. As the dimensionality of a high-resolution CXR image is very ...


Finding Common Ground For Citizen Empowerment In The Smart City, John D. Kelleher, Aphra Kerr 2020 Technological University Dublin

Finding Common Ground For Citizen Empowerment In The Smart City, John D. Kelleher, Aphra Kerr

Articles

Corporate smart city initiatives are just one example of the contemporary culture of surveillance. They rely on extensive information gathering systems and Big Data analysis to predict citizen behaviour and optimise city services. In this paper we argue that many smart city and social media technologies result in a paradox whereby digital inclusion for the purposes of service provision also results in marginalisation and disempowerment of citizens. Drawing upon insights garnered from a digital inclusion workshop conducted in the Galapagos islands, we propose that critically and creatively unpacking the computational techniques embedded in data services is needed as a first ...


Critical Media, Information, And Digital Literacy: Increasing Understanding Of Machine Learning Through An Interdisciplinary Undergraduate Course, Barbara R. Burke, Elena Machkasova 2020 University of Minnesota, Morris

Critical Media, Information, And Digital Literacy: Increasing Understanding Of Machine Learning Through An Interdisciplinary Undergraduate Course, Barbara R. Burke, Elena Machkasova

Communication, Media, and Rhetoric Publications

Widespread use of Artificial Intelligence in all areas of today’s society creates a unique problem: algorithms used in decision-making are generally not understandable to those without a background in data science. Thus, those who use out-of-the-box Machine Learning (ML) approaches in their work and those affected by these approaches are often not in a position to analyse their outcomes and applicability. Our paper describes and evaluates our undergraduate course at the University of Minnesota Morris, which fosters understanding of the main ideas behind ML. With Communication, Media & Rhetoric and Computer Science faculty expertise, students from a variety of majors ...


The Trust Principles For Digital Repositories, Dawei Lin, Jonathan Crabtree, Ingrid Dillo, Robert R. Downs, Rorie Edmunds, David Giaretta, Marisa De Giusti, Hervé L'Hours, Wim Hugo, Reyna Jenkyns, Varsha Khodiyar, Maryann E. Martone, Mustapha Mokrane, Vivek Navale, Jonathan Petters, Barbara Sierman, Dina V. Sokolova, Martina Stockhause, John Westbrook 2020 United States National Institutes of Health

The Trust Principles For Digital Repositories, Dawei Lin, Jonathan Crabtree, Ingrid Dillo, Robert R. Downs, Rorie Edmunds, David Giaretta, Marisa De Giusti, Hervé L'Hours, Wim Hugo, Reyna Jenkyns, Varsha Khodiyar, Maryann E. Martone, Mustapha Mokrane, Vivek Navale, Jonathan Petters, Barbara Sierman, Dina V. Sokolova, Martina Stockhause, John Westbrook

Copyright, Fair Use, Scholarly Communication, etc.

As information and communication technology has become pervasive in our society, we are increasingly dependent on both digital data and repositories that provide access to and enable the use of such resources. Repositories must earn the trust of the communities they intend to serve and demonstrate that they are reliable and capable of appropriately managing the data they hold.

Following a year-long public discussion and building on existing community consensus , several stakeholders, representing various segments of the digital repository community, have collaboratively developed and endorsed a set of guiding principles to demonstrate digital repository trustworthiness. Transparency, Responsibility, User focus, Sustainability ...


Invariance And Invertibility In Deep Neural Networks, Han Zhang 2020 Virginia Commonwealth University

Invariance And Invertibility In Deep Neural Networks, Han Zhang

Theses and Dissertations

Machine learning is concerned with computer systems that learn from data instead of being explicitly programmed to solve a particular task. One of the main approaches behind recent advances in machine learning involves neural networks with a large number of layers, often referred to as deep learning. In this dissertation, we study how to equip deep neural networks with two useful properties: invariance and invertibility. The first part of our work is focused on constructing neural networks that are invariant to certain transformations in the input, that is, some outputs of the network stay the same even if the input ...


Real-Time Feedback For Colonoscopy In A Multicenter Clinical Trial, Wallapak Tavanapong, JungHwan Oh, Gavin Kijkul, Jacob Pratt, Johnny Wong, Piet C. deGroen 2020 Iowa State University

Real-Time Feedback For Colonoscopy In A Multicenter Clinical Trial, Wallapak Tavanapong, Junghwan Oh, Gavin Kijkul, Jacob Pratt, Johnny Wong, Piet C. Degroen

Computer Science Conference Presentations, Posters and Proceedings

We report the technical challenges, solutions, and lessons learned from deploying real-time feedback systems in three hospitals as part of a multi-center controlled clinical trial to improve quality of colonoscopy. Previous clinical trials were conducted in one center. The technical challenges for our multicenter clinical trial include 1) reducing additional work by the endoscopists to utilize real-time feedback, 2) handling different colonoscopy practices at different hospitals, and 3) training an effective CNN-based classification model with a large variety of patterns of data in day-to-day colonoscopy practice. We report performance of our real-time systems over a period of 20 weeks at ...


The Mathematics, Computer Science, And Data Science Student Research Showcase, Seton Hall University 2020 Seton Hall University

The Mathematics, Computer Science, And Data Science Student Research Showcase, Seton Hall University

Petersheim Academic Exposition

No abstract provided.


Health-Aware Food Planner: A Personalized Recipe Generation Approach Based On Gpt-2, Bushra Aljbawi 2020 Wilfrid Laurier University

Health-Aware Food Planner: A Personalized Recipe Generation Approach Based On Gpt-2, Bushra Aljbawi

Theses and Dissertations (Comprehensive)

"What to eat today?" With the flourish of Internet, more and more people nowadays are inclined to find an answer to this most problematic question online. The recent explosion of food networks; however, produces large volumes of recipes, making it even harder to make an informed decision. This yields the need for advanced decision-making algorithms and efficient recommendation systems. Conventional recommender systems are not feasible anymore as food is a complicated feature that presents unique challenges and is less studied. For example, it can be one of the main reasons for obesity and many other chronic diseases. Food recommender system ...


Sediment Dynamics In The Magdalena River Basin, Colombia: Implications For Understanding Tropical River Processes And Hydropower Development, Luke H. Fisher 2020 The University Of Montana

Sediment Dynamics In The Magdalena River Basin, Colombia: Implications For Understanding Tropical River Processes And Hydropower Development, Luke H. Fisher

Graduate Student Theses, Dissertations, & Professional Papers

The Magdalena River Basin of Colombia has a globally relevant sediment flux, however, studies of the sediment regime in the basin are limited in scope. This knowledge gap limits application of understanding of sediment dynamics to hydropower decision making. To close this gap, we implemented a sediment budget framework to quantify the impacts of hydropower development in a 118,000 km2 portion of the Magdalena River basin. We informed this framework with analysis of background erosion rates derived from 10Be cosmogenic nuclides and modern sediment fluxes derived from monitoring and optical remote sensing. We standardized these data to ...


Three Essays On Health Economics And Policy Evaluation, Shishir Shakya 2020 West Virginia University

Three Essays On Health Economics And Policy Evaluation, Shishir Shakya

Graduate Theses, Dissertations, and Problem Reports

This dissertation consists of three essays on the U.S. Health care policy. Each paragraph below refers to the three abstracts for the three chapters in this dissertation, respectively. I provide quantitative evidence on how much Prescription Drug Monitoring Programs (PDMPs) affects the retail opioid prescribing behaviors. Using the American Community Survey (ACS), I retrieve county-level high dimensional panel data set from 2010 to 2017. I employ three separate identification strategies: difference-in-difference, double selection post-LASSO, and spatial difference-in-difference. I compare how the retail opioid prescribing behaviors of counties, that are mandatory for prescribers to check the PDMP before prescribing controlled ...


A Study On Real-Time Database Technology And Its Applications, Geethmi Nimantha Dissanayake 2020 Eastern Illinois University

A Study On Real-Time Database Technology And Its Applications, Geethmi Nimantha Dissanayake

Masters Theses

No abstract provided.


What About The Environment?: Exploring The Neglected Third Dimension Of Antimicrobial Resistance, Paige E. Montfort 2019 SIT Study Abroad

What About The Environment?: Exploring The Neglected Third Dimension Of Antimicrobial Resistance, Paige E. Montfort

Independent Study Project (ISP) Collection

Antimicrobial resistance (AMR) is one of the most urgent and complex health risks of our time, with links to human health, animal health, and the environment. The majority of research and policy related to AMR, however, has been dedicated to human and animal health. The third dimension — the environment — has been relatively neglected. Conversations about this problem have begun, but gaps in understanding remain. This study explores the key barriers that have hindered developments related to the environmental aspect of AMR and some of the solutions that have begun to or could be utilized to overcome these barriers.

A grounded ...


Eavesdropping Hackers: Detecting Software Vulnerability Communication On Social Media Using Text Mining, Susan McKeever, brian keegan, Andrei Quieroz 2019 Technological University Dublin

Eavesdropping Hackers: Detecting Software Vulnerability Communication On Social Media Using Text Mining, Susan Mckeever, Brian Keegan, Andrei Quieroz

Conference papers

Abstract—Cyber security is striving to find new forms of protection against hacker attacks. An emerging approach nowadays is the investigation of security-related messages exchanged on Deep/Dark Web and even Surface Web channels. This approach can be supported by the use of supervised machine learning models and text mining techniques. In our work, we compare a variety of machine learning algorithms, text representations and dimension reduction approaches for the detection accuracies of software-vulnerability-related communications. Given the imbalanced nature of the three public datasets used, we investigate appropriate sampling approaches to boost detection accuracies of our models. In addition, we ...


Streaming Feature Grouping And Selection (Sfgs) For Big Data Classification, Noura Helal Hamad Al Nuaimi 2019 United Arab Emirates University

Streaming Feature Grouping And Selection (Sfgs) For Big Data Classification, Noura Helal Hamad Al Nuaimi

Dissertations

Real-time data has always been an essential element for organizations when the quickness of data delivery is critical to their businesses. Today, organizations understand the importance of real-time data analysis to maintain benefits from their generated data. Real-time data analysis is also known as real-time analytics, streaming analytics, real-time streaming analytics, and event processing. Stream processing is the key to getting results in real-time. It allows us to process the data stream in real-time as it arrives. The concept of streaming data means the data are generated dynamically, and the full stream is unknown or even infinite. This data becomes ...


Untapped Potential Of Clinical Text For Opioid Surveillance, Amy L. Olex, Tamas Gal, Majid Afshar, Dmitriy Dligach, Niranjan Karnik, Travis Oakes, Brihat Sharma, Meng Xie, Bridget T. McInnes, Julian Solway, Abel Kho, William Cramer, F. Gerard Moeller 2019 Virginia Commonwealth University

Untapped Potential Of Clinical Text For Opioid Surveillance, Amy L. Olex, Tamas Gal, Majid Afshar, Dmitriy Dligach, Niranjan Karnik, Travis Oakes, Brihat Sharma, Meng Xie, Bridget T. Mcinnes, Julian Solway, Abel Kho, William Cramer, F. Gerard Moeller

Wright Center for Clinical and Translational Research Works

Accurate surveillance is needed to combat the growing opioid epidemic. To investigate the potential volume of missed opioid overdoses, we compare overdose encounters identified by ICD-10-CM codes and an NLP pipeline from two different medical systems. Our results show that the NLP pipeline identified a larger percentage of OOD encounters than ICD-10-CM codes. Thus, incorporating sophisticated NLP techniques into current diagnostic methods has the potential to improve surveillance on the incidence of opioid overdoses.


A Mathematical Analysis Of The Game Of Chess, John C. White 2018 Southeastern University - Lakeland

A Mathematical Analysis Of The Game Of Chess, John C. White

Selected Honors Theses

This paper analyzes chess through the lens of mathematics. Chess is a complex yet easy to understand game. Can mathematics be used to perfect a player’s skills? The work of Ernst Zermelo shows that one player should be able to force a win or force a draw. The work of Shannon and Hardy demonstrates the complexities of the game. Combinatorics, probability, and some chess puzzles are used to better understand the game. A computer program is used to test a hypothesis regarding chess strategy. Through the use of this program, we see that it is detrimental to be the ...


Big Data And Its Visualization With Fog Computing, Richard S. Segall, Gao Niu 2018 Arkansas State University

Big Data And Its Visualization With Fog Computing, Richard S. Segall, Gao Niu

Mathematics Department Journal Articles

Big Data is data sets that are so voluminous and complex that traditional data processing application software are inadequate to deal with them. This article discusses what isBig Data, and its characteristics, and how this information revolution of Big Data is transforming our lives and the new technology and methodologiesthat have been developed to process data of these huge dimensionalities. Big Data can be discrete or a continuous stream of data, and can be accessed using many types and kinds of computing devicesranging from supercomputers, personal work stations, to mobile devices and tablets. Discussion is presented of how fog computing ...


Digital Commons powered by bepress