Open Access. Powered by Scholars. Published by Universities.®

Databases and Information Systems Commons

Open Access. Powered by Scholars. Published by Universities.®

Data Science

2022

Institution
Keyword
Publication
Publication Type

Articles 1 - 16 of 16

Full-Text Articles in Databases and Information Systems

Big Data Technology Enabling Legal Supervision, Qingjie Liu, Shuo Liu, Yirong Wu, Yueqiang Weng, Yihao Wen, Ming Li Dec 2022

Big Data Technology Enabling Legal Supervision, Qingjie Liu, Shuo Liu, Yirong Wu, Yueqiang Weng, Yihao Wen, Ming Li

Bulletin of Chinese Academy of Sciences (Chinese Version)

Legal supervision plays an important role in the national governance system and capacity. In the era of digital revolution, the rapid development of digital procuratorial work with big data legal supervision as the core promotes to reshape the legal supervision and governance system. In this study, the inherent need of legal supervision for active prosecution in the new era, and the innovative role of new public interest litigation in comprehensive social governance, are firstly analyzed. Then, the core meaning and reshaping role of big-data-enabling-legalsupervision and supervision-promoting-national-governance of digital prosecution are discussed. After summarizing the practical experiences and challenges of big …


A Maturity Model Of Data Modeling In Self-Service Business Intelligence Software, Anna Kurenkov Dec 2022

A Maturity Model Of Data Modeling In Self-Service Business Intelligence Software, Anna Kurenkov

Master of Science in Information Technology Theses

Although Self-Service Business Intelligence (SSBI) is continually being adopted in various industries, there is a lack of research focused on data modeling in SSBI. This research aims to fill that research gap and propose a maturity model for SSBI data modeling which is generalizeable between different software and applicable for users of all technical backgrounds. Through extensive literature review, a five-tier maturity model was proposed, explained, and instantiated in PowerBI and Tableau. The testing of the model was found to be simple and intuitive, and the research concludes that the model is applicable to enterprise SSBI environments. This research is …


“I Think I Discovered A Military Base In The Middle Of The Ocean”—Null Island, The Most Real Of Fictional Places, Levente Juhasz, Peter Mooney Aug 2022

“I Think I Discovered A Military Base In The Middle Of The Ocean”—Null Island, The Most Real Of Fictional Places, Levente Juhasz, Peter Mooney

GIS Center

This paper explores Null Island, a fictional place located at 0° latitude and 0° longitude in the WGS84 (World Geodetic System 1984) geographic coordinate system. Null Island is erroneously associated with large amounts of geographic data in a wide variety of location-based services, place databases, social media and web-based maps. Whereas it was originally considered a joke within the geospatial community, this article will demonstrate implications of its existence, both technological and social in nature, promoting Null Island as a fundamental issue of geographic information that requires more widespread awareness. The article summarizes error sources that lead to data being …


Automated Identification Of Astronauts On Board The International Space Station: A Case Study In Space Archaeology, Rao Hamza Ali, Amir Kanan Kashefi, Alice C. Gorman, Justin St. P. Walsh, Erik J. Linstead Aug 2022

Automated Identification Of Astronauts On Board The International Space Station: A Case Study In Space Archaeology, Rao Hamza Ali, Amir Kanan Kashefi, Alice C. Gorman, Justin St. P. Walsh, Erik J. Linstead

Art Faculty Articles and Research

We develop and apply a deep learning-based computer vision pipeline to automatically identify crew members in archival photographic imagery taken on-board the International Space Station. Our approach is able to quickly tag thousands of images from public and private photo repositories without human supervision with high degrees of accuracy, including photographs where crew faces are partially obscured. Using the results of our pipeline, we carry out a large-scale network analysis of the crew, using the imagery data to provide novel insights into the social interactions among crew during their missions.


Development Of The Implementation Of Iot Monitoring System Based On Node-Red Technology, Anvar Kabulov, Inomjon Yarashov, Salamat Mirzataev Jun 2022

Development Of The Implementation Of Iot Monitoring System Based On Node-Red Technology, Anvar Kabulov, Inomjon Yarashov, Salamat Mirzataev

Karakalpak Scientific Journal

This article describes how to design and implement a process for storing environmental information in a database using the Internet of Things. The problems that need to be solved with the help of this IoT system are the growing demand for forecasts in the world, the demand of the world market for a new sustainable method of implementing the digitization environment through the Internet of Things. The design was implemented using Arduino, Node-Red and sensors, selected when choosing a component based on the required parameters and sent to the database for monitoring and processing. A study of previous work and …


Scalable Data Analytics For Relational Databases, Graphs And Videos, Fubao Wu Jun 2022

Scalable Data Analytics For Relational Databases, Graphs And Videos, Fubao Wu

Doctoral Dissertations

Data analytics is to analyze raw data and mine insights, trends, and patterns from them. Due to the dramatic increase in data volume and size in recent years with the development of big data and cloud storage, big data analytics algorithms and techniques have been faced with more challenges. Moreover, there are various types of data formats, such as relational databases, text data, audio data, and image/video data. It is challenging to generate a unified framework or algorithm for data analytics on various data formats. Different data formats still need refined and scalable algorithms. In this dissertation, we explore three …


Nonparametric Contextual Reasoning For Question Answering Over Large Knowledge Bases, Rajarshi Das Jun 2022

Nonparametric Contextual Reasoning For Question Answering Over Large Knowledge Bases, Rajarshi Das

Doctoral Dissertations

Question answering (QA) over knowledge bases provides a user-friendly way of accessing the massive amount of information stored in them. We have experienced tremendous progress in the performance of QA systems, thanks to the recent advancements in representation learning by deep neural models. However, such deep models function as black boxes with an opaque reasoning process, are brittle, and offer very limited control (e.g. for debugging an erroneous model prediction). It is also unclear how to reliably add or update knowledge stored in their model parameters. This thesis proposes nonparametric models for question answering that disentangle logic from knowledge. For …


A Large-Scale Sentiment Analysis Of Tweets Pertaining To The 2020 Us Presidential Election, Rao Hamza Ali, Gabriela Pinto, Evelyn Lawrie, Erik J. Linstead Jun 2022

A Large-Scale Sentiment Analysis Of Tweets Pertaining To The 2020 Us Presidential Election, Rao Hamza Ali, Gabriela Pinto, Evelyn Lawrie, Erik J. Linstead

Engineering Faculty Articles and Research

We capture the public sentiment towards candidates in the 2020 US Presidential Elections, by analyzing 7.6 million tweets sent out between October 31st and November 9th, 2020. We apply a novel approach to first identify tweets and user accounts in our database that were later deleted or suspended from Twitter. This approach allows us to observe the sentiment held for each presidential candidate across various groups of users and tweets: accessible tweets and accounts, deleted tweets and accounts, and suspended or inaccessible tweets and accounts. We compare the sentiment scores calculated for these groups and provide key insights into the …


Codem: Conditional Domain Embeddings For Scalable Human Activity Recognition, Abu Zaher Md Faridee, Avijoy Chakma, Zahid Hasan, Nirmalya Roy, Archan Misra Jun 2022

Codem: Conditional Domain Embeddings For Scalable Human Activity Recognition, Abu Zaher Md Faridee, Avijoy Chakma, Zahid Hasan, Nirmalya Roy, Archan Misra

Research Collection School Of Computing and Information Systems

We explore the effect of auxiliary labels in improving the classification accuracy of wearable sensor-based human activity recognition (HAR) systems, which are primarily trained with the supervision of the activity labels (e.g. running, walking, jumping). Supplemental meta-data are often available during the data collection process such as body positions of the wearable sensors, subjects' demographic information (e.g. gender, age), and the type of wearable used (e.g. smartphone, smart-watch). This information, while not directly related to the activity classification task, can nonetheless provide auxiliary supervision and has the potential to significantly improve the HAR accuracy by providing extra guidance on how …


Rhythmedge: Enabling Contactless Heart Rate Estimation On The Edge, Zahid Hasan, Emon Dey, Sreenivasan Ramasamy Ramamurthy, Nirmalya Roy, Archan Misra Jun 2022

Rhythmedge: Enabling Contactless Heart Rate Estimation On The Edge, Zahid Hasan, Emon Dey, Sreenivasan Ramasamy Ramamurthy, Nirmalya Roy, Archan Misra

Research Collection School Of Computing and Information Systems

The primary contribution of this paper is designing and prototyping a real-time edge computing system, RhythmEdge, that is capable of detecting changes in blood volume from facial videos (Remote Photoplethysmography; rPPG), enabling cardio-vascular health assessment instantly. The benefits of RhythmEdge include non-invasive measurement of cardiovascular activity, real-time system operation, inexpensive sensing components, and computing. RhythmEdge captures a short video of the skin using a camera and extracts rPPG features to estimate the Photoplethysmography (PPG) signal using a multi-task learning framework while offloading the edge computation. In addition, we intelligently apply a transfer learning approach to the multi-task learning framework to …


College Of Education Filemaker Extraction And End-User Database Development, Andrew Tran May 2022

College Of Education Filemaker Extraction And End-User Database Development, Andrew Tran

Electronic Theses, Projects, and Dissertations

The College of Education (CoE) at the California State University San Bernardino (CSUSB) developed a system to keep track of both state and national accreditation requirements using FileMaker 5, a database system. This accreditation data is crucial for reporting and record-keeping for the CSU Chancellor’s Office as well as the State of California. However, the database system was developed several decades ago, and software support has long since been dropped, causing the CoE’s legacy accreditation data to be at risk of being lost should the software or hardware suffer permanent failure. The purpose of this project was to perform extraction …


Cancel Culture: Who Or What Will Be Next?, Christine Trumper Apr 2022

Cancel Culture: Who Or What Will Be Next?, Christine Trumper

Honors Projects in Data Science

This paper utilizes Data Science and Applied Statistic techniques, to perform an analytical dive into Cancel Culture as it is referenced and used on Twitter. The research focuses on analyzing how Cancel Culture has affected the sentiment of Twitter, specifically how it impacts prominent topics in the media that have occurred between February 2021 to September 2021. The development of a topic and sentiment analysis will be based on 1,302,844 Tweets collected using Twitter’s API. Cancel Culture became popularized on social media in the past few years and there is little concrete information regarding its process and the demographics it …


Osm Science - The Academic Study Of The Openstreetmap Project, Data, Contributors, Community, And Applications, A. Yair Grinberger, Marco Minghini, Levente Juhasz, Godwin Yeboah, Peter Mooney Mar 2022

Osm Science - The Academic Study Of The Openstreetmap Project, Data, Contributors, Community, And Applications, A. Yair Grinberger, Marco Minghini, Levente Juhasz, Godwin Yeboah, Peter Mooney

GIS Center

This paper is an Editorial for the Special Issue titled “OpenStreetMap as a multidisciplinary nexus: perspectives, practices and procedures”. The Special Issue is largely based on the talks presented in the 2019 and 2020 editions of the Academic Track at the State of the Map conferences. As such, it represents the most pressing and relevant issues and topics considered by the academic community in relation to OpenStreetMap (OSM)—a global project and community aimed to create and maintain a free and editable database and map of the world. In this Editorial, we survey the papers included in the Special Issue, grouping …


The Data Analytics And The Science Revolution, Leila Halawi, Amal Clarke, Kelly George Feb 2022

The Data Analytics And The Science Revolution, Leila Halawi, Amal Clarke, Kelly George

Publications

This text highlights the difference between analytics and data science, using predictive analytic techniques to analyze different historical data, including aviation data and concrete data, interpreting the predictive models, and highlighting the steps to deploy the models and the steps ahead. The book combines the conceptual perspective and a hands-on approach to predictive analytics using SAS VIYA, an analytic and data management platform. The authors use SAS VIYA to focus on analytics to solve problems, highlight how analytics is applied in the airline and business environment, and compare several different modeling techniques. They decipher complex algorithms to demonstrate how they …


A Predictive Model To Predict Cyberattack Using Self-Normalizing Neural Networks, Oluwapelumi Eniodunmo Jan 2022

A Predictive Model To Predict Cyberattack Using Self-Normalizing Neural Networks, Oluwapelumi Eniodunmo

Theses, Dissertations and Capstones

Cyberattack is a never-ending war that has greatly threatened secured information systems. The development of automated and intelligent systems provides more computing power to hackers to steal information, destroy data or system resources, and has raised global security issues. Statistical and Data mining tools have received continuous research and improvements. These tools have been adopted to create sophisticated intrusion detection systems that help information systems mitigate and defend against cyberattacks. However, the advancement in technology and accessibility of information makes more identifiable elements that can be used to gain unauthorized access to systems and resources. Data mining and classification tools …


The Application Of Deep Learning And Cloud Technologies To Data Science, Ian A. Trawinski Jan 2022

The Application Of Deep Learning And Cloud Technologies To Data Science, Ian A. Trawinski

Electronic Theses and Dissertations

Machine Learning and Cloud Computing have become a staple to businesses and educational institutions over the recent years. The two forefronts of big data solutions have garnered technology giants to race for the superior implementation of both Machine Learning and Cloud Computing. The objective of this thesis is to test and utilize AWS SageMaker in three different applications: time-series forecasting with sentiment analysis, automated Machine Learning (AutoML), and finally anomaly detection. The first study covered is a sentiment-based LSTM for stock price prediction. The LSTM was created with two methods, the first being SQL Server Data Tools, and the second …