Open Access. Powered by Scholars. Published by Universities.®

Data Science Commons

Open Access. Powered by Scholars. Published by Universities.®

1,483 Full-Text Articles 2,962 Authors 435,013 Downloads 189 Institutions

All Articles in Data Science

Faceted Search

1,483 full-text articles. Page 17 of 73.

Chatgpt As Metamorphosis Designer For The Future Of Artificial Intelligence (Ai): A Conceptual Investigation, Amarjit Kumar Singh (Library Assistant), Dr. Pankaj Mathur (Deputy Librarian) 2023 Central University of South Bihar, Panchanpur, Gaya, Bihar

Chatgpt As Metamorphosis Designer For The Future Of Artificial Intelligence (Ai): A Conceptual Investigation, Amarjit Kumar Singh (Library Assistant), Dr. Pankaj Mathur (Deputy Librarian)

Library Philosophy and Practice (e-journal)

Abstract

Purpose: The purpose of this research paper is to explore ChatGPT’s potential as an innovative designer tool for the future development of artificial intelligence. Specifically, this conceptual investigation aims to analyze ChatGPT’s capabilities as a tool for designing and developing near about human intelligent systems for futuristic used and developed in the field of Artificial Intelligence (AI). Also with the helps of this paper, researchers are analyzed the strengths and weaknesses of ChatGPT as a tool, and identify possible areas for improvement in its development and implementation. This investigation focused on the various features and functions of ChatGPT that …


A Chairpersons Guide To Managing Time And Stress, Christian K. Hansen 2023 Eastern Washington University

A Chairpersons Guide To Managing Time And Stress, Christian K. Hansen

Academic Chairpersons Conference Proceedings

In this interactive workshop we discuss time and stress management specifically from the perspective of a department chairperson responsible for leading an academic department through numerous internal and external challenges. The focus will be on practical strategies for effective use of time, not only at a personal level, but also at a department wide level.


Text And Data Mining Applications For Teaching Music Bibliography, Taylor Greene, Laurie Sampsel 2023 Chapman University

Text And Data Mining Applications For Teaching Music Bibliography, Taylor Greene, Laurie Sampsel

Library Presentations, Posters, and Audiovisual Materials

Text and data mining (TDM) is a process of increasing interdisciplinary potential and one with many practical applications for music graduate students. TDM, however, remains a topic rarely introduced in the music bibliography course. Understandably, talk of artificial intelligence, algorithms, and programming languages are intimidating to music students, but thanks to software applications, knowledge about these computer science topics are not required to participate in research using TDM. This presentation explores ways to introduce digital humanities to music students through TDM.

In our presentation, we will discuss two approaches to incorporating TDM into the music bibliography course, focusing on two …


Institutional Design And Policy Responsiveness In Us States, Scott J. LaCombe 2023 Smith College

Institutional Design And Policy Responsiveness In Us States, Scott J. Lacombe

Government: Faculty Publications

There is significant disagreement on the moderating role of institutions on policy responsive- ness, yet overwhelmingly research in state politics has focused on single institutions. This project leverages a new aggregate scale of state institutions to evaluate if the collective insti- tutional context moderates the influence of public opinion on policy. I use a recently released latent scale of institutional context and find that high levels of accountability pressure strongly strengthen public opinion’s influence on policy for both economic and social policy, while the strength of a state’s checks and balance system is largely unrelated to policy responsiveness. These results …


Wearables For In-Situ Monitoring Of Cognitive States: Challenges And Opportunities, Meera RADHAKRISHNAN, Thivya KANDAPPU, Manoj GULATI, Archan MISRA 2023 Singapore Management University

Wearables For In-Situ Monitoring Of Cognitive States: Challenges And Opportunities, Meera Radhakrishnan, Thivya Kandappu, Manoj Gulati, Archan Misra

Research Collection School Of Computing and Information Systems

We propose using wrist and ear-based sensing, via multiple novel and complementary modalities, to unobtrusively infer activity-aware, complex cognitive and affective states (such as confusion, boredom, and recall failure) of individuals. While state-of-the-art wearable devices are predominantly used (a) independently, with limited coordination among multiple devices, and (b) to capture macro-level physical activity and physiological state, we seek to expand the ambit of unobtrusive wearable sensing to capture the cognitive states while performing commonplace physical activities. Such states typically manifest via fine-grained, almost unobservable, microscopic head, face, and eye movements. We identify some of these fine-grained physical markers that serve …


Predicting Micronutrient Deficiency With Publicly Available Satellite Data, Elizabeth Bondi-Kelly, Haipeng Chen, Christopher D. Golden, Nikhil Behari, Milind Tambe 2023 William & Mary

Predicting Micronutrient Deficiency With Publicly Available Satellite Data, Elizabeth Bondi-Kelly, Haipeng Chen, Christopher D. Golden, Nikhil Behari, Milind Tambe

Arts & Sciences Articles

Micronutrient deficiency (MND), which is a form of malnutrition that can have serious health consequences, is difficult to diagnose in early stages without blood draws, which are expensive and time-consuming to collect and process. It is even more difficult at a public health scale seeking to identify regions at higher risk of MND. To provide data more widely and frequently, we propose an accurate, scalable, low-cost, and interpretable regional-level MND prediction system. Specifically, our work is the first to use satellite data, such as forest cover, weather, and presence of water, to predict deficiency of micronutrients such as iron, Vitamin …


Multicollinearity Applied Stepwise Stochastic Imputation: A Large Dataset Imputation Through Correlation‑Based Regression, Benjamin D. Leiby, Darryl K. Ahner 2023 Air Force Institute of Technology

Multicollinearity Applied Stepwise Stochastic Imputation: A Large Dataset Imputation Through Correlation‑Based Regression, Benjamin D. Leiby, Darryl K. Ahner

Faculty Publications

This paper presents a stochastic imputation approach for large datasets using a correlation selection methodology when preferred commercial packages struggle to iterate due to numerical problems. A variable range-based guard rail modification is proposed that benefits the convergence rate of data elements while simultaneously providing increased confidence in the plausibility of the imputations. A large country conflict dataset motivates the search to impute missing values well over a common threshold of 20% missingness. The Multicollinearity Applied Stepwise Stochastic imputation methodology (MASS-impute) capitalizes on correlation between variables within the dataset and uses model residuals to estimate unknown values. Examination of the …


Named Entity Recognition From Biomedical Text, Maged Guirguis 2023 American University in Cairo

Named Entity Recognition From Biomedical Text, Maged Guirguis

Theses and Dissertations

As vast amounts of unstructured data are becoming available digitally, computer-based methods to extract relevant and meaningful information are needed. Named entity recognition (NER) is the task of identifying text spans that mention named entities, and to classify them into predefined categories. Despite the existence of numerous and well-versed NER methods, the bio-medical domain remains under-studied. The objective of this research is to identify an efficient technique for NER tasks from biomedical data. This is achieved by investigating using deep learning technologies namely pre-trained BERT [1] model and its variances SciBERT [2] and BioBERT [3]. Preprocessing the data before passing …


Session11: Skip-Gcn : A Framework For Hierarchical Graph Representation Learning, Jackson Cates, Justin Lewis, Randy Hoover, Kyle Caudle 2023 SDSMT

Session11: Skip-Gcn : A Framework For Hierarchical Graph Representation Learning, Jackson Cates, Justin Lewis, Randy Hoover, Kyle Caudle

SDSU Data Science Symposium

Recently there has been high demand for the representation learning of graphs. Graphs are a complex data structure that contains both topology and features. There are first several domains for graphs, such as infectious disease contact tracing and social media network communications interactions. The literature describes several methods developed that work to represent nodes in an embedding space, allowing for classical techniques to perform node classification and prediction. One such method is the graph convolutional neural network that aggregates the node neighbor’s features to create the embedding. Another method, Walklets, takes advantage of the topological information stored in a graph …


2d Respiratory Sound Analysis To Detect Lung Abnormalities, Rafia Sharmin Alice, KC Santosh 2023 University of South Dakota

2d Respiratory Sound Analysis To Detect Lung Abnormalities, Rafia Sharmin Alice, Kc Santosh

SDSU Data Science Symposium

Abstract. In this paper, we analyze deep visual features from 2D data representation(s) of the respiratory sound to detect evidence of lung abnormalities. The primary motivation behind this is that visual cues are more important in decision-making than raw data (lung sound). Early detection and prompt treatments are essential for any future possible respiratory disorders, and respiratory sound is proven to be one of the biomarkers. In contrast to state-of-the-art approaches, we aim at understanding/analyzing visual features using our Convolutional Neural Networks (CNN) tailored Deep Learning Models, where we consider all possible 2D data such as Spectrogram, Mel-frequency Cepstral Coefficients …


Temporal Tensor Factorization For Multidimensional Forecasting, Jackson Cates, Karissa Scipke, Randy Hoover, Kyle Caudle 2023 SDSMT

Temporal Tensor Factorization For Multidimensional Forecasting, Jackson Cates, Karissa Scipke, Randy Hoover, Kyle Caudle

SDSU Data Science Symposium

In the era of big data, there is a need for forecasting high-dimensional time series that might be incomplete, sparse, and/or nonstationary. The current research aims to solve this problem for two-dimensional data through a combination of temporal matrix factorization (TMF) and low-rank tensor factorization. From this method, we propose an expansion of TMF to two-dimensional data: temporal tensor factorization (TTF). The current research aims to interpolate missing values via low-rank tensor factorization, which produces a latent space of the original multilinear time series. We then can perform forecasting in the latent space. We present experimental results of the proposed …


Emotion Classification Of Indonesian Tweets Using Bidirectional Lstm, Aaron K. Glenn, Phillip M. LaCasse, Bruce A. Cox 2023 Air Force Institute of Technology

Emotion Classification Of Indonesian Tweets Using Bidirectional Lstm, Aaron K. Glenn, Phillip M. Lacasse, Bruce A. Cox

Faculty Publications

Emotion classification can be a powerful tool to derive narratives from social media data. Traditional machine learning models that perform emotion classification on Indonesian Twitter data exist but rely on closed-source features. Recurrent neural networks can meet or exceed the performance of state-of-the-art traditional machine learning techniques using exclusively open-source data and models. Specifically, these results show that recurrent neural network variants can produce more than an 8% gain in accuracy in comparison with logistic regression and SVM techniques and a 15% gain over random forest when using FastText embeddings. This research found a statistical significance in the performance of …


Social Impacts Of Robotics On The Labor And Employment Market, Kelvin Espinal 2023 The Graduate Center, City University of New York

Social Impacts Of Robotics On The Labor And Employment Market, Kelvin Espinal

Dissertations, Theses, and Capstone Projects

Robotics have been introduced into the workplace to perform tasks that human beings have traditionally fulfilled. Complementing or substituting human labor with robotics eliminates human involvement in functions attributable to hazardous environments, heavy lifting, toxic substances, and repetitive low-level tasks. On the other hand, they are meant to be more efficient and cost-effective, saving money, time, and labor. However, since the introduction of robotics in the workforce, societal opposition has been towards this branch of technology in fear of losing employment, wages, and purpose.

Previous studies have reported an overarching societal fear that adopting robotics in the workplace and industry …


Analyzing Relationships With Machine Learning, Oscar Ko 2023 The Graduate Center, City University of New York

Analyzing Relationships With Machine Learning, Oscar Ko

Dissertations, Theses, and Capstone Projects

Procedurally, this project aims to take a dataset, analyze it, and offer insights to the audience in an easy-to-digest format. Conceptually, this project will seek to explore questions like: “Do couples that meet through online dating or dating apps have higher or lower quality relationships?”, “Can any features in this dataset help predict how a subject would rate their relationship quality?”, and “What other insights can I derive from using machine learning for exploratory analysis?” The intended audience for this project is anyone interested in romantic relationships or machine learning.

The dataset is from a Stanford University survey, “How Couples …


Revealing The Three-Dimensional Magnetic Texture With Machine Learning Models, Shihua Zhao 2023 The Graduate Center, City University of New York

Revealing The Three-Dimensional Magnetic Texture With Machine Learning Models, Shihua Zhao

Dissertations, Theses, and Capstone Projects

Revealing three-dimensional (3D) magnetic textures with vector field electron tomography (VFET) is essential in studying novel magnetic materials with topologically protected spin textures potentially being used in the next-generation semiconductor industry. In this dissertation, we use machine learning (ML) models to reconstruct 3D magnetic textures from electron holography (EH) data.

We can feed the EH data, a series of two-dimensional (2D) phasemaps, into a neural network (NN) architecture directly or feed the EH data into a conventional VFET and then feed the reconstructed results into a NN. Thus, perceptive NN, either a simple convolutional neural network (CNN) or Unet architecture, …


Determining The Proportionality Of Ischemic Stroke Risk Factors To Age, Elizabeth Hunter, John D. Kelleher 2023 Technological University Dublin

Determining The Proportionality Of Ischemic Stroke Risk Factors To Age, Elizabeth Hunter, John D. Kelleher

Articles

While age is an important risk factor, there are some disadvantages to including it in a stroke risk model: age can dominate the risk score and lead to over-or under-predictions in some age groups. There is evidence to suggest that some of these disadvantages are due to the non-proportionality of other risk factors with age, eg, risk factors contribute differently to stroke risk based on an individual’s age. In this paper, we present a framework to test if risk factors are proportional with age. We then apply the framework to a set of risk factors using Framingham heart study data …


A Bidirectional Deep Lstm Machine Learning Method For Flight Delay Modelling And Analysis, Desmond B. Bisandu, Irene Moulitsas 2023 Cranfield University

A Bidirectional Deep Lstm Machine Learning Method For Flight Delay Modelling And Analysis, Desmond B. Bisandu, Irene Moulitsas

National Training Aircraft Symposium (NTAS)

Flight delays can be prevented by providing a reference point from an accurate prediction model because predicting flight delays is a problem with a specific space. Only a few algorithms consider predicted classes' mutual correlation during flight delay classification or prediction modelling tasks. None of these existing methods works for all scenarios. Therefore, the need to investigate the performance of more models in solving the problem of flight delay is vast and rapidly increasing. This paper presents the development and evaluation of LSTM and BiLSTM models by comparing them for a flight delay prediction. The LSTM does the feature extraction …


Integrated Organizational Machine Learning For Aviation Flight Data, Michael J. Pritchard, Paul Thomas, Eric Webb, Jon Martin, Austin Walden 2023 Kansas State University

Integrated Organizational Machine Learning For Aviation Flight Data, Michael J. Pritchard, Paul Thomas, Eric Webb, Jon Martin, Austin Walden

National Training Aircraft Symposium (NTAS)

An increased availability of data and computing power has allowed organizations to apply machine learning techniques to various fleet monitoring activities. Additionally, our ability to acquire aircraft data has increased due to the miniaturization of small form factor computing machines. Aircraft data collection processes contain many data features in the form of multivariate time-series (continuous, discrete, categorical, etc.) which can be used to train machine learning models. Yet, three major challenges still face many flight organizations 1) integration and automation of data collection frameworks, 2) data cleanup and preparation, and 3) embedded machine learning framework. Data cleanup and preparation has …


Data Poisoning: A New Threat To Artificial Intelligence, Nary Simms 2023 La Salle University

Data Poisoning: A New Threat To Artificial Intelligence, Nary Simms

Mathematics and Computer Science Capstones

Artificial Intelligence (AI) adoption is rapidly being deployed in a number of fields, from banking and finance to healthcare, robotics, transportation, military, e-commerce and social networks. Grand View Research estimates that the global AI market was worth 93.5 billion in 2021 and that it will increase at a compound annual growth rate (CAGR) of 38.1% from 2022 to 2030. According to a 2020 MIT Sloan Management survey, 87% of multinational corporations believe that AI technology will provide a competitive edge. Artificial Intelligence relies heavily on datasets to train its models. The more data, the better it learns and predicts. However, …


Visual Analytics And Modeling Of Materials Property Data, Diwas Bhattarai 2023 Louisiana State University and Agricultural and Mechanical College

Visual Analytics And Modeling Of Materials Property Data, Diwas Bhattarai

LSU Doctoral Dissertations

Due to significant advancements in experimental and computational techniques, materials data are abundant. To facilitate data-driven research, it calls for a system for managing and sharing data and supporting a set of tools for effective data analysis and modeling. Generally, a given material property M can be considered as a multivariate data problem. The dimensions of M are the values of the property itself, the conditions (pressure P, temperature T, and multi-component composition X) that control the concerned property, and relevant metadata I (source, date).

Here we present a comprehensive database considering both experimental and computational sources …


Digital Commons powered by bepress