Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

40,540 Full-Text Articles 45,079 Authors 14,916,241 Downloads 328 Institutions

All Articles in Computer Sciences

Faceted Search

40,540 full-text articles. Page 1 of 1306.

Exposing And Fixing Causes Of Inconsistency And Nondeterminism In Clustering Implementations, Xin Yin 2021 New Jersey Institute of Technology

Exposing And Fixing Causes Of Inconsistency And Nondeterminism In Clustering Implementations, Xin Yin

Dissertations

Cluster analysis aka Clustering is used in myriad applications, including high-stakes domains, by millions of users. Clustering users should be able to assume that clustering implementations are correct, reliable, and for a given algorithm, interchangeable. Based on observations in a wide-range of real-world clustering implementations, this dissertation challenges the aforementioned assumptions.This dissertation introduces an approach named SmokeOut that uses differential clustering to show that clustering implementations suffer from nondeterminism and inconsistency: on a given input dataset and using a given clustering algorithm, clustering outcomes and accuracy vary widely between (1) successive runs of the same toolkit, i.e., nondeterminism ...


Enterprise Environment Modeling For Penetration Testing On The Openstack Virtualization Platform, Vincent Karovič Jr., Jakub Bartaloš, Vincent Karovič, Michal Greguš 2021 Comenius University

Enterprise Environment Modeling For Penetration Testing On The Openstack Virtualization Platform, Vincent Karovič Jr., Jakub Bartaloš, Vincent Karovič, Michal Greguš

Journal of Global Business Insights

The article presents the design of a model environment for penetration testing of an organization using virtualization. The need for this model was based on the constantly increasing requirements for the security of information systems, both in legal terms and in accordance with international security standards. The model was created based on a specific team from the unnamed company. The virtual working environment offered the same functions as the physical environment. The virtual working environment was created in OpenStack and tested with a Linux distribution Kali Linux. We demonstrated that the virtual environment is functional and its security testable. Virtualizing ...


Tradao: A Visual Analytics System For Trading Algorithm Optimization, Ka Wing TSANG, Haotian LI, Fuk Ming LAM, Yifan MU, Yong WANG, Huamin QU 2021 Singapore Management University

Tradao: A Visual Analytics System For Trading Algorithm Optimization, Ka Wing Tsang, Haotian Li, Fuk Ming Lam, Yifan Mu, Yong Wang, Huamin Qu

Research Collection School Of Computing and Information Systems

With the wide applications of algorithmic trading, it has become critical for traders to build a winning trading algorithm to beat the market. However, due to the lack of efficient tools, traders mainly rely on their memory to manually compare the algorithm instances of a trading algorithm and further select the best trading algorithm instance for the real trading deployment. We work closely with industry practitioners to discover and consolidate user requirements and develop an interactive visual analytics system for trading algorithm optimization. Structured expert interviews are conducted to evaluateTradAOand a representative case study is documented for illustrating the system ...


Qlens: Visual Analytics Of Multi-Step Problem-Solving Behaviors For Improving Question Design, Meng XIA, Reshika P. VELUMANI, Yong WANG, Huamin QU, Xiaojuan MA 2021 Singapore Management University

Qlens: Visual Analytics Of Multi-Step Problem-Solving Behaviors For Improving Question Design, Meng Xia, Reshika P. Velumani, Yong Wang, Huamin Qu, Xiaojuan Ma

Research Collection School Of Computing and Information Systems

With the rapid development of online education in recent years, there has been an increasing number of learning platforms that provide students with multi-step questions to cultivate their problem-solving skills. To guarantee the high quality of such learning materials, question designers need to inspect how students’ problem-solving processes unfold step by step to infer whether students’ problem-solving logic matches their design intent. They also need to compare the behaviors of different groups (e.g., students from different grades) to distribute questions to students with the right level of knowledge. The availability of fine-grained interaction data, such as mouse movement trajectories ...


Taxthemis: Interactive Mining And Exploration Of Suspicious Tax Evasion Group, Yating LIN, Kamkwai WONG, Yong WANG, Rong ZHANG, Bo DONG, Huamin QU, Qinghua ZHENG 2021 Singapore Management University

Taxthemis: Interactive Mining And Exploration Of Suspicious Tax Evasion Group, Yating Lin, Kamkwai Wong, Yong Wang, Rong Zhang, Bo Dong, Huamin Qu, Qinghua Zheng

Research Collection School Of Computing and Information Systems

Tax evasion is a serious economic problem for many countries, as it can undermine the government’s tax system and lead to an unfair business competition environment. Recent research has applied data analytics techniques to analyze and detect tax evasion behaviors of individual taxpayers. However, they have failed to support the analysis and exploration of the related party transaction tax evasion (RPTTE) behaviors (e.g., transfer pricing), where a group of taxpayers is involved. In this paper, we present TaxThemis, an interactive visual analytics system to help tax officers mine and explore suspicious tax evasion groups through analyzing heterogeneous tax-related ...


Visual Analysis Of Discrimination In Machine Learning, Qianwen WANG, Zhenghua XU, Zhutian CHEN, Yong WANG, Yong WANG, Huamin Qu 2021 Singapore Management University

Visual Analysis Of Discrimination In Machine Learning, Qianwen Wang, Zhenghua Xu, Zhutian Chen, Yong Wang, Yong Wang, Huamin Qu

Research Collection School Of Computing and Information Systems

The growing use of automated decision-making in critical applications, such as crime prediction and college admission, has raised questions about fairness in machine learning. How can we decide whether different treatments are reasonable or discriminatory? In this paper, we investigate discrimination in machine learning from a visual analytics perspective and propose an interactive visualization tool, DiscriLens, to support a more comprehensive analysis. To reveal detailed information on algorithmic discrimination, DiscriLens identifies a collection of potentially discriminatory itemsets based on causal modeling and classification rules mining. By combining an extended Euler diagram with a matrix-based visualization, we develop a novel set ...


Splash: Learnable Activation Functions For Improving Accuracy And Adversarial Robustness, Mohammadamin Tavakoli, Forest Agostinelli, Pierre Baldi 2021 University of California, Irvine

Splash: Learnable Activation Functions For Improving Accuracy And Adversarial Robustness, Mohammadamin Tavakoli, Forest Agostinelli, Pierre Baldi

Publications

We introduce SPLASH units, a class of learnable activation functions shown to simultaneously improve the accuracy of deep neural networks while also improving their robustness to adversarial attacks. SPLASH units have both a simple parameterization and maintain the ability to approximate a wide range of non-linear functions. SPLASH units are: (1) continuous; (2) grounded (f(0)=0"); (3) use symmetric hinges; and (4) their hinges are placed at fixed locations which are derived from the data (i.e. no learning required). Compared to nine other learned and fixed activation functions, including ReLU and its variants, SPLASH units show superior performance ...


A Bert-Based Two-Stage Model For Chinese Chengyu Recommendation, Minghuan TAN, Jing JIANG, Bingtian DAI 2021 Singapore Management University

A Bert-Based Two-Stage Model For Chinese Chengyu Recommendation, Minghuan Tan, Jing Jiang, Bingtian Dai

Research Collection School Of Computing and Information Systems

In Chinese, Chengyu are fixed phrases consisting of four characters. As a type of idioms, their meanings usually cannot be derived from their component characters. In this paper, we study the task of recommending a Chengyu given a textual context. Observing some of the limitations with existing work, we propose a two-stage model, where during the first stage we re-train a Chinese BERT model by masking out Chengyu from a large Chinese corpus with a wide coverage of Chengyu. During the second stage, we fine-tune the retrained, Chengyu-oriented BERT on a specific Chengyu recommendation dataset. We evaluate this method on ...


Hierarchical Mapping For Crosslingual Word Embedding Alignment, Ion Madrazo Azpiazu, Maria Soledad Pera 2021 Boise State University

Hierarchical Mapping For Crosslingual Word Embedding Alignment, Ion Madrazo Azpiazu, Maria Soledad Pera

Computer Science Faculty Publications and Presentations

The alignment of word embedding spaces in different languages into a common crosslingual space has recently been in vogue. Strategies that do so compute pairwise alignments and then map multiple languages to a single pivot language (most often English). These strategies, however, are biased towards the choice of the pivot language, given that language proximity and the linguistic characteristics of the target language can strongly impact the resultant crosslingual space in detriment of topologically distant languages. We present a strategy that eliminates the need for a pivot language by learning the mappings across languages in a hierarchicalway. Experiments demonstrate that ...


Blockchain For Automotive: An Insight Towards The Ipfs Blockchain-Based Auto Insurance Sector, Nishara Nizamuddin, Ahed Abugabah 2021 Zayed University

Blockchain For Automotive: An Insight Towards The Ipfs Blockchain-Based Auto Insurance Sector, Nishara Nizamuddin, Ahed Abugabah

All Works

The advancing technology and industrial revolution have taken the automotive industry by storm in recent times. The auto sector’s constantly growing demand has paved the way for the automobile sector to embrace new technologies and disruptive innovations. The multi-trillion dollar, complex auto insurance sector is still stuck in the regulations of the past. Most of the customers still contact the insurance company by phone to buy new policies and process existing insurance claims. The customers still face the risk of fraudulent online brokers, as policies are mostly signed and processed on papers which often require human supervision, with a ...


Learn Biologically Meaningful Representation With Transfer Learning, Di He 2021 City University of New York (CUNY)

Learn Biologically Meaningful Representation With Transfer Learning, Di He

Dissertations, Theses, and Capstone Projects

Machine learning has made significant contributions to bioinformatics and computational biol­ogy. In particular, supervised learning approaches have been widely used in solving problems such as bio­marker identification, drug response prediction, and so on. However, because of the limited availability of comprehensively labeled and clean data, constructing predictive models in super­ vised settings is not always desirable or possible, especially when using data­hunger, red­hot learning paradigms such as deep learning methods. Hence, there are urgent needs to develop new approaches that could leverage more readily available unlabeled data in driving successful machine learning ap­ plications in this ...


An Empirical Study Of Refactorings And Technical Debt In Machine Learning Systems, Yiming Tang, Raffi T. Khatchadourian, Mehdi Bagherzadeh, Rhia Singh, Ajani Stewart, Anita Raja 2021 CUNY Graduate Center

An Empirical Study Of Refactorings And Technical Debt In Machine Learning Systems, Yiming Tang, Raffi T. Khatchadourian, Mehdi Bagherzadeh, Rhia Singh, Ajani Stewart, Anita Raja

Publications and Research

Machine Learning (ML), including Deep Learning (DL), systems, i.e., those with ML capabilities, are pervasive in today’s data-driven society. Such systems are complex; they are comprised of ML models and many subsystems that support learning processes. As with other complex systems, ML systems are prone to classic technical debt issues, especially when such systems are long-lived, but they also exhibit debt specific to these systems. Unfortunately, there is a gap of knowledge in how ML systems actually evolve and are maintained. In this paper, we fill this gap by studying refactorings, i.e., source-to-source semantics-preserving program transformations, performed ...


Using An Integrative Machine Learning Approach To Study Microrna Regulation Networks In Pancreatic Cancer Progression, Roland Madadjim 2021 University of Nebraska-Lincoln

Using An Integrative Machine Learning Approach To Study Microrna Regulation Networks In Pancreatic Cancer Progression, Roland Madadjim

Computer Science and Engineering: Theses, Dissertations, and Student Research

With advances in genomic discovery tools, recent biomedical research has produced a massive amount of genomic data on post-transcriptional regulations related to various transcript factors, microRNAs, lncRNAs, epigenetic modifications, and genetic variations. In this direction, the field of gene regulation network inference is created and aims to understand the interactome regulations between these molecules (e.g., gene-gene, miRNA-gene) that take place to build models able to capture behavioral changes in biological systems. A question of interest arises in integrating such molecules to build a network while treating each specie in its uniqueness. Given the dynamic changes of interactome in chaotic ...


High-Order Flexible Multirate Integrators For Multiphysics Applications, Rujeko Chinomona 2021 Southern Methodist University

High-Order Flexible Multirate Integrators For Multiphysics Applications, Rujeko Chinomona

Mathematics Theses and Dissertations

Traditionally, time integration methods within multiphysics simulations have been chosen to cater to the most restrictive dynamics, sometimes at a great computational cost. Multirate integrators accurately and efficiently solve systems of ordinary differential equations that exhibit different time scales using two or more time steps. In this thesis, we explore three classes of time integrators that can be classified as one-step multi-stage multirate methods for which the slow dynamics are evolved using a traditional one step scheme and the fast dynamics are solved through a sequence of modified initial value problems. Practically, the fast dynamics are subcycled using a small ...


Musical Gesture Through The Human Computer Interface: An Investigation Using Information Theory, Michael Vincent Blandino 2021 Louisiana State University and Agricultural and Mechanical College

Musical Gesture Through The Human Computer Interface: An Investigation Using Information Theory, Michael Vincent Blandino

LSU Doctoral Dissertations

This study applies information theory to investigate human ability to communicate using continuous control sensors with a particular focus on informing the design of digital musical instruments. There is an active practice of building and evaluating such instruments, for instance, in the New Interfaces for Musical Expression (NIME) conference community. The fidelity of the instruments can depend on the included sensors, and although much anecdotal evidence and craft experience informs the use of these sensors, relatively little is known about the ability of humans to control them accurately. This dissertation addresses this issue and related concerns, including continuous control performance ...


Automated Analysis Of Rfps Using Natural Language Processing (Nlp) For The Technology Domain, Sterling Beason, William Hinton, Yousri A. Salamah, Jordan Salsman 2021 Southern Methodist University

Automated Analysis Of Rfps Using Natural Language Processing (Nlp) For The Technology Domain, Sterling Beason, William Hinton, Yousri A. Salamah, Jordan Salsman

SMU Data Science Review

Much progress has been made in text analysis, specifically within the statistical domain of Term Frequency (TF) and Inverse Document Frequency (IDF). However, there is much room for improvement especially within the area of discovering Emerging Trends. Emerging Trend Detection Systems (ETDS) depend on ingesting a collection of textual data and TF/IDF to identify new or up-trending topics within the Corpus. However, the tremendous rate of change and the amount of digital information presents a challenge that makes it almost impossible for a human expert to spot emerging trends without relying on an automated ETD system. Since the U ...


The Social Market Economy As A Formula For Peace, Prosperity, And Sustainability, Almuth D. Merkel 2021 Kennesaw State University

The Social Market Economy As A Formula For Peace, Prosperity, And Sustainability, Almuth D. Merkel

Doctor of International Conflict Management Dissertations

The social market economy was developed in Germany during the interwar period amidst political and economic turmoil. With clear demarcation lines differentiating it from socialism and laissez-faire capitalism, the social market economy became a formula for peace and prosperity for post WWII Germany. Since then, the success of the social market economy has inspired many other countries to adopt its principles. Drawing on evidence from economic history and the history of economic thought, this thesis first reviews the evolution of the fundamental principles that form the foundation of social-market economic thought. Blending the micro-economic utility maximization framework with traditional growth ...


Airbnb Price Prediction With Sentiment Classification, Peilu Liu 2021 San Jose State University

Airbnb Price Prediction With Sentiment Classification, Peilu Liu

Master's Projects

Airbnb is an online platform that provides arrangements for short-term local home renting services. It is a challenging task for the house owner to price a rental home and attract customers. Customers also need to evaluate the price of the rental property based on the listing details. This paper demonstrates several existing Airbnb price prediction models using machine learning and external data to improve the prediction accuracy. It also discusses machine learning and neural network models that are commonly used for price prediction. The goal of this paper is to build a price prediction model using machine learning and sentiment ...


2vt: Visions, Technologies, And Visions Of Technologies For Understanding Human Scale Spaces, Ville Paanen, Piia Markkanen, Jonas Oppenlaender, Haider Akmal, Lik Hang Lee, Ava Fatah Gen Schieck, John Dunham, Konstantinos Papangelis, Nicolas Lalone, Niels Van Berkel, Jorge Goncalves, Simo Hosio 2021 University of Oulu

2vt: Visions, Technologies, And Visions Of Technologies For Understanding Human Scale Spaces, Ville Paanen, Piia Markkanen, Jonas Oppenlaender, Haider Akmal, Lik Hang Lee, Ava Fatah Gen Schieck, John Dunham, Konstantinos Papangelis, Nicolas Lalone, Niels Van Berkel, Jorge Goncalves, Simo Hosio

Presentations and other scholarship

Spatial experience is an important subject in various fields, and in HCI it has been mostly investigated in the urban scale. Research on human scale spaces has focused mostly on the personal meaning or aesthetic and embodied experiences in the space. Further, spatial experience is increasingly topical in envisioning how to build and interact with technologies in our everyday lived environments, particularly in so-called smart cities. This workshop brings researchers and practitioners from diverse fields to collaboratively discover new ways to understand and capture human scale spatial experience and envision its implications to future technological and creative developments in our ...


Analysis Of Students’ Multi-Representation Ability In Augmented Reality-Assisted Learning, Sri Jumini, Edy Cahyono, Muhamad Miftakhul Falah 2021 Universitas Sains Al-Qur'an of Central Java in Wonosobo

Analysis Of Students’ Multi-Representation Ability In Augmented Reality-Assisted Learning, Sri Jumini, Edy Cahyono, Muhamad Miftakhul Falah

Library Philosophy and Practice (e-journal)

Not all learning sources can directly and cheaply be presented, so augmented reality media is needed to be applied to students with various talents and intelligence. This study aims to analyze students’ multi-representation ability through the use of augmented reality media. The research method was carried out through pre-experiment with one group posttest only design. Test question items were given to see the students’ multi-representation ability. Data analysis was carried out through the percentage of the number of students achieving test scores of more than or equal to 80 on a scale of 100. The results showed that 88% (28 ...


Digital Commons powered by bepress