Open Access. Powered by Scholars. Published by Universities.®

Databases and Information Systems Commons

Open Access. Powered by Scholars. Published by Universities.®

6,572 Full-Text Articles 8,902 Authors 3,346,977 Downloads 211 Institutions

All Articles in Databases and Information Systems

Faceted Search

6,572 full-text articles. Page 106 of 248.

Detecting Personal Intake Of Medicine From Twitter, Debanjan MAHATA, Jasper FRIEDRICHS, Rajiv Ratn SHAH, Jing JIANG 2018 Bloomberg

Detecting Personal Intake Of Medicine From Twitter, Debanjan Mahata, Jasper Friedrichs, Rajiv Ratn Shah, Jing Jiang

Research Collection School Of Computing and Information Systems

Mining social media messages such as tweets, blogs, and Facebook posts for health and drug related information has received significant interest in pharmacovigilance research. Social media sites (e.g., Twitter), have been used for monitoring drug abuse, adverse reactions to drug usage, and analyzing expression of sentiments related to drugs. Most of these studies are based on aggregated results from a large population rather than specific sets of individuals. In order to conduct studies at an individual level or specific groups of people, identifying posts mentioning intake of medicine by the user is necessary. Toward this objective we develop a classifier …


Searching For The X-Factor: Exploring Corpus Subjectivity For Word Embeddings, Maksim TKACHENKO, Chong Cher CHIA, Hady W. LAUW 2018 Singapore Management University

Searching For The X-Factor: Exploring Corpus Subjectivity For Word Embeddings, Maksim Tkachenko, Chong Cher Chia, Hady W. Lauw

Research Collection School Of Computing and Information Systems

We explore the notion of subjectivity, and hypothesize that word embeddings learnt from input corpora of varying levels of subjectivity behave differently on natural language processing tasks such as classifying a sentence by sentiment, subjectivity, or topic. Through systematic comparative analyses, we establish this to be the case indeed. Moreover, based on the discovery of the outsized role that sentiment words play on subjectivity-sensitive tasks such as sentiment classification, we develop a novel word embedding SentiVec which is infused with sentiment information from a lexical resource, and is shown to outperform baselines on such tasks.


Disease Gene Classification With Metagraph Representations, Sezin KIRCALI ATA, Yuan FANG, Min WU, Xiao-Li LI, Xiaokui XIAO 2018 Nanyang Technological University

Disease Gene Classification With Metagraph Representations, Sezin Kircali Ata, Yuan Fang, Min Wu, Xiao-Li Li, Xiaokui Xiao

Research Collection School Of Computing and Information Systems

This chapter is based on exploiting the network-based representations of proteins, metagraphs, in protein-protein interaction network to identify candidate disease-causing proteins. Protein-protein interaction (PPI) networks are effective tools in studying the functional roles of proteins in the development of various diseases. However, they are insufficient without the support of additional biological knowledge for proteins such as their molecular functions and biological processes. To enhance PPI networks, we utilize biological properties of individual proteins as well. More specifically, we integrate keywords from UniProt database describing protein properties into the PPI network and construct a novel heterogeneous PPI-Keyword (PPIK) network consisting …


Rumor Detection On Twitter With Tree-Structured Recursive Neural Networks, Jing MA, Wei GAO, Kam-Fai WONG 2018 Singapore Management University

Rumor Detection On Twitter With Tree-Structured Recursive Neural Networks, Jing Ma, Wei Gao, Kam-Fai Wong

Research Collection School Of Computing and Information Systems

Sentiment expression in microblog posts can be affected by user’s personal character, opinion bias, political stance and so on. Most of existing personalized microblog sentiment classification methods suffer from the insufficiency of discriminative tweets for personalization learning. We observed that microblog users have consistent individuality and opinion bias in different languages. Based on this observation, in this paper we propose a novel user-attention-based Convolutional Neural Network (CNN) model with adversarial cross-lingual learning framework. The user attention mechanism is leveraged in CNN model to capture user’s language-specific individuality from the posts. Then the attention-based CNN model is incorporated into a novel …


A Driver Guidance System For Taxis In Singapore, Shashi Shekhar JHA, Shih-Fen CHENG, Meghna LOWALEKAR, Nicholas WONG, Rishikeshan RAJENDRAM, Pradeep VARAKANTHAM, Nghia Troung TROUNG, Firmansyah BIN ABD RAHMAN 2018 Singapore Management University

A Driver Guidance System For Taxis In Singapore, Shashi Shekhar Jha, Shih-Fen Cheng, Meghna Lowalekar, Nicholas Wong, Rishikeshan Rajendram, Pradeep Varakantham, Nghia Troung Troung, Firmansyah Bin Abd Rahman

Research Collection School Of Computing and Information Systems

Traditional taxi fleet operators world-over have been facing intense competitions from various ride-hailing services such as Uber and Grab.Based on our studies on the taxi industry in Singapore, we see that the emergence of Uber and Grab in the ride-hailing market has greatly impacted the taxi industry: the average daily taxi ridership for the past two years has been falling continuously, by close to 20% in total. In this work, we discuss how efficient real-time data analytics and large-scale multiagent optimization technology could help taxi drivers compete against more technologically advanced service platforms. Our system has been in field trial …


Taxis Strike Back: A Field Trial Of The Driver Guidance System, Shih-Fen CHENG, Shashi Shekhar JHA, Rishikeshan RAJENDRAM 2018 Singapore Management University

Taxis Strike Back: A Field Trial Of The Driver Guidance System, Shih-Fen Cheng, Shashi Shekhar Jha, Rishikeshan Rajendram

Research Collection School Of Computing and Information Systems

Traditional taxi fleet operators world-over have been facing intense competitions from various ride-hailing services such as Uber and Grab (specific to the Southeast Asia region). Based on our studies on the taxi industry in Singapore, we see that the emergence of Uber and Grab in the ride-hailing market has greatly impacted the taxi industry: the average daily taxi ridership for the past two years has been falling continuously, by close to 20% in total. In this work, we discuss how efficient real-time data analytics and large-scale multi-agent optimization technology could potentially help taxi drivers compete against more technologically advanced service …


Analysis Of Public Transportation Patterns In A Densely Populated City With Station-Based Shared Bikes, Di WANG, Evan WU, Ah-hwee TAN 2018 Singapore Management University

Analysis Of Public Transportation Patterns In A Densely Populated City With Station-Based Shared Bikes, Di Wang, Evan Wu, Ah-Hwee Tan

Research Collection School Of Computing and Information Systems

Densely populated cities face great challenges of high transportation demand and limited physical space. Thus, in these cities, the public transportation system is heavily relied on. Conventional public transportation modes such as bus, taxi and subway have been globally deployed over the past century. In the last decade, a new type of public transportation mode, shared bike, emerged in many cities. These shared bikes are deployed by either government-regulated or profit-driven companies and are either station-based or station-less. Nonetheless, all of them are designed to better solve the last-mile problem in densely populated cities as complements to the conventional public …


Pacela: A Neural Framework For User Visitation In Location-Based Social Networks, Thanh Nam DOAN, Ee-peng LIM 2018 Singapore Management University

Pacela: A Neural Framework For User Visitation In Location-Based Social Networks, Thanh Nam Doan, Ee-Peng Lim

Research Collection School Of Computing and Information Systems

Check-in prediction using location-based social network data is an important research problem for both academia and industry since an accurate check-in predictive model is useful to many applications, e.g. urban planning, venue recommendation, route suggestion, and context-aware advertising. Intuitively, when considering venues to visit, users may rely on their past observed visit histories as well as some latent attributes associated with the venues. In this paper, we therefore propose a check-in prediction model based on a neural framework called Preference and Context Embeddings with Latent Attributes (PACELA). PACELA learns the embeddings space for the user and venue data as well …


Probabilistic Guided Exploration For Reinforcement Learning In Self-Organizing Neural Networks, Peng WANG, Weigui Jair ZHOU, Di WANG, Ah-hwee TAN 2018 Singapore Management University

Probabilistic Guided Exploration For Reinforcement Learning In Self-Organizing Neural Networks, Peng Wang, Weigui Jair Zhou, Di Wang, Ah-Hwee Tan

Research Collection School Of Computing and Information Systems

Exploration is essential in reinforcement learning, which expands the search space of potential solutions to a given problem for performance evaluations. Specifically, carefully designed exploration strategy may help the agent learn faster by taking the advantage of what it has learned previously. However, many reinforcement learning mechanisms still adopt simple exploration strategies, which select actions in a pure random manner among all the feasible actions. In this paper, we propose novel mechanisms to improve the existing knowledgebased exploration strategy based on a probabilistic guided approach to select actions. We conduct extensive experiments in a Minefield navigation simulator and the results …


Autonomous Agents In Snake Game Via Deep Reinforcement Learning, Zhepei WEI, Di WANG, Ming ZHANG, Ah-hwee TAN, Chunyan MIAO, You ZHOU 2018 Singapore Management University

Autonomous Agents In Snake Game Via Deep Reinforcement Learning, Zhepei Wei, Di Wang, Ming Zhang, Ah-Hwee Tan, Chunyan Miao, You Zhou

Research Collection School Of Computing and Information Systems

Since DeepMind pioneered a deep reinforcement learning (DRL) model to play the Atari games, DRL has become a commonly adopted method to enable the agents to learn complex control policies in various video games. However, similar approaches may still need to be improved when applied to more challenging scenarios, where reward signals are sparse and delayed. In this paper, we develop a refined DRL model to enable our autonomous agent to play the classical Snake Game, whose constraint gets stricter as the game progresses. Specifically, we employ a convolutional neural network (CNN) trained with a variant of Q-learning. Moreover, we …


Modeling Contemporaneous Basket Sequences With Twin Networks For Next-Item Recommendation, Duc Trong LE, Hady W. LAUW, Yuan FANG 2018 Singapore Management University

Modeling Contemporaneous Basket Sequences With Twin Networks For Next-Item Recommendation, Duc Trong Le, Hady W. Lauw, Yuan Fang

Research Collection School Of Computing and Information Systems

Our interactions with an application frequently leave a heterogeneous and contemporaneous trail of actions and adoptions (e.g., clicks, bookmarks, purchases). Given a sequence of a particular type (e.g., purchases)-- referred to as the target sequence, we seek to predict the next item expected to appear beyond this sequence. This task is known as next-item recommendation. We hypothesize two means for improvement. First, within each time step, a user may interact with multiple items (a basket), with potential latent associations among them. Second, predicting the next item in the target sequence may be helped by also learning from another supporting sequence …


Distributed K-Nearest Neighbor Queries In Metric Spaces, Xin DING, Yuanliang ZHANG, Lu CHEN, Yunjun GAO, Baihua ZHENG 2018 Zhejiang University

Distributed K-Nearest Neighbor Queries In Metric Spaces, Xin Ding, Yuanliang Zhang, Lu Chen, Yunjun Gao, Baihua Zheng

Research Collection School Of Computing and Information Systems

Metric k nearest neighbor (MkNN) queries have applications in many areas such as multimedia retrieval, computational biology, and location-based services. With the growing volumes of data, a distributed method is required. In this paper, we propose an Asynchronous Metric Distributed System (AMDS), which uniformly partitions the data with the pivot-mapping technique to ensure the load balancing, and employs publish/subscribe communication model to asynchronously process large scale of queries. The employment of asynchronous processing model also improves robustness and efficiency of AMDS. In addition, we develop an efficient estimation based MkNN method using AMDS to improve the query efficiency. Extensive experiments …


Adopt: Combining Parameter Tuning And Adaptive Operator Ordering For Solving A Class Of Orienteering Problems, Aldy GUNAWAN, Hoong Chuin LAU, Kun LU 2018 Singapore Management University

Adopt: Combining Parameter Tuning And Adaptive Operator Ordering For Solving A Class Of Orienteering Problems, Aldy Gunawan, Hoong Chuin Lau, Kun Lu

Research Collection School Of Computing and Information Systems

Two fundamental challenges in local search based metaheuristics are how to determine parameter configurations and design the underlying Local Search (LS) procedure. In this paper, we propose a framework in order to handle both challenges, called ADaptive OPeraTor Ordering (ADOPT). In this paper, The ADOPT framework is applied to two metaheuristics, namely Iterated Local Search (ILS) and a hybridization of Simulated Annealing and ILS (SAILS) for solving two variants of the Orienteering Problem: the Team Dependent Orienteering Problem (TDOP) and the Team Orienteering Problem with Time Windows (TOPTW). This framework consists of two main processes. The Design of Experiment (DOE) …


A Bayesian Latent Variable Model Of User Preferences With Item Context, Aghiles SALAH, Hady W. LAUW 2018 Singapore Management University

A Bayesian Latent Variable Model Of User Preferences With Item Context, Aghiles Salah, Hady W. Lauw

Research Collection School Of Computing and Information Systems

Personalized recommendation has proven to be very promising in modeling the preference of users over items. However, most existing work in this context focuses primarily on modeling user-item interactions, which tend to be very sparse. We propose to further leverage the item-item relationships that may reflect various aspects of items that guide users’ choices. Intuitively, items that occur within the same “context” (e.g., browsed in the same session, purchased in the same basket) are likely related in some latent aspect. Therefore, accounting for the item’s context would complement the sparse user-item interactions by extending a user’s preference to other items …


Striving To Earn More: A Survey Of Work Strategies And Tool Use Among Crowd Workers, Toni KAPLAN, Susumu SAITO, Kotaro HARA, Jeffrey P. BIGHAM 2018 Carnegie Mellon University

Striving To Earn More: A Survey Of Work Strategies And Tool Use Among Crowd Workers, Toni Kaplan, Susumu Saito, Kotaro Hara, Jeffrey P. Bigham

Research Collection School Of Computing and Information Systems

Earning money is a primary motivation for workers on Amazon Mechanical Turk, but earning a good wage is difficult because work that pays well is not easily identified and can be time-consuming to find. We explored the strategies that both low- and high-earning workers use to find and complete tasks via a survey of 360 workers. Nearly all workers surveyed had earning money as their primary goal, and workers used many of the same tools (browser extensions and scripts) and strategies in an attempt to earn more money, regardless of earning level. However, high-earning workers used more tools, were more …


Mining Temporal Activity Patterns On Social Media, Nikan Chavoshi 2018 University of New Mexico

Mining Temporal Activity Patterns On Social Media, Nikan Chavoshi

Computer Science ETDs

Social media provide communication networks for their users to easily create and share content. Automated accounts, called bots, abuse these platforms by engaging in suspicious and/or illegal activities. Bots push spam content and participate in sponsored activities to expand their audience. The prevalence of bot accounts in social media can harm the usability of these platforms, and decrease the level of trustworthiness in them. The main goal of this dissertation is to show that temporal analysis facilitates detecting bots in social media. I introduce new bot detection techniques which exploit temporal information. Since automated accounts are controlled by computer programs, …


Face Detection Using Deep Learning: An Improved Faster Rcnn Approach, Xudong SUN, Pengcheng WU, Steven C. H. HOI 2018 DeepIR Inc

Face Detection Using Deep Learning: An Improved Faster Rcnn Approach, Xudong Sun, Pengcheng Wu, Steven C. H. Hoi

Research Collection School Of Computing and Information Systems

In this paper, we present a new face detection scheme using deep learning and achieve the state-of-the-art detection performance on the well-known FDDB face detection benchmark evaluation. In particular, we improve the state-of-the-art Faster RCNN framework by combining a number of strategies, including feature concatenation, hard negative mining, multi-scale training, model pre-training, and proper calibration of key parameters. As a consequence, the proposed scheme obtained the state-of-the-art face detection performance and was ranked as one of the best models in terms of ROC curves of the published methods on the FDDB benchmark


Deeptravel: A Neural Network Based Travel Time Estimation Model With Auxiliary Supervision, Hanyuan ZHANG, Hao WU, Weiwei SUN, Baihua ZHENG 2018 Fudan University

Deeptravel: A Neural Network Based Travel Time Estimation Model With Auxiliary Supervision, Hanyuan Zhang, Hao Wu, Weiwei Sun, Baihua Zheng

Research Collection School Of Computing and Information Systems

Estimating the travel time of a path is of great importance to smart urban mobility. Existing approaches are either based on estimating the time cost of each road segment or designed heuristically in a non-learning-based way. The former is not able to capture many cross-segment complex factors while the latter fails to utilize the existing abundant temporal labels of the data, i.e., the time stamp of each trajectory point. In this paper, we leverage on new development of deep neural networks and propose a novel auxiliary supervision model, namely DeepTravel, that can automatically and effectively extract different features, as well …


Online Active Learning With Expert Advice, Shuji HAO, Peiying HU, Peilin ZHAO, Steven C. H. HOI, Chunyan MIAO 2018 Institute of High Performance of Computing

Online Active Learning With Expert Advice, Shuji Hao, Peiying Hu, Peilin Zhao, Steven C. H. Hoi, Chunyan Miao

Research Collection School Of Computing and Information Systems

In literature, learning with expert advice methods usually assume that a learner always obtain the true label of every incoming training instance at the end of each trial. However, in many real-world applications, acquiring the true labels of all instances can be both costly and time consuming, especially for large-scale problems. For example, in the social media, data stream usually comes in a high speed and volume, and it is nearly impossible and highly costly to label all of the instances. In this article, we address this problem with active learning with expert advice, where the ground truth of an …


Kbase: The United States Department Of Energy Systems Biology Knowledgebase, Adam P. Arkin, Robert W. Cottingham, Christopher S. Henry, Nomi L. Harris, Rick L. Stevens, Sergei Maslov, Doreen Ware, Fernando Perez, Shane Canon, Michael W. Sneddon, Matthew L. Henderson, William J. Riehl, Dan Murphy-Olson, Stephen Y. Chan, Roy T. Kamimura, Sunita Kumari, Meghan M. Drake, Thomas S. Brettin, Elizabeth M. Glass, Dylan Chivian, Dan Gunter, David J. Weston, Benjamin H. Allen, Jason Baumohl, Nathan L. Tintle 2018 University of California - Berkeley

Kbase: The United States Department Of Energy Systems Biology Knowledgebase, Adam P. Arkin, Robert W. Cottingham, Christopher S. Henry, Nomi L. Harris, Rick L. Stevens, Sergei Maslov, Doreen Ware, Fernando Perez, Shane Canon, Michael W. Sneddon, Matthew L. Henderson, William J. Riehl, Dan Murphy-Olson, Stephen Y. Chan, Roy T. Kamimura, Sunita Kumari, Meghan M. Drake, Thomas S. Brettin, Elizabeth M. Glass, Dylan Chivian, Dan Gunter, David J. Weston, Benjamin H. Allen, Jason Baumohl, Nathan L. Tintle

Faculty Work Comprehensive List

No abstract provided.


Digital Commons powered by bepress