Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Keyword
-
- Semantic Sensor Web (5)
- Twitter (4)
- Coordination (3)
- Semantic Perception (3)
- Biomedicine (2)
-
- Crisis Computing (2)
- Crisis Informatics (2)
- Emergency Response (2)
- IExplore (2)
- Machine Perception (2)
- Sensor Data (2)
- Social Media (2)
- Social Networks (2)
- Abstraction (1)
- Activity-Influence-Diffusion (AID) Identity (1)
- Ad-Hoc Registration (1)
- Anomaly Detection (1)
- Attribute Behavior Similarity (1)
- Attribute Usage Diversity (1)
- BKR (1)
- Beta-PDF (1)
- Binary (1)
- Binary Classification (1)
- Biomedical Data Integration (1)
- Biomedical Knowledge Discovery (1)
- Biomedical Ontologies (1)
- Chagas Disease (1)
- Cirrocumulus (1)
- Classifier Training (1)
- Cloud (1)
Articles 1 - 30 of 37
Full-Text Articles in Physical Sciences and Mathematics
Are Twitter Users Equal In Predicting Elections? A Study Of User Groups In Predicting 2012 U.S. Republican Primaries, Lu Chen, Wenbo Wang, Amit P. Sheth
Are Twitter Users Equal In Predicting Elections? A Study Of User Groups In Predicting 2012 U.S. Republican Primaries, Lu Chen, Wenbo Wang, Amit P. Sheth
Kno.e.sis Publications
Existing studies on predicting election results are under the assumption that all the users should be treated equally. However, recent work [14] shows that social media users from different groups (e.g., “silent majority” vs. “vocal minority”) have significant differences in the generated content and tweeting behavior. The effect of these differences on predicting election results has not been exploited yet. In this paper, we study the spectrum of Twitter users who participate in the on-line discussion of 2012 U.S. Republican Presidential Primaries, and examine the predictive power of different user groups (e.g., highly engaged users vs. lowly engaged users, right-leaning …
The Ssn Ontology Of The W3c Semantic Sensor Network Incubator Group, Michael Compton, Payam Barnaghi, Luis Bermudez, Raul Garcia-Castro, Oscar Corcho, Simon Cox, John Graybeal, Manfred Hauswirth, Cory Andrew Henson, Arthur Herzog, Vincent Huang, Krzysztof Janowicz, W. David Kelsey, Danh Le Phuoc, Laurent Lefort, Myriam Leggieri, Holger Neuhaus, Andriy Nikolov, Kevin Page, Alexandre Passant, Amit P. Sheth, Kerry Taylor
The Ssn Ontology Of The W3c Semantic Sensor Network Incubator Group, Michael Compton, Payam Barnaghi, Luis Bermudez, Raul Garcia-Castro, Oscar Corcho, Simon Cox, John Graybeal, Manfred Hauswirth, Cory Andrew Henson, Arthur Herzog, Vincent Huang, Krzysztof Janowicz, W. David Kelsey, Danh Le Phuoc, Laurent Lefort, Myriam Leggieri, Holger Neuhaus, Andriy Nikolov, Kevin Page, Alexandre Passant, Amit P. Sheth, Kerry Taylor
Kno.e.sis Publications
The W3C Semantic Sensor Network Incubator group (the SSN-XG) produced an OWL 2 ontology to describe sensors and observations — the SSN ontology, available at http://purl.oclc.org/NET/ssnx/ssn. The SSN ontology can describe sensors in terms of capabilities, measurement processes, observations and deployments. This article describes the SSN ontology. It further gives an example and describes the use of the ontology in recent research projects.
User Taglines: Alternative Presentations Of Expertise And Interest In Social Media, Hemant Purohit, Alex Dow, Omar Alonso, Lei Duan, Kevin Haas
User Taglines: Alternative Presentations Of Expertise And Interest In Social Media, Hemant Purohit, Alex Dow, Omar Alonso, Lei Duan, Kevin Haas
Kno.e.sis Publications
Web applications are increasingly showing recommended users from social media along with some descriptions, an attempt to show relevancy - why they are being shown. For example, Twitter search for a topical keyword shows expert twitterers on the side for 'whom to follow'. Google+ and Facebook also recommend users to follow or add to friend circle. Popular Internet newspaper- The Huffington Post shows Twitter influencers/ experts on the side of an article for authoritative relevant tweets. The state of the art shows user profile bios as summary for Twitter experts, but it has issues with length constraint imposed by user …
Demonstration: Dynamic Sensor Registration And Semantic Processing For Ad-Hoc Mobile Environments (Semmob), Pramod Anantharam, Gary Alan Smith, Josh Pschorr, Krishnaprasad Thirunarayan, Amit P. Sheth
Demonstration: Dynamic Sensor Registration And Semantic Processing For Ad-Hoc Mobile Environments (Semmob), Pramod Anantharam, Gary Alan Smith, Josh Pschorr, Krishnaprasad Thirunarayan, Amit P. Sheth
Kno.e.sis Publications
SemMOB enables dynamic registration of sensors via mobile devices, search, and near real-time inference over sensor observations in ad-hoc mobile environments (e.g., fire fighting). We demonstrate SemMOB in the context of an emergency response use case that requires automatic and dynamic registrations of sensor devices and annotation of sensor observations, decoding of latitude-longitude information in terms of human sensible names, fusion and abstraction of sensor values using background knowledge, and their visualization using mash-up.
Using Social Influence To Predict Subscriber Churn, Derek Doran, Veena Mendiratta, Chitra Phadke, Dan Kushnir, Huseyin Uzunalioglu
Using Social Influence To Predict Subscriber Churn, Derek Doran, Veena Mendiratta, Chitra Phadke, Dan Kushnir, Huseyin Uzunalioglu
Kno.e.sis Publications
The saturation of mobile phone markets has resulted in rising costs for operators to obtain new customers. These operators thus focus their energies on identifying users that will churn so they can be targeted for retention campaigns. Typical churn prediction algorithms identify churners based on service usage metrics, network performance indicators, and demographic information. Social and peer-influence to churn, however, is usually not considered. In this paper, we describe a new churn prediction algorithm that incorporates the influence churners spread to their social peers. Using data from a major service provider, we show that social influence improves churn prediction and …
Understanding User Triads On Facebook, Derek Doran, Alberta De La Rosa Algarin, Swapna S. Gokhale
Understanding User Triads On Facebook, Derek Doran, Alberta De La Rosa Algarin, Swapna S. Gokhale
Kno.e.sis Publications
Contemporary approaches that analyze user behavior on online social networks only consider interactions among dyads, which are pairs of directly connected users. A large body of sociological work, however, suggests that mutual connections among users can influence their activities, leading to differences between two- and three-way interactions. This paper explores the dynamics of triads among Facebook users based on the wall posts from the New Orleans regional network. Initially, each connection is categorized as a close friendship or an acquiantance, contingent on the number of wall posts exchanged. Subsequently, the impact of different types of connections comprising triads is examined …
How I Would Like Semantic Web To Be, For My Children., Raghava Mutharaju
How I Would Like Semantic Web To Be, For My Children., Raghava Mutharaju
Kno.e.sis Publications
Semantic Web, since its inception, has gone through lot of developments in its relatively nascent existence; right from people's perception, to the standards and to its adoption by the industry and more importantly by the scientific community. This impressive growth only seems to increase. In this paper, we project this growth to the next 10 years and highlight some of the facets on which Semantic Web could have a major impact on. We also present the challenges that Semantic Web and its community has to deal with in order to get there.
Iexplore: Interactive Browsing And Exploring Biomedical Knowledge, Vinh Nguyen, Olivier Bodenreider, Jagannathan Srinivasan, Todd Minning, Thomas Rindflesch, Bastien Rance, Ramakanth Kavuluru, Himi Yalamanchili, Krishnaprasad Thirunarayan, Satya S. Sahoo, Amit P. Sheth
Iexplore: Interactive Browsing And Exploring Biomedical Knowledge, Vinh Nguyen, Olivier Bodenreider, Jagannathan Srinivasan, Todd Minning, Thomas Rindflesch, Bastien Rance, Ramakanth Kavuluru, Himi Yalamanchili, Krishnaprasad Thirunarayan, Satya S. Sahoo, Amit P. Sheth
Kno.e.sis Publications
We present iExplore, a Semantic Web based application that helps biomedical researchers study and explore biomedical knowledge interactively. iExplore uses the Biomedical Knowledge Repository (BKR), which integrates knowledge from various sources ranging from information extracted from biomedical literature (from PubMed) to many structured vocabularies in the Unified Medical Language System (UMLS). The current version of BKR provides a unified provenance representation for 12 million semantic predications (triples with a predicate connecting a subject and an object) derived from 87 vocabulary families in the UMLS and 14 million predications extracted from 21 million PubMed abstracts. To engage the domain experts in …
An Efficient Bit Vector Approach To Semantics-Based Machine Perception In Resource-Constrained Devices, Cory Andrew Henson, Krishnaprasad Thirunarayan, Amit P. Sheth
An Efficient Bit Vector Approach To Semantics-Based Machine Perception In Resource-Constrained Devices, Cory Andrew Henson, Krishnaprasad Thirunarayan, Amit P. Sheth
Kno.e.sis Publications
The primary challenge of machine perception is to define efficient computational methods to derive high-level knowledge from low-level sensor observation data. Emerging solutions are using ontologies for expressive representation of concepts in the domain of sensing and perception, which enable advanced integration and interpretation of heterogeneous sensor data. The computational complexity of OWL, however, seriously limits its applicability and use within resource-constrained environments, such as mobile devices. To overcome this issue, we employ OWL to formally define the inference tasks needed for machine perception – explanation and discrimination – and then provide efficient algorithms for these tasks, using bit-vector encodings …
Privacy Preserving Boosting In The Cloud With Secure Half-Space Queries, Shumin Guo, Keke Chen
Privacy Preserving Boosting In The Cloud With Secure Half-Space Queries, Shumin Guo, Keke Chen
Kno.e.sis Publications
This paper presents a preliminary study on the PerturBoost approach that aims to provide efficient and secure classifier learning in the cloud with both data and model privacy preserved.
Computing Perception From Sensor Data, Payam Barnaghi, Frieder Ganz, Cory Andrew Henson, Amit P. Sheth
Computing Perception From Sensor Data, Payam Barnaghi, Frieder Ganz, Cory Andrew Henson, Amit P. Sheth
Kno.e.sis Publications
This paper describes a framework for perception creation from sensor data. We propose using data abstraction techniques, in particular Symbolic Aggregate Approximation (SAX), to analyse and create patterns from sensor data. The created patterns are then linked to semantic descriptions that define thematic, spatial and temporal features, providing highly granular abstract representation of the raw sensor data. This helps to reduce the size of the data that needs to be communicated from the sensor nodes to the gateways or highlevel processing components. We then discuss a method that uses abstract patterns created by SAX method and occurrences of different observations …
Using Attribute Behavior Diversity To Build Accurate Decision Tree Committees For Microarray Data, Qian Han, Guozhu Dong
Using Attribute Behavior Diversity To Build Accurate Decision Tree Committees For Microarray Data, Qian Han, Guozhu Dong
Kno.e.sis Publications
DNA microarrays (gene chips), frequently used in biological and medical studies, measure the expressions of thousands of genes per sample. Using microarray data to build accurate classifiers for diseases is an important task. This paper introduces an algorithm, called Committee of Decision Trees by Attribute Behavior Diversity (CABD), to build highly accurate ensembles of decision trees for such data. Since a committee's accuracy is greatly influenced by the diversity among its member classifiers, CABD uses two new ideas to "optimize" that diversity, namely (1) the concept of attribute behavior–based similarity between attributes, and (2) …
Twitris+: Social Media Analytics Platform For Effective Coordination, Gary Alan Smith, Amit P. Sheth, Ashutosh Sopan Jadhav, Hemant Purohit, Lu Chen, Michael Cooney, Pavan Kapanipathi, Pramod Anantharam, Pramod Koneru, Wenbo Wang
Twitris+: Social Media Analytics Platform For Effective Coordination, Gary Alan Smith, Amit P. Sheth, Ashutosh Sopan Jadhav, Hemant Purohit, Lu Chen, Michael Cooney, Pavan Kapanipathi, Pramod Anantharam, Pramod Koneru, Wenbo Wang
Kno.e.sis Publications
Twitris+ is a Semantic Social Media analytics platform to provide technologies for analyzing large-scale social media streams across Spatio-Temporal-Thematic (STT) and People-Content-Network (PCN) dimensions. It provides holistic situational awareness from one interface and enables organizational actors to engage in well-coordinated ways for desired tasks during emergency response.
The Latent Maximum Entropy Principle, Shaojun Wang, Dale Schuurmans, Yunxin Zhao
The Latent Maximum Entropy Principle, Shaojun Wang, Dale Schuurmans, Yunxin Zhao
Kno.e.sis Publications
We present an extension to Jaynes’ maximum entropy principle that incorporates latent variables. The principle of latent maximum entropy we propose is different from both Jaynes’ maximum entropy principle and maximum likelihood estimation, but can yield better estimates in the presence of hidden variables and limited training data. We first show that solving for a latent maximum entropy model poses a hard nonlinear constrained optimization problem in general. However, we then show that feasible solutions to this problem can be obtained efficiently for the special case of log-linear models---which forms the basis for an efficient approximation to the latent maximum …
Exploring The Impact Of Knowledge And Social Environment On Influenza Prevention And Transmission In Midwestern United States High School Students, William L. Romine, Tanvi Banerjee, William S. Barrow, William R. Folk
Exploring The Impact Of Knowledge And Social Environment On Influenza Prevention And Transmission In Midwestern United States High School Students, William L. Romine, Tanvi Banerjee, William S. Barrow, William R. Folk
Kno.e.sis Publications
We used data from a convenience sample of 410 Midwestern United States students from six secondary schools to develop parsimonious models for explaining and predicting precautions and illness related to influenza. Scores for knowledge and perceptions were obtained using two-parameter Item Response Theory (IRT) models. Relationships between outcome variables and predictors were verified using Pearson and Spearman correlations, and nested [student within school] fixed effects multinomial logistic regression models were specified from these using Akaike’s Information Criterion (AIC). Neural network models were then formulated as classifiers using 10-fold cross validation to predict precautions and illness. Perceived barriers against taking precautions …
What Kind Of #Communication Is Twitter? A Psycholinguistic Perspective On Communication In Twitter For The Purpose Of Emergency Coordination, Hemant Purohit, Andrew Hampton, Valerie L. Shalin, Amit P. Sheth, John Flach
What Kind Of #Communication Is Twitter? A Psycholinguistic Perspective On Communication In Twitter For The Purpose Of Emergency Coordination, Hemant Purohit, Andrew Hampton, Valerie L. Shalin, Amit P. Sheth, John Flach
Kno.e.sis Publications
The present research aims to detect coordinated citizen response within social media traffic to assist emergency response. We use domain-independent linguistic properties as the first step in narrowing the candidate set of messages for domain-dependent and computationally intensive analysis.
Iexplore: A Provenance-Based Application For Exploring Biomedical Knowledge, Vinh Nguyen, Olivier Bodenreider, Thomas Rindflesch, Amit P. Sheth
Iexplore: A Provenance-Based Application For Exploring Biomedical Knowledge, Vinh Nguyen, Olivier Bodenreider, Thomas Rindflesch, Amit P. Sheth
Kno.e.sis Publications
No abstract provided.
W3c Semantic Sensor Networks: Ontologies, Applications, And Future Directions, Cory Andrew Henson
W3c Semantic Sensor Networks: Ontologies, Applications, And Future Directions, Cory Andrew Henson
Kno.e.sis Publications
Plenary Talk discussing the W3C Semantic Sensor Network, including the ontology, applications, and future directions.
A Web-Based Study Of Self-Treatment Of Opioid Withdrawal Symptoms With Loperamide, Raminta Daniulaityte, Robert G. Carlson, Russel S. Falck, Delroy H. Cameron, Sujan Udayanaga, Lu Chen, Amit P. Sheth
A Web-Based Study Of Self-Treatment Of Opioid Withdrawal Symptoms With Loperamide, Raminta Daniulaityte, Robert G. Carlson, Russel S. Falck, Delroy H. Cameron, Sujan Udayanaga, Lu Chen, Amit P. Sheth
Kno.e.sis Publications
Aims: Many websites provide a medium for individuals to freely share their experiences and knowledge about different drugs. Such user-generated content can be used as a rich data source to study emerging drug use practices and trends. The study aims to examine web-based reports of loperamide use practices among non-medical opioid users. Loperamide, a piperidine derivative, is an opioid agonist approved for the control of diarrhea symptoms. Because of its general inability to cross the blood-brain barrier, it is considered to have no abuse potential and is available without a prescription. Methods: A website that allows free discussion of illicit …
Prediction Of Topic Volume On Twitter, Yiye Ruan, Hemant Purohit, David Fuhry, Srinivasan Parthasarathy, Amit P. Sheth
Prediction Of Topic Volume On Twitter, Yiye Ruan, Hemant Purohit, David Fuhry, Srinivasan Parthasarathy, Amit P. Sheth
Kno.e.sis Publications
We discuss an approach for predicting microscopic (individual) and macroscopic (collective) user behavioral patterns with respect to specific trending topics on Twitter. Going beyond previous efforts that have analyzed driving factors in whether and when a user will publish topic-relevant tweets, here we seek to predict the strength of content generation which allows more accurate understanding of Twitter users' behavior and more effective utilization of the online social network for diffusing information. Unlike traditional approaches, we consider multiple dimensions into one regression-based prediction framework covering network structure, user interaction, content characteristics and past activity. Experimental results on three large Twitter …
Topical Anomaly Detection From Twitter Streams, Pramod Anantharam, Krishnaprasad Thirunarayan, Amit P. Sheth
Topical Anomaly Detection From Twitter Streams, Pramod Anantharam, Krishnaprasad Thirunarayan, Amit P. Sheth
Kno.e.sis Publications
No abstract provided.
Semantic Aspects Of Earthcube, Pascal Hitzler, Krzysztof Janowicz, Gary Berg-Cross, Leo Obrst, Amit P. Sheth, Timothy Finin, Isabel F. Cruz
Semantic Aspects Of Earthcube, Pascal Hitzler, Krzysztof Janowicz, Gary Berg-Cross, Leo Obrst, Amit P. Sheth, Timothy Finin, Isabel F. Cruz
Kno.e.sis Publications
In this document, we give a high-level overview of selected Semantic (Web) technologies, methods, and other important considerations, that are relevant for the success of EarthCube. The goal of this initial document is to provide entry points and references for discussions between the Semantic Technologies experts and the domain experts within EarthCube. The selected topics are intended to ground the EarthCube roadmap in the state of the art in semantics research and ontology engineering.
We anticipate that this document will evolve as EarthCube progresses. Indeed, all EarthCube parties are asked to provide topics of importance that should be treated in …
Trust Networks, Krishnaprasad Thirunarayan, Pramod Anantharam, Cory Andrew Henson, Amit P. Sheth
Trust Networks, Krishnaprasad Thirunarayan, Pramod Anantharam, Cory Andrew Henson, Amit P. Sheth
Kno.e.sis Publications
Trust relationships occur naturally in many diverse contexts such as collaborative systems, e-commerce, interpersonal interactions, social networks, semantic sensor web, etc. As collaborating agents providing content and services become increasingly removed from the agents that consume them, the issue of robust trust inference and update become critical. There is a need to find online substitutes for traditional (direct or face-to-face) cues to derive measures of trust, and create efficient and secure system for managing trust, to support decision-making. Unfortunately, there is neither a universal notion of trust that is applicable to all domains nor a clear explication of its semantics …
Localized Deconvolution: Characterizing Nmr-Based Metabolomics Spectroscopic Data Using Localized High-Throughput Deconvolution, Paul E. Anderson, Ajith H. Ranabahu, Deirdre A. Mahle, Nicholas V. Reo, Michael L. Raymer, Amit P. Sheth, Nicholas J. Delraso
Localized Deconvolution: Characterizing Nmr-Based Metabolomics Spectroscopic Data Using Localized High-Throughput Deconvolution, Paul E. Anderson, Ajith H. Ranabahu, Deirdre A. Mahle, Nicholas V. Reo, Michael L. Raymer, Amit P. Sheth, Nicholas J. Delraso
Kno.e.sis Publications
The interpretation of nuclear magnetic resonance (NMR) experimental results for metabolomics studies requires intensive signal processing and multivariate data analysis techniques. Standard quantification techniques attempt to minimize effects from variations in peak positions caused by sample pH, ionic strength, and composition. These techniques fail to account for adjacent signals which can lead to drastic quantification errors. Attempts at full spectrum deconvolution have been limited in adoption and development due to the computational resources required. Herein, we develop a novel localized deconvolution algorithm for general purpose quantification of NMR-based metabolomics studies. Localized deconvolution decreases average absolute quantification error by 97% and …
Framework For The Analysis Of Coordination In Crisis Response, Hemant Purohit, Andrew Hampton, Valerie L. Shalin, Amit P. Sheth, John M. Flach
Framework For The Analysis Of Coordination In Crisis Response, Hemant Purohit, Andrew Hampton, Valerie L. Shalin, Amit P. Sheth, John M. Flach
Kno.e.sis Publications
Social Media play a critical role during crisis events, revealing a natural coordination dynamic. We propose a computational framework guided by social science principles to measure, analyze, and understand coordination among the different types of organizations and actors in crisis response. The analysis informs both the scientific account of cooperative behavior and the design of applications and protocols to support crisis management.
Discovering Fine-Grained Sentiment In Suicide Notes, Wenbo Wang, Lu Chen, Ming Tan, Shaojun Wang, Amit P. Sheth
Discovering Fine-Grained Sentiment In Suicide Notes, Wenbo Wang, Lu Chen, Ming Tan, Shaojun Wang, Amit P. Sheth
Kno.e.sis Publications
This paper presents our solution for the i2b2 sentiment classification challenge. Our hybrid system consists of machine learning and rule-based classifiers. For the machine learning classifier, we investigate a variety of lexical, syntactic and knowledge-based features, and show how much these features contribute to the performance of the classifier through experiments. For the rule-based classifier, we propose an algorithm to automatically extract effective syntactic and lexical patterns from training examples. The experimental results show that the rule-based classifier outperforms the baseline machine learning classifier using unigram features. By combining the machine learning classifier and the rule-based classifier, the hybrid system …
A Semantic Problem Solving Environment For Integrative Parasite Research: Identification Of Intervention Targets For Trypanosoma Cruzi, Priti Parikh, Todd Minning, Vinh Nguyen, Sarasi Lalithsena, Amir H. Asiaee, Satya S. Sahoo, Prashant Doshi, Rick L. Tarleton, Amit P. Sheth
A Semantic Problem Solving Environment For Integrative Parasite Research: Identification Of Intervention Targets For Trypanosoma Cruzi, Priti Parikh, Todd Minning, Vinh Nguyen, Sarasi Lalithsena, Amir H. Asiaee, Satya S. Sahoo, Prashant Doshi, Rick L. Tarleton, Amit P. Sheth
Kno.e.sis Publications
Background: Research on the biology of parasites requires a sophisticated and integrated computational platform to query and analyze large volumes of data, representing both unpublished (internal) and public (external) data sources. Effective analysis of an integrated data resource using knowledge discovery tools would significantly aid biologists in conducting their research, for example, through identifying various intervention targets in parasites, and in deciding the future direction of ongoing as well as planned projects. A key challenge in achieving this objective is the heterogeneity between the internal lab data, usually stored as flat files, Excel spreadsheets or custom-built databases, and the external …
Role Of Semantic Web In Health Informatics, Satya S. Sahoo, Guo-Qiang Zhang, Amit P. Sheth
Role Of Semantic Web In Health Informatics, Satya S. Sahoo, Guo-Qiang Zhang, Amit P. Sheth
Kno.e.sis Publications
This tutorial weaves together three themes and the associated topics: [1] The role of biomedical ontologies [2] Key Semantic Web technologies with focus on Semantic provenance and integration [3] In-practice tools and real world use cases built to serve the needs of sleep medicine researchers, cardiologists involved in clinical practice, and work on vaccine development for human pathogens.
Resident Identification Using Kinect Depth Image Data And Fuzzy Clustering Techniques, Tanvi Banerjee, James M. Keller, Marjorie Skubic
Resident Identification Using Kinect Depth Image Data And Fuzzy Clustering Techniques, Tanvi Banerjee, James M. Keller, Marjorie Skubic
Kno.e.sis Publications
As a part of our passive fall risk assessment research in home environments, we present a method to identify older residents using features extracted from their gait information from a single depth camera. Depth images have been collected continuously for about eight months from several apartments at a senior housing facility. Shape descriptors such as bounding box information and image moments were extracted from silhouettes of the depth images. The features were then clustered using Possibilistic C Means for resident identification. This technology will allow researchers and health professionals to gather more information on the individual residents by filtering out …
A Scalable Distributed Syntactic, Semantic And Lexical Language Model, Ming Tan, Wenli Zhou, Lei Zheng, Shaojun Wang
A Scalable Distributed Syntactic, Semantic And Lexical Language Model, Ming Tan, Wenli Zhou, Lei Zheng, Shaojun Wang
Kno.e.sis Publications
This paper presents an attempt at building a large scale distributed composite language model that is formed by seamlessly integrating an n-gram model, a structured language model, and probabilistic latent semantic analysis under a directed Markov random field paradigm to simultaneously account for local word lexical information, mid-range sentence syntactic structure, and long-span document semantic content. The composite language model has been trained by performing a convergent N-best list approximate EM algorithm and a follow-up EM algorithm to improve word prediction power on corpora with up to a billion tokens and stored on a supercomputer. The large scale distributed composite …