Open Access. Powered by Scholars. Published by Universities.®

Library and Information Science Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 8 of 8

Full-Text Articles in Library and Information Science

Leveraging One-Class Svm And Semantic Analysis To Detect Anomalous Content, Ozgur Yilmazel, Svetlana Symonenko, Niranjan Balasubramanian, Elizabeth D. Liddy Jan 2005

Leveraging One-Class Svm And Semantic Analysis To Detect Anomalous Content, Ozgur Yilmazel, Svetlana Symonenko, Niranjan Balasubramanian, Elizabeth D. Liddy

School of Information Studies - Faculty Scholarship

Experiments were conducted to test several hypotheses on methods for improving document classification for the malicious insider threat problem within the Intelligence Community. Bag-of-words (BOW) representations of documents were compared to Natural Language Processing (NLP) based representations in both the typical and one-class classification problems using the Support Vector Machine algorithm. Results show that the NLP features significantly improved classifier performance over the BOW approach both in terms of precision and recall, while using many fewer features. The one-class algorithm using NLP features demonstrated robustness when tested on new domains.


Improved Document Representation For Classification Tasks For The Intelligence Community, Elizabeth D. Liddy, Ozgur Yilmazel, Svetlana Symonenko, Niranjan Balasubramanian Jan 2005

Improved Document Representation For Classification Tasks For The Intelligence Community, Elizabeth D. Liddy, Ozgur Yilmazel, Svetlana Symonenko, Niranjan Balasubramanian

School of Information Studies - Faculty Scholarship

This research addresses the question of whether the AI technologies of Natural Language Processing (NLP) and Machine Learning (ML) can be used to improve security within the Intelligence Community (IC).


Hands-On Nlp For An Interdisciplinary Audience, Elizabeth D. Liddy, Nancy Mccracken Jan 2005

Hands-On Nlp For An Interdisciplinary Audience, Elizabeth D. Liddy, Nancy Mccracken

School of Information Studies - Faculty Scholarship

The need for a single NLP offering for a diverse mix of graduate students (including computer scientists, information scientists, and linguists) has motivated us to develop a course that provides students with a breadth of understanding of the scope of real world applications, as well as depth of knowledge of the computational techniques on which to build in later experiences. We describe the three hands-on tasks for the course that have proven successful, namely: 1) in-class group simulations of computational processes; 2) team posters and public presentations on state-of-the-art commercial NLP applications, and; 3) team projects implementing various levels of …


Document Retrieval, Automatic, Elizabeth D. Liddy Jan 2005

Document Retrieval, Automatic, Elizabeth D. Liddy

School of Information Studies - Faculty Scholarship

Document Retrieval is the computerized process of producing a relevance ranked list of documents in response to an inquirer’s request by comparing their request to an automatically produced index of the documents in the system. Everyone uses such systems today in the form of web-based search engines. While evolving from a fairly small discipline in the 1940s, to a large, profitable industry today, the field has maintained a healthy research focus, supported by test collections and large-scale annual comparative tests of systems. A document retrieval system is comprised of three core modules: document processor, query analyzer, and matching function. There …


Effective Work Practices For Floss Development: A Model And Propositions, Kevin Crowston, Hala Annabi, James Howison, Chengetai Masango Jan 2005

Effective Work Practices For Floss Development: A Model And Propositions, Kevin Crowston, Hala Annabi, James Howison, Chengetai Masango

School of Information Studies - Faculty Scholarship

We review the literature on Free/Libre Open Source Software (FLOSS) development and on software development, distributed work and teams more generally to develop a theoretical model to explain the performance of FLOSS teams. The proposed model is based on Hackman’s [1] model of effectiveness of work teams, with coordination theory [2] and collective mind [3] to extend Hackman’s model by elaborating team practices relevant to effectiveness in software development. We propose a set of propositions to guide further research.


Coordination Of Free/Libre Open Source Software Development, Kevin Crowston, Kangning Wei, Qing Li, U Yeliz Eseryel, James Howison Jan 2005

Coordination Of Free/Libre Open Source Software Development, Kevin Crowston, Kangning Wei, Qing Li, U Yeliz Eseryel, James Howison

School of Information Studies - Faculty Scholarship

The apparent success of free/libre open source software (FLOSS) development projects such as Linux, Apache, and many others has raised the question, what lessons from FLOSS development can be transferred to mainstream software development? In this paper, we use coordination theory to analyze coordination mechanisms in FLOSS development and compare our analysis with existing literature on coordination in proprietary software development. We examined developer interaction data from three active and successful FLOSS projects and used content analysis to identify the coordination mechanisms used by the participants. We found that there were similarities between the FLOSS groups and the reported practices …


The Implications Of Property Rights In Virtual World Business Models, Ian Macinnes Jan 2005

The Implications Of Property Rights In Virtual World Business Models, Ian Macinnes

School of Information Studies - Faculty Scholarship

The financial success of online communities based on multiplayer game environments has been a bright spot among the many failures in electronic commerce initiatives. While this form of business has existed for less than a decade, it is growing rapidly and has become a mainstream form of entertainment in some areas of the world, such as Korea. Game environments are becoming more immersive and compelling and if this rate of improvement continues, such as through growing broadband penetration, they are likely to become as common as other forms of entertainment. This paper analyzes the issues facing developers of game communities …


Genres Of Digital Documents: Introduction To The Special Issue., Barbara H. Kwasnik, Kevin Crowston Jan 2005

Genres Of Digital Documents: Introduction To The Special Issue., Barbara H. Kwasnik, Kevin Crowston

School of Information Studies - Faculty Scholarship

Purpose – To introduce the special issue on “Genres of digital documents.” While there are many definitions of genre, most include consideration of the intended communicative purpose, form and sometimes expected content of a document. Most also include the notion of social acceptance, that a document is of a particular genre to the extent that it is recognized as such within a given discourse community. Design/methodology/approach – The article reviews the notion of document genre and its applicability to studies of digital documents and introduces the four articles in the special issue. Findings – Genre can be studied based on …