Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Databases and Information Systems (60)
- Numerical Analysis and Scientific Computing (26)
- Social and Behavioral Sciences (19)
- Systems Architecture (19)
- Software Engineering (18)
-
- Mathematics (17)
- Logic and Foundations (15)
- Engineering (14)
- Business (12)
- Computer Engineering (11)
- Theory and Algorithms (11)
- Communication (9)
- Medicine and Health Sciences (7)
- Statistics and Probability (7)
- Artificial Intelligence and Robotics (6)
- Data Science (6)
- Education (6)
- Social Media (6)
- Data Storage Systems (5)
- Information Security (5)
- Life Sciences (5)
- Management Information Systems (5)
- Other Computer Sciences (4)
- Bioinformatics (3)
- Communication Technology and New Media (3)
- Educational Assessment, Evaluation, and Research (3)
- Electrical and Computer Engineering (3)
- Health Information Technology (3)
- Institution
-
- Singapore Management University (54)
- Portland State University (26)
- Zayed University (9)
- Technological University Dublin (6)
- University of Nebraska - Lincoln (6)
-
- Edith Cowan University (4)
- Kennesaw State University (4)
- Western Kentucky University (4)
- Air Force Institute of Technology (3)
- Central Washington University (3)
- Claremont Colleges (3)
- Old Dominion University (3)
- San Jose State University (3)
- University of Nebraska at Omaha (3)
- Chapman University (2)
- MBZUAI (2)
- Syracuse University (2)
- University of Texas Rio Grande Valley (2)
- Utah State University (2)
- City University of New York (CUNY) (1)
- College of Saint Benedict and Saint John's University (1)
- Dartmouth College (1)
- Embry-Riddle Aeronautical University (1)
- Georgia State University (1)
- Kutztown University (1)
- Marquette University (1)
- Munster Technological University (1)
- Smith College (1)
- The University of Southern Mississippi (1)
- University of Nevada, Las Vegas (1)
- Publication Year
- Publication
-
- Research Collection School Of Computing and Information Systems (54)
- Systems Science Faculty Publications and Presentations (23)
- All Works (9)
- Computer Science Faculty Publications and Presentations (6)
- Faculty Publications (5)
-
- Faculty and Research Publications (4)
- All Faculty Scholarship for the College of the Sciences (3)
- CGU Faculty Publications and Research (3)
- Computer Science Faculty Publications (3)
- Conference papers (3)
- Masters Theses & Specialist Projects (3)
- CSE Conference and Workshop Papers (2)
- Dissertations (2)
- Electrical Engineering and Computer Science - All Scholarship (2)
- Faculty Publications, Computer Science (2)
- Instructional Technology and Learning Sciences Faculty Publications (2)
- School of Computing: Faculty Publications (2)
- All College Thesis Program, 2016-2019 (1)
- Australian Information Security Management Conference (1)
- Business Faculty Articles and Research (1)
- Center for Coastal and Ocean Mapping (1)
- Computer Science Summer Fellows (1)
- Computer Science and Information Technology Faculty (1)
- Computer Science: Faculty Publications (1)
- Computer Vision Faculty Publications (1)
- Dartmouth Scholarship (1)
- Department of Computer Science Publications (1)
- Department of Computer Science and Engineering: Dissertations, Theses, and Student Research (1)
- Doctoral (1)
- EBCS Articles (1)
Articles 151 - 157 of 157
Full-Text Articles in Computer Sciences
Using Reconstructability Analysis To Select Input Variables For Artificial Neural Networks, Stephen Shervais, Martin Zwick
Using Reconstructability Analysis To Select Input Variables For Artificial Neural Networks, Stephen Shervais, Martin Zwick
Systems Science Faculty Publications and Presentations
We demonstrate the use of Reconstructability Analysis to reduce the number of input variables for a neural network. Using the heart disease dataset we reduce the number of independent variables from 13 to two, while providing results that are statistically indistinguishable from those of NNs using the full variable set. We also demonstrate that rule lookup tables obtained directly from the data for the RA models are almost as effective as NNs trained on model variables.
Genescene: Biomedical Text And Data Mining, Gondy Leroy, Hsinchun Chen, Jesse D. Martinez, Shauna Eggers, Ryan R. Falsey, Kerri L. Kislin, Zan Huang, Jiexun Li, Jie Xu, Daniel M. Mcdonald, Gavin Ng
Genescene: Biomedical Text And Data Mining, Gondy Leroy, Hsinchun Chen, Jesse D. Martinez, Shauna Eggers, Ryan R. Falsey, Kerri L. Kislin, Zan Huang, Jiexun Li, Jie Xu, Daniel M. Mcdonald, Gavin Ng
CGU Faculty Publications and Research
To access the content of digital texts efficiently, it is necessary to provide more sophisticated access than keyword based searching. GeneScene provides biomedical researchers with research findings and background relations automatically extracted from text and experimental data. These provide a more detailed overview of the information available. The extracted relations were evaluated by qualified researchers and are precise. A qualitative ongoing evaluation of the current online interface indicates that this method to search the literature is more useful and efficient than keyword based searching.
A Pseudo Nearest-Neighbor Approach For Missing Data Recovery On Gaussian Random Data Sets, Xiaolu Huang, Qiuming Zhu
A Pseudo Nearest-Neighbor Approach For Missing Data Recovery On Gaussian Random Data Sets, Xiaolu Huang, Qiuming Zhu
Computer Science Faculty Publications
Missing data handling is an important preparation step for most data discrimination or mining tasks. Inappropriate treatment of missing data may cause large errors or false results. In this paper, we study the effect of a missing data recovery method, namely the pseudo- nearest neighbor substitution approach, on Gaussian distributed data sets that represent typical cases in data discrimination and data mining applications. The error rate of the proposed recovery method is evaluated by comparing the clustering results of the recovered data sets to the clustering results obtained on the originally complete data sets. The results are also compared with …
An Iterative Initial-Points Refinement Algorithm For Categorical Data Clustering, Ying Sun, Qiuming Zhu, Zhengxin Chen
An Iterative Initial-Points Refinement Algorithm For Categorical Data Clustering, Ying Sun, Qiuming Zhu, Zhengxin Chen
Computer Science Faculty Publications
The original k-means clustering algorithm is designed to work primarily on numeric data sets. This prohibits the algorithm from being directly applied to categorical data clustering in many data mining applications. The k-modes algorithm [Z. Huang, Clustering large data sets with mixed numeric and categorical value, in: Proceedings of the First Pacific Asia Knowledge Discovery and Data Mining Conference. World Scientific, Singapore, 1997, pp. 21–34] extended the k-means paradigm to cluster categorical data by using a frequency-based method to update the cluster modes versus the k-means fashion of minimizing a numerically valued cost. However, as is …
Studying The Functional Genomics Of Stress Responses In Loblolly Pine With The Expresso Microarray Experiment Management System, Lenwood S. Heath, Naren Ramakrishnan, Ronald R. Sederoff, Ross W. Whetten, Boris I. Chevone, Craig Struble, Vincent Y. Jouenne, Dawei Chen, Leonel Van Zyl, Ruth Grene
Studying The Functional Genomics Of Stress Responses In Loblolly Pine With The Expresso Microarray Experiment Management System, Lenwood S. Heath, Naren Ramakrishnan, Ronald R. Sederoff, Ross W. Whetten, Boris I. Chevone, Craig Struble, Vincent Y. Jouenne, Dawei Chen, Leonel Van Zyl, Ruth Grene
Mathematics, Statistics and Computer Science Faculty Research and Publications
Conception, design, and implementation of cDNA microarray experiments present a variety of bioinformatics challenges for biologists and computational scientists. The multiple stages of data acquisition and analysis have motivated the design of Expresso, a system for microarray experiment management. Salient aspects of Expresso include support for clone replication and randomized placement; automatic gridding, extraction of expression data from each spot, and quality monitoring; flexible methods of combining data from individual spots into information about clones and functional categories; and the use of inductive logic programming for higher-level data analysis and mining. The development of Expresso is occurring in parallel with …
Predictive Self-Organizing Networks For Text Categorization, Ah-Hwee Tan
Predictive Self-Organizing Networks For Text Categorization, Ah-Hwee Tan
Research Collection School Of Computing and Information Systems
This paper introduces a class of predictive self-organizing neural networks known as Adaptive Resonance Associative Map (ARAM) for classification of free-text documents. Whereas most sta- tistical approaches to text categorization derive classification knowledge based on training examples alone, ARAM performs supervised learn- ing and integrates user-defined classification knowledge in the form of IF-THEN rules. Through our experiments on the Reuters-21578 news database, we showed that ARAM performed reasonably well in mining categorization knowledge from sparse and high dimensional document feature space. In addition, ARAM predictive accuracy and learning efficiency can be improved by incorporating a set of rules derived from …
Clouds: A Decision Tree Classifier For Large Datasets, Khaled Alsabti, Sanjay Ranka, Vineet Singh
Clouds: A Decision Tree Classifier For Large Datasets, Khaled Alsabti, Sanjay Ranka, Vineet Singh
Electrical Engineering and Computer Science - All Scholarship
Classification for very large datasets has many practical applications in data mining. Techniques such as discretization and dataset sampling can be used to scale up decision tree classifiers to large datasets. Unfortunately, both of these techniques can cause a significant loss in accuracy. We present a novel decision tree classifier called CLOUDS, which samples the splitting points for numeric attributes followed by an estimation step to narrow the search space of the best split. CLOUDS reduces computation and I/O complexity substantially compared to state of the art classifiers, while maintaining the quality of the generated trees in terms of accuracy …