Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Full-Text Articles in Physical Sciences and Mathematics
Knowledge Management For Texpros, Jianshun Hu
Dissertations
Most document processing systems today apply AI technologies to support intelligent system behavior. In applying AI technologies to such systems, the core problem is how to represent and manage different kinds of knowledge to support the functions of their inference engine components. In other words, knowledge management has become a critical issue in document processing systems. In this dissertation, within the scope of the TEXt PROcessing System (TEXPROS), we identify knowledge of various kinds that is applicable in the system. We investigate several problems of managing this knowledge and then develop a knowledge base for …
The Intelligent Browser For Texpros, Chih-Ying Wang
Dissertations
Browsing is a technique that helps users formulate queries and retrieve information in an information retrieval system. It lets users understand their information needs and gain knowledge of the system in the course of browsing, and thus eases their burden when issuing queries. The basic components of a browser are an underlying structure that users can navigate and a browsing process controller that provides users with the needed assistance during each browsing session.
In this dissertation, a new infrastructure (OP-Net), transformed from the existing object network, is proposed. Each object in the …
Knowledge-Based Document Filing For Texpros, Xien Fan
Dissertations
This dissertation presents a knowledge-based document filing system for TEXPROS. The requirements of a personal document processing system are investigated. So that the system can be used in various application domains, a flexible, dynamic modeling approach is employed that involves the user in document modeling. Office documents are described using a dual model, which consists of a document type hierarchy and a folder organization. The document type hierarchy is used to capture the layout, logical and conceptual structures of documents. The folder organization, which is defined by the user, emulates the real world structure for organizing and storing …
Knowledge Discovering For Document Classification Using Tree Matching In Texpros, Ching-Song Wei
Dissertations
This dissertation describes a knowledge-based system for classifying documents based upon the layout structure and conceptual information extracted from the content of the document. The spatial elements in a document are laid out in rectangular blocks, which are represented by nodes in an ordered labelled tree called the "layout structure tree" (L-S Tree). Each leaf node of an L-S Tree points to its corresponding block content. A Knowledge Acquisition Tool (KAT) is devised to create a Document Sample Tree from an L-S Tree, in which each leaf contains a node content conceptually describing its corresponding block content. Then, applying …
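The ordered labelled tree described in this abstract can be pictured with a small sketch. This is only an illustration under stated assumptions, not the dissertation's implementation: the names `LSNode`, `bbox`, and `leaf_contents`, the bounding-box convention, and the sample memo are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class LSNode:
    """One rectangular block in a hypothetical layout structure tree."""
    label: str                                   # block label (assumed naming)
    bbox: Tuple[int, int, int, int]              # (left, top, right, bottom), assumed convention
    content: Optional[str] = None                # block content, set on leaf nodes only
    children: List["LSNode"] = field(default_factory=list)  # child order is significant

    def is_leaf(self) -> bool:
        return not self.children


def leaf_contents(node: LSNode) -> List[str]:
    """Collect the block contents of all leaves, preserving child order."""
    if node.is_leaf():
        return [node.content] if node.content is not None else []
    collected: List[str] = []
    for child in node.children:
        collected.extend(leaf_contents(child))
    return collected


# A made-up memo page: interior nodes group blocks, leaves carry content.
page = LSNode("page", (0, 0, 100, 100), children=[
    LSNode("header", (0, 0, 100, 20), content="MEMORANDUM"),
    LSNode("body", (0, 20, 100, 100), children=[
        LSNode("to", (0, 20, 100, 30), content="To: Faculty"),
        LSNode("text", (0, 30, 100, 100), content="Meeting at noon."),
    ]),
])
```

Calling `leaf_contents(page)` walks the tree depth-first and returns the leaf block contents in order, which is the kind of traversal a tree-matching classifier would start from.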
Automatic Office Document Classification And Information Extraction, Xiaolong Hao
Dissertations
TEXPROS (TEXt PROcessing System) is a document processing system (DPS) that supports and assists office workers in their daily work with information and document management. In this thesis, document classification and information extraction, which are two of the major functional capabilities of TEXPROS, are investigated.
Based on the nature of its content, a document is divided into structured and unstructured (i.e., free-text) parts. The conceptual and content structures are introduced to capture the semantics of the structured and unstructured parts of the document, respectively. The document is classified and information is extracted based on the analyses …
An Office Document Retrieval System With The Capability Of Processing Incomplete And Vague Queries, Qianhong Liu
Dissertations
TEXPROS (TEXt PROcessing System) is an intelligent document processing system. The system is a combination of filing and retrieval systems, which supports storing, classifying, categorizing, retrieving and reproducing documents, as well as extracting, browsing, retrieving and synthesizing information from a variety of documents. This dissertation presents a retrieval system for TEXPROS, which is capable of processing incomplete or vague queries and providing semantically meaningful responses to the users. The design of the retrieval system is highly integrated with various mechanisms for achieving these goals. First, a system catalog including a thesaurus is used to store the knowledge about the database. …