Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences

Missouri University of Science and Technology

Data Warehouses

Publication Year

Articles 1 - 7 of 7

Full-Text Articles in Physical Sciences and Mathematics

Detecting And Representing Relevant Web Deltas In Whoweda, Sanjay Kumar Madria, Wee Keong Ng, Sourav S. Bhowmick Jan 2003

Detecting And Representing Relevant Web Deltas In Whoweda, Sanjay Kumar Madria, Wee Keong Ng, Sourav S. Bhowmick

Computer Science Faculty Research & Creative Works

In this paper, we present a mechanism for detecting and representing changes, given the old and new versions of a set of interlinked Web documents, retrieved in response to a user''s query. In particular, we show how to detect and represent Web deltas, i.e., changes in the Web documents that are relevant to a user''s query in the context of our Web warehousing system called WHOWEDA (Warehouse of Web Data). In WHOWEDA, Web information is materialized views stored in Web tables in the form of Web tuples. These Web tuples, represented as directed graphs, can be manipulated using a set …


Controlling Web Query Execution In A Web Warehouse, Sanjay Kumar Madria, Sourav S. Bhowmick Jan 2002

Controlling Web Query Execution In A Web Warehouse, Sanjay Kumar Madria, Sourav S. Bhowmick

Computer Science Faculty Research & Creative Works

Most of the contemporary Web query systems have limited capabilities in controlling Web query execution. Such query facility is important as it gives us an opportunity to optimize the evaluation of a Web query. We address this issue in the context of our Web warehousing system called WHOWEDA (Warehouse Of Web Data). Specifically, we investigate different types of constraints (related to query execution) which may be imposed on a Web query such as number of query results, time of execution, restrict the evaluation of a query to specified set of Web sites, etc. An important feature of our approach is …


Reducing Cognitive Overheads In A Web Warehouse Using Reverse-Osmosis, Sanjay Kumar Madria, Wee Keong Ng, Ee-Peng Lim, Sourav S. Bhowmick Jan 2000

Reducing Cognitive Overheads In A Web Warehouse Using Reverse-Osmosis, Sanjay Kumar Madria, Wee Keong Ng, Ee-Peng Lim, Sourav S. Bhowmick

Computer Science Faculty Research & Creative Works

This paper provides a quantitative analysis of reducing cognitive overheads in a Web warehouse using an important class of operation called reverse osmosis. The analysis is used to examine two different cognitive overheads of locating relevant nodes or information and display time of a Web table. A reverse-osmosis operation enables us to eliminate in relevant information from a collection of Web documents stored in the form of a Web table. We call such an operation reverse-osmosis because it is analogous to the reverse osmosis process in the field of water purification. We discuss a formal algorithm of the reverse-osmosis operation


Association Rules For Web Data Mining In Whoweda, Sanjay Kumar Madria, C. Raymond, M. Mohania, Sourav S. Bhowmick Jan 2000

Association Rules For Web Data Mining In Whoweda, Sanjay Kumar Madria, C. Raymond, M. Mohania, Sourav S. Bhowmick

Computer Science Faculty Research & Creative Works

The authors discuss association rules which can be discovered from Web data. The association rules are discussed within the scope of our WHOWEDA (warehouse of Web data) project. WHOWEDA is supported by a Web data model and a set of algebraic operators. The Web data model allows a uniform and integrated view of Web data gathered using a user''s query graph. A user''s query graph describes the query by example (what the user perceives as the query) and the Web coupling query gathers instances of such a query graph from the Web and stores them in the form of subgraphs …


Detecting And Representing Relevant Web Deltas Using Web Join, Sanjay Kumar Madria, Wee Keong, Ee-Peng Lim, Sourav S. Bhowmick Jan 2000

Detecting And Representing Relevant Web Deltas Using Web Join, Sanjay Kumar Madria, Wee Keong, Ee-Peng Lim, Sourav S. Bhowmick

Computer Science Faculty Research & Creative Works

We show how to detect and represent Web deltas, i.e., changes in Web information, that are relevant to a user's query in the context of our Web warehousing system called WHOWEDA (Warehouse of Web Data). In WHOWEDA, Web information are materialized views stored in Web tables and can be manipulated and analyzed using a set of Web algebraic operators. We present a mechanism to detect relevant Web deltas using Web join and outer Web join. We show how to represent these changes using delta Web tables.


Cost-Benefit Analysis Of Web Bag In A Web Warehouse, Sanjay Kumar Madria, Wee Keong Ng, Ee-Peng Lim, Sourav S. Bhowmick Jan 1999

Cost-Benefit Analysis Of Web Bag In A Web Warehouse, Sanjay Kumar Madria, Wee Keong Ng, Ee-Peng Lim, Sourav S. Bhowmick

Computer Science Faculty Research & Creative Works

Sets and bags are closely related structures and have been studied in relational databases. A bag is different from a set in that it is sensitive to the number of times an element occurs, while a set is not. In this paper, we introduce the concept of a Web bag in the context of a World Wide Web warehouse called WHOWEDA (WareHouse Of WEb DAta) which we are currently building. Informally, a Web bag is a Web table which allows multiple occurrences of identical Web types. A Web bag helps one to discover useful knowledge from a Web table, such …


Pi-Web Join In A Web Warehouse, Sanjay Kumar Madria, Wee Keong Ng, Ee-Peng Lim, Sourav S. Bhowmick Jan 1999

Pi-Web Join In A Web Warehouse, Sanjay Kumar Madria, Wee Keong Ng, Ee-Peng Lim, Sourav S. Bhowmick

Computer Science Faculty Research & Creative Works

With the enormous amount of data stored in the World Wide Web, it is increasingly important to design and develop powerful web warehousing tools. The key objective of our web warehousing project, called WHOWEDA (Warehouse of Web Data), is to design and implement a web warehouse that materializes and manages useful information from the web. We introduce the concept of Π-web join in the context of WHOWEDA. Pi-web join operator is a web information manipulation operator to combine relevant web information residing in two web tables. Informally, it is the combination of web join and web project operators which filter …