In this book, you will find discussions on the newest native XML databases, along with information on working with XML-enabled relational database systems. In addition, XML Data Management thoroughly examines benchmarks and analysis techniques for performance of XML databases. This book is best used by students that are knowledgeable in database technology and are familiar with XML.
Table of Contents
I. WHAT IS XML? 1. Information Modeling with XML.
XML as an Information Domain.
How XML Expresses Information.
Patterns in XML.
Common XML Information-Modeling Pitfalls.
Attributes Used as Data Elements.
Data Elements Used as Metadata.
Inadequate Use of Tags.
A Very Simple Way to Design XML.
II. NATIVE XML DATABASES. 2. TaminoSoftware AG's Native XML Server.
Tamino Architecture and APIs.
Collections and Doctypes.
Access to Other DatabasesTamino X-Node.
Mapping Data to FunctionsTamino X-Tension.
Organization on Disk.
Query LanguageTamino X-Query.
Sessions and Transactions.
Handling of Results.
Full Database Functionality.
Conclusion.3. eXist Native XML Database.
Schema-less XML Data Store.
Index-Based Query Processing.
Extensions for Full-Text Searching.
System Architecture Overview.
Pluggable Storage Backends.
Query Language Extensions.
Specifying the Input Document Set.
Programming Java Applications with the XML:DB API.
Accessing eXist with SOAP.
Integration with Cocoon.
Approaches to Query Execution.
Index and Storage Implementation.
Query Language Processing.
Conclusion.4. Embedded XML Databases.
A Primer on Embedded Databases.
Embedded XML Databases.
Building Applications for Embedded XML Databases.
Overview of Berkeley DB XML.
Indexing and Index Types.
XPath Query Processing.
Programming for Transactions.
Two-Phase Locking and Deadlocks.
Recovery Processing after Failures.
III. XML AND RELATIONAL DATABASES. 5. IBM XML-Enabled Data Management Product Architecture and Technology.
Product and Technology Offering Summaries.
DB2 Universal Database.
Information Integration Technology.
Current Architecture and Technology.
Shared Architecture and Technology.
XML Extender Architecture.
XML Extender Technology.
Using Both XML Collections and XML Columns.
Transforming XML Data.
Searching, Parsing, and Validating XML Data.
XML Extender Federated Support.
SQL XML Support Architecture.
SQL XML Support Technology.
Data Management Web Services Architecture.
Data Management Web Services Technology.
Information Integration-Specific Architecture and Technology.
Future Architecture and Technology.
Application Interface, Data Type, and API Goals.
Storage, Engine, and Data Manager Goals.
Why Support Both XML and Relational Storage in One System?
Why Not Object-Relational Long Term?
Impacted Technology Areas.
Notices.6. Supporting XML in Oracle9i.
Storing XML as CLOB.
Using CLOB and the OracleText Cartridge.
Search Predicates in OracleText.
Object Type XMLType.
Processing of XMLType in Java.
Using XSU for Fine-Grained Storage.
Building XML Documents from Relational Data.
SQL Functions existsNode and extract.
The SQL Function SYS_XMLGen.
The SQL Function SYS_XMLAgg.
PL/SQL Package DBMS_XMLGen.
Web Access to the Database.
The Principle of XSQL.
Posting XML Data into the Database.
Special Oracle Features.
Special Java Beans.
Conclusion.7. XML Support in Microsoft SQL Server 2000 165
XML and Relational Data.
XML Access to SQL Server.
Access via HTTP.
Using the XML Features through SQLOLEDB, ADO, and .NET.
Serializing SQL Query Results into XML.
The Raw Mode.
The Auto and Nested Modes.
The Explicit Mode.
Providing Relational Views over XML.
Providing XML Views over Relational Data.
Querying Using XPath.
Updating Using Updategrams.
Conclusion.8. A Generic Architecture for Storing XML Documents in a Relational Database.
The Data Model.
DOM Storage in Relational Databases.
The Nested Sets Model.
Creating the Database.
The Physical Data Model.
Creating User-Defined Data Types.
Creating the Tables.
Serializing a Document out of the Repository.
Building an XML Document Manually.
Connecting to the Repository.
The xmlrepDB Class.
Uploading XML Documents.
The xmlrepSAX Class.
Stored Procedures for Data Entry.
The uploadXML Class.
The extractXML Class.
Querying the Repository.
Ad Hoc SQL Queries.
Searching for Text.
Some More Stored Procedures.
Generating XPath Expressions.
Conclusion.9. An Object-Relational Approach to Building a High-Performance XML Repository.
Overview of XML Use-Case Scenario.
High-Level System Architecture.
Detailed Design Descriptions.
IV. APPLICATIONS OF XML. 10. Knowledge Management in Bioinformatics.
A Brief Molecular Biology Background.
Life Sciences Are Turning to XML to Model Their Information.
A Genetic Information Model.
Integration of BLAST into NeoCore XMS.
Sequence Search Types.
Conclusion.11. Case Studies of XML Used with IBM DB2 Universal Database.
Case Study 1: “Our Most Valued Customers Come First”.
How This Business Problem Is Addressed.
Case Study 2: “Improve Cash Flow”.
How This Business Problem Is Addressed.
Notices.12. The Design and Implementation of an Engineering Data Management System Using XML and J2EE.
Background and Requirements.
Image Query Service.
Using XML in OAI.
Conversion of XML Input into Objects.
Conversion of Database Data into XML.
Conversion of Image Data into XML.
Mass Transfer Capability.
Conclusion.13. Geographical Data Interchange Using XML-Enabled Technology within the GIDB System.
GIDB METOC Data Integration.
GIDB Web Map Service Implementation.
GIDB GML Import and Export.
Conclusion.14. Space Wide Web by Adapters in Distributed Systems Configuration from Reusable Components.
Advanced Concept Description: The Research Problem.
Future Supporting Communications Satellites Constellations.
Integration of Components with Architecture.
Future Generation NASA Institute for Advanced Concepts, Space Wide Web Research, and Boundaries.
Advanced Concept Development.
The Research Approach.
The Research Tasks.
Conclusion.15. XML as a Unifying Framework for Inductive Databases.
Extracting and Evaluating Association Rules.
The Proposed Data Model: XDM.
Classification with XDM.
Association Rules with XDM.
Benefits of XDM.
Toward Flexible and Open Systems.
Conclusion.16. Designing and Managing an XML Warehouse.
Why a View Mechanism for XML?
Data Warehouse Specification.
View Model for XML Documents.
Graphic Tool for Data Warehouse Specification.
Managing the Metadata.
Mediated Schema Definition.
Storage and Management of the Data Warehouse.
The Different Approaches to Storing XML Data.
Mapping XML to Relational.
Extraction of Data.
DAWAX: A Graphic Tool for the Specification and Management of a Data Warehouse.
Data Warehouse Manager.
The Different DAWAX Packages.
Query Languages for XML.
Storing XML Data.
Systems for XML Data Integration.
V. PERFORMANCE AND BENCHMARKS. 17. XML Management System Benchmarks.
Benchmark Data Set.
Existing Benchmarks for XML.
The XOO7 Benchmark.
The XMach-1 Benchmark.
The XMark Benchmark.
Conclusion.18. The Michigan Benchmark: A Micro-Benchmark for XML Query Performance Diagnostics.
Benchmark Data Set.
A Discussion of the Data Characteristics.
Schema of Benchmark Data.
Generating the String Attributes and Element Content.
Using the Benchmark.
Conclusion.19. A Comparison of Database Approaches for Storing XML Documents.
Data Models for XML Documents.
The Nontyped DOM Implementation.
The Typed DOM Implementation.
Databases for Storing XML Documents.
Native XML Databases.
Benchmarking a Relational Database.
Benchmarking an Object-Oriented Database.
Benchmarking a Directory Server.
Benchmarking a Native XML Database.
Evaluation of Performance.
Evaluation of Space.
Studies in Storing and Retrieving XML Documents.
XML and Relational Databases
XML and Object-Relational Databases.
XML and Object-Oriented Databases.
XML and Directory Servers.
Benchmarks for XML Databases.
Guidelines for Benchmarking XML Databases.
Summary.20. Performance Analysis between an XML-Enabled Database and a Native XML Database.
SQL Operations (Single Record).
SQL Operations (Mass Records).
Conclusion.21. Conclusion. References. Contributors. Editors.
Chapter 1: Information Modeling with XML.
Chapter 2: TaminoSoftware AG's Native XML Server.
Chapter 3: eXist Native XML Database.
Chapter 4: Embedded XML Databases.
Chapter 5: IBM XML-Enabled Data Management Product Architecture and Technology.
Chapter 6: Supporting XML in Oracle9i.
Chapter 7: XML Support in Microsoft SQL Server 2000.
Chapter 8: A Generic Architecture for Storing XML Documents in a Relational Database.
Chapter 9: An Object-Relational Approach to Building a High-Performance XML Repository.
Chapter 10: Knowledge Management in Bioinformatics.
Chapter 11: Case Studies of XML Used with IBM DB2 Universal Database.
Chapter 12: The Design and Implementation of an Engineering Data Management System Using XML and J2EE.
Chapter 13: Geographical Data Interchange Using XML-Enabled Technology within the GIDB System.
Chapter 14: Space Wide Web by Adapters in Distributed Systems Configuration from Reusable Components.
Chapter 15: XML as a Unifying Framework for Inductive Databases.
Chapter 16: Designing and Managing an XML Warehouse.
Chapter 17: XML Management System Benchmarks.
Chapter 18: The Michigan Benchmark: A Micro-Benchmark for XML Query Performance Diagnostics.
Chapter 19: A Comparison of Database Approaches for Storing XML Documents.
Chapter 20: Performance Analysis between an XML-Enabled Database and a Native XML Database.Index. 0201844524T02182003
"This is an excellent book that combines a practical and analytical look at the subject."
-Leo Korman, Principal Software Engineer, KANA Software
As organizations begin to employ XML within their information-management and exchange strategies, data management issues pertaining to storage, retrieval, querying, indexing, and manipulation increasingly arise. Moreover, new information-modeling challenges also appear. XML Data Management-with its contributions from experts at the forefront of the XML field-addresses these key issues and challenges, offering insights into the advantages and drawbacks of various XML solutions, best practices for modeling information with XML, and developing custom, in-house solutions.
In this book, you will find discussions on the newest native XML databases, along with information on working with XML-enabled relational database systems. In addition, XML Data Management thoroughly examines benchmarks and analysis techniques for performance of XML databases.
Topics covered include:The power of good grammar and style in modeling information to alleviate the need for redundant domain knowledge Tamino's XML storage, indexing, querying, and data access features The features and APIs of open source eXist Berkeley DB XML's ability to store XML documents natively IBM's DB2 Universal Database and its support for XML applications Xperanto's method of addressing information integration requirements Oracle's XMLType for managing document centric XML documents Microsoft SQL Server 2000's support for exporting and importing XML data A generic architecture for storing XML documents in a relational database X007, XMach-1, XMark, and other benchmarks for evaluating XML database performance
Numerous case studies demonstrate real-world problems, industry-tested solutions, and creative applications of XML data management solutions.
Written for both XML and relational database professionals, XML Data Management provides a promising new approach to data management, one that is sure to positively impact the way organizations manage and exchange information.
Akmal B. Chaudhri works for IBM developerWorks, where he is also Zone Editor for Special Projects. A recognized authority on objects and databases, he has been a regular presenter at many international conferences, including OOPSLA and Object World. In addition, he has edited several books on these topics.
Awais Rashid is a Lecturer in the Computing Department of Lancaster University in the U.K. where he leads research into the application of new technologies, such as XML and aspect-oriented programming, and database systems. He has actively published on these topics and has organized a number of relevant international events.
Roberto Zicari is a full Professor for Databases and Information Systems at the Johann Wolfgang Goethe University in Frankfurt/Main, Germany. He is an internationally recognized expert in Object Technology. He has consulted and lectured in Europe, North America, and Japan.