| | Presentation
Using Data Structure Standards to Foster Efficiency and Opportunity
By Steve Hoenisch
Last updated on Feb. 16, 2005, in New York City
Copyright 1996-2005
www.Criticism.Com
| View as a Slide Show
Table of Contents
1 Outline
2 Using XML for Data Structure Standards
2.1 Developing and Implementing a Standard
2.2 Metadata
2.3 Taxonomies
2.4 Existing Metadata Standards
3 Overview: Fostering Efficiency
3.1 Automated Processing
3.2 Streamlining Work Flow and Sharing Data
3.3 Reusing Content
3.4 Eliminating Redundancy
3.5 Storage, Search, Retrieval
3.6 Report Capabilities
3.7 Software Consolidation
4 Overview: Realizing New Business Opportunities
4.1 Syndication and Redistribution
4.2 Targeted Advertising
4.3 Personalized, Aggregated, and Segmented Content
4.4 Developing Niche Products
4.5 Tapping New Outlets and Launching New Products
4.6 Translating, Localizing, and Internationalizing Content
4.7 Single Sourcing
5 Thriving on the Business Web
6 About this Document
This presentation describes how organizations -- particularly businesses focused on
creating and distributing content -- can use data structure standardization to realize
operating efficiencies and new business opportunities.
1 Outline
-
Developing a Standard: XML-based data structures, metadata, and taxonomies
-
Cultivating Efficiency: Overview followed by a discussion of how organizations can use standardization
to foster efficiency
-
Creating Opportunity: Overview followed by a discussion of how organizations can use standardization
to realize new business opportunities
-
The Business Web: Closing remarks about preparing for the future now.
2 Using XML for Data Structure Standards
![](icons/xml.jpg) XML is the language of choice for standardizing and enforcing data structures.
![](icons/hier.gif) Its hierarchical and extensible nature makes it well suited for formally structuring data.
![](icons/site_map.gif) Its extensibility enables it to include rich sets of metadata -- an aspect of standardization that helps make businesses more efficient.
![](icons/interact.gif) XML enables interoperability and expedites data exchange over the Internet -- as evident with the explosion of XML-based data exchange languages like FPML -- which is a key to realizing new business opportunities, especially in the era of the "business web."
2.1 Developing and Implementing a Standard
![](icons/xml.jpg) A data structure standard can be implemented in XML with a Document Type Definition or a Schema.
![](icons/xml.jpg) Depending on its unique needs, a business can use an existing DTD or develop its own.
![](icons/xml.jpg) Using an existing standard, such as NewsML, may make it easier to distribute data to other businesses.
![](icons/xml.jpg) To develop a standard, an organization identifies and defines the structural and semantic elements and attributes of its data, and fuses the resulting definition with mechanisms for specifying metadata at multiple levels. The standard strives to separate content from presentation.
![](icons/xml.jpg) Content is validated against the DTD to ensure conformity with the standard.
2.2 Metadata
As we will see, a standard that includes a rich set of metadata mechanisms increases the potential for efficiency and opportunity.
An organization has several options for extending its standard to capture metadata:
The approach depends on the current and future needs of the business and the characteristics of its data.
Future needs may be indeterminable: It may prove productive later to use multiple approaches now.
2.3 Taxonomies
![](icons/tax.gif) Businesses can use taxonomies to organize content into logical groupings that bring efficiencies in managing, navigating, and locating data.
XML is well suited to formulating hierarchical arrangements that structure information within a specific body of knowledge.
2.4 Existing Metadata Standards
Here are a few of the existing metadata standards:
-
RDF: The Resource Description Framework (RDF), an
architecture for web metadata designed to support the reuse and exchange of vocabularies as a layer in XML documents. The W3C sees it as a key driver of the Semantic Web.
-
Dublin Core Metadata Initiative: It is developing interoperable metadata standards for describing resources
that enable intelligent information discovery systems while supporting a broad range of purposes and business models.
-
Prism: The Publishing Requirements for Industry Standard Metadata. Defined in RDF using the Dublin Core vocabulary, it is a
publishing-specific XML metadata standard for directing the processing of content. It allows content and related resources to be
formally described by providing standardized properties, controlled vocabularies, and extensibility.
-
OWL: The Web Ontology Language is designed for use by applications that need to process the content of information instead of just presenting information to humans, facilitating greater machine
interpretability of web content.
-
XML Topic Maps: XTM provides a model and grammar for representing the structure of information
resources used to define topics, and the associations (relationships) between topics.
3 Overview: Fostering Efficiency
Data structure standards, including mechanisms for specifying metadata, enable organizations to
foster efficiency by
- Automating the processing of content
- Improving work flow
- Reusing content
- Reducing redundancy and repetition
- Improving searches, storage, archives, retrieval
- Automating the gathering and reporting of statistics and other data
- Consolidating the use of software
3.1 Automated Processing
Data structure standards enable businesses to automate the processing of content,
making them more efficient while reducing the costs associated with manual work.
The possibilities are virtually endless. For instance, publishers can do the
following:
-
Trigger the application of stylesheets to transform structured
content into formats suitable for different web sites, different
publications, or different devices, such as mobile phones; stylesheets can also be applied to transform the content into an XML dialect that can be shared with business partners.
For instance, a business could transform XML content from an in-house dialect into NewsML for distribution to external media partners.
-
Automate the distribution of content to a variety of channels
-
Automate the distribution of content to analysts for review and
refinement
-
Assemble content dynamically with queries based on metadata or other
aspects of the standard
-
Automatically generate tables of contents, indexes, site maps, link
libraries, topic maps, and other navigational aids
3.2 Streamlining Work Flow and Sharing Data
The automation that comes with standardization, along with a content management system, can make an organization's
work flow more efficient:
-
Metadata can be used to automatically route data through a given work
flow, not only internally but also with external business partners
-
Example: Editorial production at a newspaper, with syndication
![](icons/share.gif) On a general note, sharing knowledge within an organization requires cooperation.
Standardization can, if executed with the appropriate metadata, provide
a formal model for cooperative sharing.
3.3 Reusing Content
Standards that separate content from presentation empower
organizations to reuse and repurpose content:
With standardization, documents can include references to other documents or segments of them,
making the business more efficient by enabling it to reuse existing resources.
Documents can be assembled automatically with existing content, in response to user requests.
Standardization can also radically increase efficiency by enabling an organization to publish the same content in multiple formats.
Standardization also enables organizations to automatically include related sidebar content.
All of which reduces the cost of creating information products.
3.4 Eliminating Redundancy
Standardization helps eliminate redundancy and repetition of data:
- Data is created once and then reused by reference, rather than recreated or, for instance, cut and pasted.
- Repetition of manual work both in structuring data and the work flow is reduced or eliminated.
3.5 Storage, Search, Retrieval
Standardization can revolutionize the way organizations store data, bringing massive efficiencies in searching and retrieval.
- Semantic-based structures provide meaningful hooks for searching stored data.
- The appropriate use of structured metadata makes search and retrieval more efficient and effective: Users can more quickly find exactly the data they seek.
- Taxonomies or ontologies that express concepts common across an organization's domains result in well-defined mappings
between domain-specific knowledge representations, producing efficiencies in cross-domain searching, data mining, and knowledge extraction.
3.6 Report Capabilities
Along with a content management system, standardization and metadata provide vehicles for
- Tracking the status of documents and data
- Compiling statistics about what content was delivered to whom when.
The resulting reports help organizations become more efficient by focusing on productive
avenues of information usage and improving access to important streams.
3.7 Software Consolidation
Standardization eases software costs and yield efficiencies by consolidating the tools needed to create content.
All authors, for instance, could use an XML-based tool like Epic Editor or XMetal.
The costs associated with training users in different software are reduced.
4 Overview: Realizing New Business Opportunities
![](icons/profit.gif) Data structure standards and the interoperability they yield bring new business opportunities by
enabling organizations to
- Syndicate and redistribute content, both internally and externally
- Contextualize and focus advertising based on content (both in print and
online)
- Personalize content
- Aggregate diverse content based on user profiles for presentation to the
user
- Segment related content into customized products
- Develop and refine niche products
- Tap new outlets and launch new products
- Translate and localize content
- Deliver content to multiple output devices
4.1 Syndication and Redistribution
![](icons/profit.gif) Because data with formally defined structures can be easily shared, standardization forges new business opportunities through the possibility exchanging data.
- An organization can channel its content to other internal business units or external business partners
4.2 Targeted Advertising
![](icons/profit.gif) A well-defined data structure standard uses metadata to capture the meaning and relationships of content, opening up the possibility of
- Serving contextualized advertising based on content, both in print and online
- Serving contextualized advertising based on a user's request for on-demand content
4.3 Personalized, Aggregated, and Segmented Content
![](icons/profit.gif) Standardization can help retain existing customers and attract new customers by enabling companies to
- Deliver personalized content (with advertising targeted to user's profile)
- Aggregate content across domains to sell to external business partners
- Segment content so users can select only segments of direct interest or aggregating segments into new, customized products
4.4 Developing Niche Products
![](icons/profit.gif) Standardization gives organizations the ability to develop niche products by helping to identify marketable subsets of content
(through, for instance, tracking) or by
combining content from various domains in new and unique ways.
Example: A publisher of business magazines may be able to recombine discrete pieces of content from multiple magazines into a new stream of unified content that has value to a specific set of consumers.
(Information from magazine on celebrity lifestyles + information from yachting magazine = business stream on selling yachts to celebrities.)
4.5 Tapping New Outlets and Launching New Products
![](icons/profit.gif) Standardization also gives organizations the prospect of developing new opportunities through
- Creating dynamic content in response to user requests or profiles
- With standards in place, it is easier to develop and launch new content-based products such as web sites.
4.6 Translating, Localizing, and Internationalizing Content
![](icons/profit.gif) An organization can engineer its standard to use abstract structures for aspects of content tied to locale -- for example, currency -- to ease localization.
There are also standardization methods that improve the efficiency of translating or internationalizing content.
The result: An organization can introduce its offerings in new locales, cultures, and languages.
4.7 Single Sourcing
![](icons/profit.gif) An XML-based standard that separates content from presentation provides the foundation for delivering
content in multiple output formats.
This enables businesses to find new opportunities by distributing their content to new channels. (Formatting is done with stylesheets.)
- WML: mobile devices
- HTML: web sites
- XML: business exchange partners
- XSL:FO: magazines, books, brochures
5 Thriving on the Business Web
We are at the cusp of the "business web" -- a worldwide network of interconnected business services.
Businesses well positioned to exploit it will thrive.
Making an organization's core data and services available through other companies can vastly increasing the market for those services and data.
Data structure standards that enable the rapid and innovative dessimination of business services and information -- including through other companies -- may well be one of the keys to integrating into and thriving within this environment.
6 About this Document
This presentation was created using IBM's Darwin Information Typing Architecture (DITA), a topic-oriented DTD, and a customized XSLT stylesheet.
| |
|
![cover art](http://images.amazon.com/images/P/0764547593.01.THUMBZZZ.jpg) | XML Weekend Crash Course (with CD-ROM)
Wiley
New $16.99
|
![cover art](http://images.amazon.com/images/P/0764519956.01.THUMBZZZ.jpg) | HTML 4 for Dummies, Fourth Edition
For Dummies
New $15.74
|
![cover art](http://images.amazon.com/images/P/0596002963.01.THUMBZZZ.jpg) | HTML Pocket Reference (2nd Edition)
O'Reilly
New $8.95
|
![cover art](http://images.amazon.com/images/P/0764588486.01.THUMBZZZ.jpg) | Syndicating Web Sites with RSS Feeds For Dummies ®
For Dummies
New $16.49
|
![cover art](http://images.amazon.com/images/P/0789728249.01.THUMBZZZ.jpg) | MCAD/MCSD Training Guide (70-320): Developing XML Web Services and Server Components with Visual C# .NET and the .NET Framework
Que
New $49.99
|
![cover art](http://images.amazon.com/images/P/0672326515.01.THUMBZZZ.jpg) | Securing Web Services with WS-Security : Demystifying WS-Security, WS-Policy, SAML, XML Signature, and XML Encryption
Sams
New $28.54
|
![cover art](http://images.amazon.com/images/P/0764557386.01.THUMBZZZ.jpg) | Creating Cool Web Sites with HTML, XHTML, and CSS
Wiley
New $16.49
|
![cover art](http://images.amazon.com/images/P/0596004206.01.THUMBZZZ.jpg) | Learning XML, Second Edition
O'Reilly
New $26.37
|
![cover art](http://images.amazon.com/images/P/0735615861.01.THUMBZZZ.jpg) | MCAD/MCSD Self-Paced Training Kit: Developing XML Web Services and Server Components with Microsoft Visual Basic .NET and Microsoft Visual C# .NET
Microsoft Press
New $44.09
|
![cover art](http://images.amazon.com/images/P/0072229527.01.THUMBZZZ.jpg) | Oracle Database 10g XML & SQL: Design, Build, & Manage XML Applications in Java, C, C++, & PL/SQL (Osborne ORACLE Press Series)
McGraw-Hill Osborne Media
New $37.79
|
|
|
|
(Prices May Change)
Privacy
Information |
|
|
|