Atomik Xport SE: Tutorials << >>

Chapter 4 Tutorial 2: Creating Rulesets

In this tutorial, you will learn about how to set up and manage rulesets, which are the tools that control Atomik Xport’s Automated Matching System (AMS). The Automated Matching System is the functionality within Atomik Xport that enables the extraction of content from QuarkXPress to be automated. Understanding the principles contained in this chapter will enhance your ability to get the most out of Atomik Xport.

Definition: A Document Type Definition (DTD) is a specification that describes the structure of an XML document.

Definition: A ruleset is a Document Type Definition (DTD) together with a set of mappings and preferences. The mappings define which styles used in the QuarkXPress project should be mapped to the individual elements of the DTD. The ruleset preferences provide additional information relating to the extraction process such as the order in which Atomik Xport goes through the layout.

A ruleset is a key component of the extraction process. A ruleset essentially creates structure for content based on the way it is presented in QuarkXPress. This is a process that the human brain can perform almost without thinking. However a computer does not have this inherent ability. For example, when you see a page of a newspaper, you know a headline is a headline because it is larger and bolder than the bodytext of a story. Atomik Xport uses rules like these to associate specific styles (in QuarkXPress) with the type of information those styles represent. In other words, rulesets infer structure from style.

Creating structure from style is a key principle behind Atomik Xport and essential because without it, content held in QuarkXPress layouts remains essentially unstructured. Creating structure adds significant value to content, because you know what each piece of content actually is (e.g. a headline) rather than only what each piece of content says or looks like.

The structure of the output XML is defined by the DTD used in the ruleset. Therefore, creating a DTD is the first and possibly most important step in preparing to extract content from QuarkXPress. Once you have a DTD, you can create a ruleset within Atomik Xport.

The key stages in this tutorial are:

  1. 1 Loading a DTD into Atomik Xport
  2. 2 Creating a ruleset in Atomik Xport
  3. 3 Creating Rules within the ruleset

Objective

The objective of this tutorial is to provide a thorough understanding of how to create rulesets that are used for extracting XML in Atomik Xport.