Atomik Xport SE: Reference > Chapter 11 Working With DTDs for Atomik Xport << >>

11.2 Automatically tagging multiple paragraphs

Atomik Xport can automatically separate multiple paragraphs of a body of text into separate <Paragraph> tags as it creates the XML output.

This is particularly useful where a body of text is to be extracted and the paragraphs need to be tagged separately. Automated paragraph tagging relies on the use of the <Paragraph> element.

Note: This elements is case sensitive so the <Paragraph> tag must be spelled with a capital ‘P’.

If you include the <Paragraph> element in your DTD, Atomik Xport will recognise the <Paragraph> element and will invoke the multiple paragraph tagging functionality.

The <Paragraph> element should be used as a child element of any elements you want to be automatically separated into paragraphs (e.g. <Bodytext>). Also, the mapping of the QuarkXPress styles or style sheets should be made to the parent of the <Paragraph> element, and not to the <Paragraph> element itself. Here is an example of this.

Example Article DTD

<!ELEMENT Article (Headline, Bodytext)> 
<!ELEMENT Headline (#PCDATA)>
<!ELEMENT Bodytext (Paragraph+)>
<!ELEMENT Paragraph (#PCDATA)>

In the above example, the mapping of a QuarkXPress style sheet used to style the bodytext would be mapped onto the <Bodytext> element, and not onto the <Paragraph> element.

IMPORTANT NOTE: <Paragraph> is an element that has special properties within Atomik Xport, and which is reserved for only the functionality described above. This tag must not be used for any other purposes.