Previous Next
856
CHAPTER 10 Document Interchange
PDF logical structure shares basic features with standard document markup
languages such as HTML, SGML, and XML. A document’s logical structure is
expressed as a hierarchy of structure elements, each represented by a dictionary
object. Like their counterparts in other markup languages, PDF structure
elements can have content and attributes. In PDF, rendered document content
takes over the role occupied by text in HTML, SGML, and XML.
A PDF document’s logical structure is stored separately from its visible content,
with pointers from each to the other. This separation allows the ordering and
nesting of logical elements to be entirely independent of the order and location of
graphics objects on the document’s pages.
The MarkInfo entry in the document catalog (see Section 3.6.1, “Document Cata-
log”) specifies a mark information dictionary, whose entries are shown in
Table 10.8. It provides additional information relevant to specialized uses of
structured PDF documents.
TABLE 10.8 Entries in the mark information dictionary
KEY TYPE VALUE
Marked boolean (Optional) A flag indicating whether the document conforms to Tagged PDF
conventions. Default value: false.
Note: If Suspects is true, the document may not completely conform to Tagged PDF
conventions.
UserProperties boolean (Optional; PDF 1.6) A flag indicating the presence of structure elements that
contain user properties attributes (see “User Properties” on page 876). Default
value: false.
Suspects boolean (Optional; PDF 1.6) A flag indicating the presence of tag suspects (see “Page
Content Order” on page 889). Default value: false.
10.6.1 Structure Hierarchy
The logical structure of a document is described by a hierarchy of objects called
the structure hierarchy or structure tree. At the root of the hierarchy is a dictionary
object called the structure tree root, located by means of the StructTreeRoot entry
in the document catalog (see Section 3.6.1, “Document Catalog”). Table 10.9
shows the entries in the structure tree root dictionary. The K entry specifies the
immediate children of the structure tree root, which are structure elements.
Previous Next