Previous Next
948
CHAPTER 10 Document Interchange
HTML file retrieved from the URL < http://www.adobe.com/> has been converted
to three pages in the PDF file. The entry for that URL in the URLS name tree
points to a page set containing the three pages. Similarly, the IDS name tree con-
tains an entry pointing to the same page set, associated with the digital identifier
calculated from the HTML source (the string shown in the figure as 904B …1EA2).
Document catalog
Dictionary
Name dictionary
Name tree
URLS IDS
http://www.adobe.com/ 904B…1EA2
Page set
Page Page Page
FIGURE 10.1 Simple Web Capture file structure
Entries in the URLS and IDS name trees may refer to an array of content sets
instead of just a single content set. The content sets need not have the same sub-
type, but may include both page sets and image sets. In Figure 10.2, for example, a
GIF file has been retrieved from a URL (< http://www.adobe.com/getacro.gif >)
Previous Next