CHAPTER 10
956
Document Interchange
KEY
TYPE
VALUE
S
integer
(Optional)
A code indicating the type of form submission, if any, by which the source
data was accessed (see “Submit-Form Actions” on page 703):
0
1
2
Not accessed by means of a form submission
Accessed by means of an HTTP GET request
Accessed by means of an HTTP POST request
This entry should be present only in source information dictionaries associated with
page sets. Default value: 0.
C
dictionary
(Optional; must be an indirect reference)
A command dictionary (see “Command Dic-
retrieved. This entry should be present only in source information dictionaries associ-
ated with page sets.
In the simplest case, the content set’s
SI
entry just contains a single source infor-
mation dictionary. However, it is not uncommon for the same source data to be
accessible from two or more unrelated URLs. When Web Capture detects such a
condition (by comparing digital identifiers), it generates a single content set from
the source data, containing just one copy of the relevant PDF pages or image
XObjects, but creates multiple source information dictionaries describing the
separate ways in which the original source data can be accessed. It then stores an
array containing these multiple source information dictionaries as the value of
the
SI
entry in the content set dictionary.
A source information dictionary’s
AU
(aliased URLs) entry identifies the URLs
from which the source data was retrieved. If there is only one such URL, a simple
string suffices as the value of this entry. If multiple URLs map to the same loca-
tion through redirection, the
AU
value is a URL alias dictionary representing
them (see “URL Alias Dictionaries,” below).
Note:
For file size efficiency, it is recommended that the entire URL alias dictionary
(excluding the URL strings) be represented as a direct object because its internal
structure should never be shared or externally referenced.
The
TS
(time stamp) entry allows each source location associated with a content
set to have its own time stamp. This is necessary because the time stamp in the
content set dictionary (see Table 10.38 on page 953) merely refers to the creation
date of the content set. A hypothetical “Update Content Set” command might re-
set the time stamp in the source information dictionary to the current time if it
found that the source data had not changed since the time stamp was last set.