SECTION 10.9 Web Capture
The E (expiration) entry specifies an expiration date for each source location
associated with a content set. If the current date and time are later than those
specified, the contents of the content set should be considered out of date with
the original source.
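The expiration test described above can be sketched as follows. This is a minimal illustration, not part of the specification: the function name `is_out_of_date` is hypothetical, and the sketch assumes the E value has already been parsed from its date string into a timezone-aware `datetime`.

```python
from datetime import datetime, timezone


def is_out_of_date(expiration, now=None):
    """Return True if the current date and time are later than the E value.

    Assumption: `expiration` is a timezone-aware datetime already parsed
    from the dictionary's E entry; parsing is outside this sketch.
    """
    if now is None:
        now = datetime.now(timezone.utc)
    # Strictly later than the expiration means out of date.
    return now > expiration
```

Passing `now` explicitly makes the check reproducible; leaving it out compares against the current UTC time.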
URL Alias Dictionaries
When a URL is accessed via HTTP, a response header may be returned indicating
that the requested data is at a different URL. This redirection process may be repeated
in turn at the new URL and can potentially continue indefinitely. It is not
uncommon to find multiple URLs that all lead eventually to the same destination
through one or more redirections. A URL alias dictionary represents such a set of
URL chains leading to a common destination. Table 10.42 shows the contents of
this type of dictionary.
TABLE 10.42 Entries in a URL alias dictionary
KEY   TYPE           VALUE
U     ASCII string   (Required) The destination URL to which all of the chains
                     specified by the C entry lead.
C     array          (Optional) An array of one or more arrays of strings, each
                     representing a chain of URLs leading to the common
                     destination specified by U.
The C (chains) entry should be omitted if the URL alias dictionary contains only
one URL. If C is present, its value is an array of arrays, each representing a chain
of URLs leading to the common destination. Within each chain, the URLs are
stored as ASCII strings in the order in which they occur in the redirection sequence.
The common destination (the last URL in a chain) may be omitted, since
it is already identified by the U entry. (See implementation note 166 in Appendix
H.)
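The rules for the U and C entries can be illustrated with a short sketch. It models a URL alias dictionary as a plain Python dict with the same keys; the helper names `make_url_alias` and `all_urls` and the example URLs are hypothetical, and nothing here reads or writes an actual PDF file.

```python
def make_url_alias(destination, chains=()):
    """Build a dict mirroring a URL alias dictionary (keys U and C)."""
    alias = {"U": destination}
    stored = []
    for chain in chains:
        urls = list(chain)
        # The last URL in a chain may be omitted when it is the common
        # destination, since U already identifies it.
        if urls and urls[-1] == destination:
            urls = urls[:-1]
        if urls:
            stored.append(urls)
    # C is omitted when the dictionary would contain only one URL.
    if stored:
        alias["C"] = stored
    return alias


def all_urls(alias):
    """Return every distinct URL in the dictionary, destination last."""
    seen = []
    for chain in alias.get("C", []):
        for url in chain:
            if url != alias["U"] and url not in seen:
                seen.append(url)
    seen.append(alias["U"])
    return seen
```

For example, `make_url_alias("http://example.com/final")` yields a dictionary with only a U entry, while passing one or more redirection chains adds a C entry holding each chain in redirection order.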
Command Dictionaries
A Web Capture command dictionary represents a command executed by Web
Capture to retrieve one or more pieces of source data that were used to create new
pages or modify existing pages. The entries in this dictionary represent
parameters that were originally specified interactively by the user who requested
that the Web content be captured. This information is recorded so that the com-