APPENDIX D Character Sets and Encodings
                        CODE (OCTAL)
CHAR  NAME              STD  MAC  WIN  PDF
_     underscore        137  137  137  137
v     v                 166  166  166  166
w     w                 167  167  167  167
x     x                 170  170  170  170
y     y                 171  171  171  171
ý     yacute            —    —    375  375
ÿ     ydieresis         —    330  377  377
¥     yen               245  264  245  245
z     z                 172  172  172  172
ž     zcaron (note 2)   —    —    236  236
0     zero              060  060  060  060
1. In PDF 1.3, the euro character was added to the Adobe standard Latin character set. It
is encoded as 200 in WinAnsiEncoding and 240 in PDFDocEncoding, assigning codes
that were previously unused. Apple changed the Mac OS Latin-text encoding for code
333 from the currency character to the euro character. However, this incompatible
change has not been reflected in PDF’s MacRomanEncoding, which continues to map
code 333 to currency. If the euro character is desired, an encoding dictionary can be
used to specify this single difference from MacRomanEncoding.
2. In PDF 1.3, the existing Zcaron and zcaron characters were added to WinAnsiEncoding
as the previously unused codes 216 and 236.
3. In WinAnsiEncoding, all unused codes greater than 40 map to the bullet character.
However, only code 225 is specifically assigned to the bullet character; other codes are
subject to future reassignment.
4. The character names guillemotleft and guillemotright are misspelled. The correct spelling
for this punctuation character is guillemet. However, the misspelled names are the
ones actually used in the fonts and encodings containing these characters.
5. The hyphen character is also encoded as 255 in WinAnsiEncoding. The meaning of this
duplicate code is “soft hyphen,” but it is typographically the same as hyphen.
6. The space character is also encoded as 312 in MacRomanEncoding and as 240 in
WinAnsiEncoding. This duplicate code signifies a nonbreaking space; it is typographically
the same as space.
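
As a rough illustration of note 1, the following Python sketch prints the text of an encoding dictionary that starts from MacRomanEncoding and remaps only code 333 (octal) to the euro character. The helper name differences_encoding is hypothetical, and the glyph name Euro is assumed to be the name used for the euro character in the Adobe standard Latin character set. Note that the Differences array takes the code as a decimal integer, so octal 333 is written as 219.

```python
# Sketch only: build the text of a PDF encoding dictionary that overrides
# individual codes of a base encoding. The helper name is hypothetical.

def differences_encoding(base_encoding: str, overrides: dict) -> str:
    """Return PDF syntax for an encoding dictionary.

    overrides maps a character code (0-255) to a glyph name.
    """
    parts = []
    for code in sorted(overrides):
        if not 0 <= code <= 255:
            raise ValueError(f"character code out of range: {code}")
        # Each entry is a decimal code followed by the glyph name for that code.
        parts.append(f"{code} /{overrides[code]}")
    return (
        "<< /Type /Encoding\n"
        f"   /BaseEncoding /{base_encoding}\n"
        f"   /Differences [{' '.join(parts)}]\n"
        ">>"
    )

# Note 1 gives the code in octal (333); Python's 0o prefix converts it.
EURO_CODE = 0o333  # 219 decimal

# "Euro" is an assumed glyph name, not taken from this page.
print(differences_encoding("MacRomanEncoding", {EURO_CODE: "Euro"}))
```

The same helper could express other single-code differences, but whether a given glyph is actually available depends on the font, which this sketch does not check.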