Corrections to Chapter 7 of The XML Bible, Foreign Languages and Non-Roman Text

p.168: The paragraph at the top of the page lost the Chinese ideographs. It should read

You would then choose the character you wanted, . The exact details of both the GUI and the transliteration system used to convert typed keys like yang to the ideographic characters like Chinese Ideogram vary from program to program, operating system to operating system, and language to language.

p.170: In Table 7-1 the closing parenthesis after Control-K for ASCII Code 11, the vertical tab, somehow migrated to the previous line. What you see is

vertical tab)
(Control-K

What it should be is

vertical tab
(Control-K)

p. 172: The last column of Table 7-2 incorrectly repeats the second to last column. The correct table follows. (You may need to set your browser's character set to ISO 8859-1 aka Latin-1 or to Windows 1252 aka Cp1252 in order to see some of the correct characters.)

Table 7-2

The ISO 8859-1 Latin-1 Character Set

Code Character Code Character Code Character Code Character
128 Undefined 160 non-breaking space 192 À 224 à
129 Undefined 161 193 Á 225 á
130 Bph 162 194 Â 226 â
131 Nbh 163 195 Ã 227 ã
132 Undefined 164 196 Ä 228 ä
133 Nel 165 197 Å 229 å
134 Ssa 166 198 Æ 230 æ
135 Esa 167 199 Ç 231 ç
136 Hts 168 200 È 232 è
137 Htj 169 © 201 É 233 é
138 Vts 170 202 Ê 234 ê
139 Pld 171 203 Ë 235 ë
140 Plu 172 204 Ì 236 ì
141 Ri 173 Discretionary hyphen 205 Í 237 í
142 ss2 174 ® 206 Î 238 î
143 ss3 175 207 Ï 239 ï
144 Dcs 176 208 Ð 240 ð
145 pu1 177 209 Ñ 241 ñ
146 pu2 178 210 Ò 242 ò
147 Sts 179 211 Ó 243 ó
148 Cch 180 212 Ô 244 ô
149 Mw 181 213 Õ 245 õ
150 Spa 182 214 Ö 246 ö
151 Epa 183 215 247
152 Sos 184 216 Ø 248 ø
153 Undefined 185 217 Ù 249 ù
154 Sci 186 218 Ú 250 ú
155 Csi 187 219 Û 251 û
156 St 188 220 Ü 252 ü
157 Osc 189 221 Ý 253 ý
158 Pm 190 222 Þ 254 þ
159 Apc 191 223 ß 255 ÿ

p. 174: In Table 7-3, in the row for ISO 8859-9, four of the six the Icelandic letters are missing. The last couple of lines of the second column should read, "...take the place of the less commonly used Icelandic letters Ý, ý, Þ, þ, Ð, and ð."

p. 175: In the third paragraph, the Icelandic letters are missing. The third and fourth sentences should read "MacRoman has most of the same extended characters as Latin-1 (except for the Icelandic letters Ý, ý, Þ, þ, Ð, and ð) but the characters are assigned to different numbers. MacRoman is the same as ASCII and Latin-1 in the codes through the first 127 characters."

p.184, The second paragraph of the "Converting to and from Unicode" sectionshoudl have a colon at the end. That is,

For example, the following command converts a text file named myfile.txt from the platform's default encoding to Unicode:

[ Cafe con Leche | Order from amazon.com | The XML Bible Home Page | Table of Contents | Updated Chapters | Examples | Errata from other Chapters ]

Copyright 1999, 2000 Elliotte Rusty Harold
elharo@metalab.unc.edu
Last Modified April 15, 2000