Progress
Internationalization Guide
Code Pages
A code page is a table that assigns a numeric value to each element in a collection of letters, numbers, punctuation, control codes, and other characters. The assignment is one-to-one, meaning that no two characters are assigned the same numeric value and that no two numeric values are assigned the same character.
Figure 2–1 shows the characters and numeric values of the IBM850 code page, widely used in western Europe and the Americas. The white area contains the characters, while the two light gray areas contain the numeric values. By convention, numeric values are in hexadecimal (hex). To compute the numeric value of a character, add the numeric value at the top of the character’s column to the numeric value at the far left of the character’s row. For example, the numeric value of the character “Ö” is 99 hex (90 hex + 9 hex), which equals 153 in decimal.
Figure 2–1: The IBM850 Code Page
![]()
Figure 2–2, which uses the same format as Figure 2–1, shows the characters and numeric values of the ISO8859–1 code page, also widely used in western Europe and the Americas.
Figure 2–2: The ISO8859–1 Code Page
![]()
Here are some other important points about code pages:
- The term code point refers to an element of a code page—that is, to a character and its numeric value. For example, in the ISO8859–1 code page, code point 4B contains the character “K” and the value 4B hex.
- A code page can be single byte, double byte, or triple byte, depending on the maximum size of the numeric value in each entry. An example of a double-byte code page is BIG–5, used for Traditional Chinese. An example of a triple-byte code page is UTF–8, an encoding of Unicode. For more information on using multi-byte code pages in applications, see "Using Multi-byte Code Pages," and Using Unicode."
Copyright © 2004 Progress Software Corporation www.progress.com Voice: (781) 280-4000 Fax: (781) 280-4095 |