Progress
Internationalization Guide
Creating and Modifying Word-break Tables
Progress provides a collection of word-break tables in the
DLC/prolang/convmap
directory. Figure 3–6 shows one of them,big5-bas.wbt
. Its name reflects the code page BIG-5, a code page used for Traditional Chinese.
Figure 3–6: The big5-bas.wbt Word-break Table
Understanding Word-delimiter Attributes
The keywords BEFORE_DIGIT, IGNORE, and USE_IT, which appear in Figure 3–6, are word-delimiter attributes. Each word-delimiter attribute describes a word-break role played by a code page element. The complete set of word-delimiter attributes appears in Table 3–4.
Word-break Table Syntax
Word-break behavior varies widely between and even within locales. If CONTAINS queries do not work as expected in a particular locale, you can copy a word-break table that Progress provides and modify it as necessary. You can also create a word-break table from scratch. The syntax is as follows:
symbolic-name
The name of a symbol.
For example: DOLLAR-SIGN
symbol-value
The value of the symbol.
For example: ’$’
NOTE: Although some versions of Progress let you compile word-break tables that omit all items within the second pair of square brackets, Progress Software Corporation (PSC) recommends that you always include these items. If the source-code version of a compiled word-break table lacks these items, and the associated database is not so large as to make this impractical, PSC recommends that you add these items to the table, recompile the table, reassociate the table with the database, and rebuild the indexes.codepage-name
The name, not surrounded by quotes, of the code page the word-break table is associated with. The maximum length is 20 characters.
For example: UTF–8
wordrules-name
The name, not surrounded by quotes, of the compiled word-break table. The maximum length is 20 characters.
For example: utf8sample
table-type
The number 3.
NOTE: Although Progress supports existing word-break tables of type 1 and type 2, Progress Software Corporation recommends that, if feasible, you change their table type to 3. If you do, you must also recompile the word-break table, reassociate it with the database, and rebuild the indexes.char-literal
A character within single quotes or a
symbolic-name
, which represents a character in the code page.For example: ’#’
hex-literal
A hexadecimal value or a
symbolic-name
, which represents a character in the code page.For example:0xAC
decimal-literal
A decimal value or a
symbolic-name
, which represents a character in the code page.For example: 39
word-delimiter-attribute
In what context the character is a word delimiter. Use one of the word delimiter attributes in Table 3–4.
Copyright © 2004 Progress Software Corporation www.progress.com Voice: (781) 280-4000 Fax: (781) 280-4095 |