This article describes how to write applications that communicate with the user in a language other than English.
Unfortunately many languages use different charsets under Unix and Windows (and other platforms, to make the situation even more complicated). These charsets usually differ in so many characters that it is impossible to use the same texts under all platforms.
The wxWidgets library provides a mechanism that helps you avoid distributing many identical, only differently encoded, packages with your application (e.g. help files and menu items in iso8859-13 and windows-1257). Thanks to this mechanism you can, for example, distribute only iso8859-13 data and it will be handled transparently under all systems.
Please read the Internationalization which describes the locales concept.
In the following text, wherever iso8859-2 and windows-1250 are used, any encodings are meant and any encodings may be substituted there.
The best way to ensure correctly displayed texts in a GUI across platforms is to use locales. Write your in-code messages in English or without diacritics and put real messages into the message catalog (see Internationalization).
A standard .po file begins with a header like this:
Note this particular line:
It specifies the charset used by the catalog. All strings in the catalog are encoded using this charset.
You have to fill in proper charset information. Your .po file may look like this after doing so:
(Make sure that the header is not marked as fuzzy.)
wxWidgets is able to use this catalog under any supported platform (although iso8859-2 is a Unix encoding and is normally not understood by Windows).
How is this done? When you tell the wxLocale class to load a message catalog that contains a correct header, it checks the charset. The catalog is then converted to the charset used (see wxLocale::GetSystemEncoding and wxLocale::GetSystemEncodingName) by the user's operating system.
By convention, you should only use characters without diacritics (i.e. 7-bit ASCII strings) for msgids in the source code and write them in English.
If you port software to wxWidgets, you may be confronted with legacy source code containing non-English string literals. Instead of translating the strings in the source code to English and putting the original strings into message catalog, you may configure wxWidgets to use non-English msgids and translate to English using message catalogs:
xgettext
to extract the strings from the source code, specify the option --from-code=<source code charset>
. You can use wxMBConv Overview and wxFontMapper to display text:
You may want to store all program data (created documents etc.) in the same encoding, let's say utf-8
. You can use wxCSConv to convert data to the encoding used by the system your application is running on (see wxLocale::GetSystemEncoding).
If you're using wxHtmlHelpController there is no problem at all. You only need to make sure that all the HTML files contain the META tag:
Also, the hhp project file needs one additional line in the OPTIONS
section:
This additional entry tells the HTML help controller what encoding is used in contents and index tables.