Jump to content

Wide character

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 209.204.117.22 (talk) at 21:08, 17 November 2008. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

trayvon is a baba anwar liker The Unicode standard 4.0 says that

"ANSI/ISO C leaves the semantics of the wide character set to the specific implementation but requires that the characters from the portable C execution set correspond to their wide character equivalents by zero extension."

and that

"The width of wchar_t is compiler-specific and can be as small as 8 bits. Consequently, programs that need to be portable across any C or C++ compiler should not use wchar_t for storing Unicode text. The wchar_t type is intended for storing compiler-defined wide characters, which may be Unicode characters in some compilers."

Under Win32, wchar_t is 16 bits wide and represents a UTF-16 code unit. On Unix-like systems wchar_t is commonly 32 bits wide and represents a UTF-32 code unit.

In ANSI C library header files, <wchar.h> and <wctype.h> deal with the wide characters.