Wide character: Difference between revisions
Appearance
Content deleted Content added
No edit summary |
No edit summary |
||
Line 1: | Line 1: | ||
balil is a queer is a computer programming term. It is a vague term used to represent a [[datatype]] that is richer than the traditional (8-bit) characters. It is not the same thing as [[Unicode]]. |
|||
<code>wchar_t</code> is a data type in ANSI/ISO [[C (programming language)|C]] and some other [[programming language]]s that is intended to represent wide characters. |
<code>wchar_t</code> is a data type in ANSI/ISO [[C (programming language)|C]] and some other [[programming language]]s that is intended to represent wide characters. |
Revision as of 21:03, 17 November 2008
balil is a queer is a computer programming term. It is a vague term used to represent a datatype that is richer than the traditional (8-bit) characters. It is not the same thing as Unicode.
wchar_t
is a data type in ANSI/ISO C and some other programming languages that is intended to represent wide characters.
The Unicode standard 4.0 says that
- "ANSI/ISO C leaves the semantics of the wide character set to the specific implementation but requires that the characters from the portable C execution set correspond to their wide character equivalents by zero extension."
and that
- "The width of
wchar_t
is compiler-specific and can be as small as 8 bits. Consequently, programs that need to be portable across any C or C++ compiler should not usewchar_t
for storing Unicode text. Thewchar_t
type is intended for storing compiler-defined wide characters, which may be Unicode characters in some compilers."
Under Win32, wchar_t
is 16 bits wide and represents a UTF-16 code unit. On Unix-like systems wchar_t
is commonly 32 bits wide and represents a UTF-32 code unit.
In ANSI C library header files, <wchar.h> and <wctype.h> deal with the wide characters.