Unicode input: Difference between revisions
→Character Mnemonics: exact numbers of di/trigraphs in RFC1345 (this is OR, but whatever) |
→Hex input: Linux bullet was redundant to intro example and did not specify context; just point to that. |
||
Line 17: | Line 17: | ||
* The [[RichEdit]] control on Microsoft Windows (as used in for example [[WordPad]]) supports the following input method: one first enters the character’s '''hexadecimal code''', then immediately presses '''<code>Alt + x</code>'''. For example, entering '''f1''' and then pressing the combination will produce the character ''ñ''. The code must not be preceded by any digit or letters a-f as they will be treated as part of the code to be converted. This also works on [[Microsoft Word]] 2002/2003 for Windows. |
* The [[RichEdit]] control on Microsoft Windows (as used in for example [[WordPad]]) supports the following input method: one first enters the character’s '''hexadecimal code''', then immediately presses '''<code>Alt + x</code>'''. For example, entering '''f1''' and then pressing the combination will produce the character ''ñ''. The code must not be preceded by any digit or letters a-f as they will be treated as part of the code to be converted. This also works on [[Microsoft Word]] 2002/2003 for Windows. |
||
⚫ | * In the [[Vim editor]], the user first types <code>Ctrl-V u</code>, then types in the hexadecimal number of the symbol or character desired, and it will be converted into the symbol. (On Microsoft Windows, <code>Ctrl-Q</code> may be required instead of <code>Ctrl-V</code>. <ref name="vim_w32">[http://www.vim.org/htmldoc/gui_w32.html#gui-clipboard Vim documentation: gui_w32]</ref>) |
||
* In [[Emacs]], <code>[[Meta key|M]]-x ucs-insert<code>. |
|||
⚫ | * In the [[Vim editor]], the user first types <code>Ctrl-V u</code>, then types in the hexadecimal number of the symbol or character desired, and it will be converted into the symbol. (On Microsoft Windows, <code>Ctrl-Q</code> may be required instead of <code>Ctrl-V</code>. <ref name="vim_w32">[http://www.vim.org/htmldoc/gui_w32.html#gui-clipboard Vim documentation: gui_w32]</ref>) |
||
* In Mac OS X and in Mac OS 8.5 and later: one chooses the ''Unicode Hex Input'' keyboard layout. Holding down the [[Option key]], one then types the four-digit [[Hexadecimal|hex]] Unicode code point. On releasing the Option key; the equivalent character will appear.<ref>[http://mac.sillydog.org/archives/001703.php typing special and accented characters]</ref> |
* In Mac OS X and in Mac OS 8.5 and later: one chooses the ''Unicode Hex Input'' keyboard layout. Holding down the [[Option key]], one then types the four-digit [[Hexadecimal|hex]] Unicode code point. On releasing the Option key; the equivalent character will appear.<ref>[http://mac.sillydog.org/archives/001703.php typing special and accented characters]</ref> |
||
* On Microsoft Windows, if the [[Windows registry#Keys and values|registry key]] <code>HKEY_Current_User\Control Panel\Input Method\EnableHexNumpad</code> has a value of "1", holding down alt and pressing the "plus" on the numeric keypad, followed by the hex code, will work.<ref name="mswin">[http://www.fileformat.info/tip/microsoft/enter_unicode.htm How to enter Unicode characters in Microsoft Windows<!-- Bot generated title -->]</ref> |
* On Microsoft Windows, if the [[Windows registry#Keys and values|registry key]] <code>HKEY_Current_User\Control Panel\Input Method\EnableHexNumpad</code> has a value of "1", holding down alt and pressing the "plus" on the numeric keypad, followed by the hex code, will work.<ref name="mswin">[http://www.fileformat.info/tip/microsoft/enter_unicode.htm How to enter Unicode characters in Microsoft Windows<!-- Bot generated title -->]</ref> |
||
* In [[Linux]], see the description of GTK+ at the beginning of this section (if applicable). |
|||
* In [[Linux]]<sup>[which?]</sup> first press '''<code>Ctrl+Shift+U</code>''', then type the desired hexadecimal code. I.e. type "0041" to get the letter "A". |
|||
== Decimal Input == |
== Decimal Input == |
Revision as of 06:01, 21 November 2008
Template:Table Unicode Many systems provide direct Unicode input support in some form to allow selection of arbitrary Unicode characters.
Selection from a screen
Many systems provide a way to select Unicode characters visually. ISO 14755 refers to this as a screen-selection entry method. On some systems this is limited to characters that are present in a specified font, or where a font containing the character exists at all.
Microsoft Windows has provided a Unicode version of the Character Map program since version NT 4.0 - appearing in the consumer edition since XP. This is limited to characters in the Basic Multilingual Plane. Characters are searchable by Unicode character name, and the table can be limited to a particular code block.
Mac OS X provides a "character palette" with much the same functionality, along with searching by related characters, glyph tables in a font, etc.
Equivalent tools (such as gucharmap) exist on most Linux desktop environments.
Hex input
Clause 5.1 of ISO 14755 describes a Basic method whereby a beginning sequence is followed by the hexadecimal representation of the codepoint and the ending sequence. On some systems, this is limited to the BMP (characters up to U+FFFF).
An example of an ISO 14755-conformant system is GTK+, where the beginning sequence is CTRL+SHIFT+U and the ending sequence is null. In some older versions Ctrl and Shift must be held down while entering the number In GTK+ versions before 2.10, Ctrl-Shift-U is not used, only Ctrl-Shift-[hex number]
- The RichEdit control on Microsoft Windows (as used in for example WordPad) supports the following input method: one first enters the character’s hexadecimal code, then immediately presses
Alt + x
. For example, entering f1 and then pressing the combination will produce the character ñ. The code must not be preceded by any digit or letters a-f as they will be treated as part of the code to be converted. This also works on Microsoft Word 2002/2003 for Windows. - In the Vim editor, the user first types
Ctrl-V u
, then types in the hexadecimal number of the symbol or character desired, and it will be converted into the symbol. (On Microsoft Windows,Ctrl-Q
may be required instead ofCtrl-V
. [1]) - In Emacs,
M-x ucs-insert
.
- In Mac OS X and in Mac OS 8.5 and later: one chooses the Unicode Hex Input keyboard layout. Holding down the Option key, one then types the four-digit hex Unicode code point. On releasing the Option key; the equivalent character will appear.[2]
- On Microsoft Windows, if the registry key
HKEY_Current_User\Control Panel\Input Method\EnableHexNumpad
has a value of "1", holding down alt and pressing the "plus" on the numeric keypad, followed by the hex code, will work.[3] - In Linux, see the description of GTK+ at the beginning of this section (if applicable).
Decimal Input
On some applications on Microsoft Windows, particularly those using the RichEdit control, decimal Unicode code points (e.g., 256 for U+0100) are supported with Alt codes.
Character Mnemonics
RFC 1345 defines a large number (1,893) of suggested mnemonics for code points in Unicode 1.0 (as well as characters in ISO 2DIS 10646 and many other character sets in use at the time of publication). Although the document does not restrict the length of a mnemonic (e.g., "10000R" for U+2821), most (1,338), of the mnemonics are digraphs, that is, two characters long, and most (416) of the remaining are trigraphs. While never complete, and targeting obsolescent set definitions, the mnemonics themselves can still be used.
- Vim allows digraph entry in insert mode (the regular mode for typing text) with
Ctrl-K
followed by a two-keystroke RFC 1345 mnemonic; or, in addition, if the :digraph
option is set, by entering the first character followed by a backspace followed by the second character. Custom digraphs can also be defined for arbitrary code points. (E.g. "dig Gr 9881" associates "Gr" with U+2699 GEAR.)
- GNU Screen allows digraph entry with (by default) Ctrl-A Ctrl-V.
See also
Wikibooks has a book on the topic of: Unicode/List of useful symbols
References