Jump to content

Zero-width space: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
m Updated links.
Line 3: Line 3:


== Usage ==
== Usage ==
In [[HTML]] pages, the zero-width space can be used to mark a potential line break ''without'' [[hyphenation]], as can the HTML element [[HTML element#wbr|<code>&lt;wbr></code>]]; for [[Syllabification|hyphenated line breaks]], a [[soft hyphen]] is used. The zero-width space was not supported in some older [[web browser]]s.<ref>{{cite web|url=http://dunae.ca/2009/better-web-typography-with-spaces-and-hyphens/ |first=Alex|last=Dunae |title=Better Web Typography with Spaces and Hyphens |work=dunae.ca |access-date=December 3, 2009 |archive-url=https://web.archive.org/web/20101214223741/http://dunae.ca/2009/better-web-typography-with-spaces-and-hyphens/ |archive-date=December 14, 2010}}</ref>
In [[HTML]] pages, the zero-width space can be used to mark a potential line break ''without'' [[syllabification|hyphenation]], as can the HTML element [[HTML element#wbr|<code>&lt;wbr></code>]]; for hyphenated line breaks, a [[soft hyphen]] is used. The zero-width space was not supported in some older [[web browser]]s.<ref>{{cite web|url=http://dunae.ca/2009/better-web-typography-with-spaces-and-hyphens/ |first=Alex|last=Dunae |title=Better Web Typography with Spaces and Hyphens |work=dunae.ca |access-date=December 3, 2009 |archive-url=https://web.archive.org/web/20101214223741/http://dunae.ca/2009/better-web-typography-with-spaces-and-hyphens/ |archive-date=December 14, 2010}}</ref>


To show the effect of the zero-width space, the following words have been separated with zero-width spaces:
To show the effect of the zero-width space, the following words have been separated with zero-width spaces:

Revision as of 14:06, 20 January 2022

The zero-width space (), abbreviated ZWSP, is a non-printing character used in computerized typesetting to indicate word boundaries to text-processing systems in scripts that do not use explicit spacing, or after characters (such as the slash) that are not followed by a visible space but after which there may nevertheless be a line break. It is also used with languages without visible space between words, for example, Japanese.[1] Normally, it is not a visible separation, but it may expand in passages that are fully justified.[2]

Usage

In HTML pages, the zero-width space can be used to mark a potential line break without hyphenation, as can the HTML element <wbr>; for hyphenated line breaks, a soft hyphen is used. The zero-width space was not supported in some older web browsers.[3]

To show the effect of the zero-width space, the following words have been separated with zero-width spaces:

Lorem​Ipsum​Dolor​Sit​Amet​Consectetur​Adipiscing​Elit​Sed​Do​Eiusmod​Tempor​Incididunt​Ut​Labore​Et​Dolore​Magna​Aliqua​Ut​Enim​Ad​Minim​Veniam​Quis​Nostrud​Exercitation​Ullamco​Laboris​Nisi​Ut​Aliquip​Ex​Ea​Commodo​Consequat​Duis​Aute​Irure​Dolor​In​Reprehenderit​In​Voluptate​Velit​Esse​Cillum​Dolore​Eu​Fugiat​Nulla​Pariatur​Excepteur​Sint​Occaecat​Cupidatat​Non​Proident​Sunt​In​Culpa​Qui​Officia​Deserunt​Mollit​Anim​Id​Est​Laborum

And the following words are not separated with these spaces:

LoremIpsumDolorSitAmetConsecteturAdipiscingElitSedDoEiusmodTemporIncididuntUtLaboreEtDoloreMagnaAliquaUtEnimAdMinimVeniamQuisNostrudExercitationUllamcoLaborisNisiUtAliquipExEaCommodoConsequatDuisAuteIrureDolorInReprehenderitInVoluptateVelitEsseCillumDoloreEuFugiatNullaPariaturExcepteurSintOccaecatCupidatatNonProidentSuntInCulpaQuiOfficiaDeseruntMollitAnimIdEstLaborum

On browsers supporting zero-width spaces, resizing the window will re-break the first text only at word boundaries, while the second text will not be broken at all.

Prohibited in URLs

ICANN rules prohibit domain names from including non-displayed characters such as zero-width space, and most browsers prohibit their use within domain names because they can be used to create a homograph attack, where a malicious URL is visually indistinguishable from a legitimate one.[4][5]

Encoding

The zero-width space character is encoded in Unicode as U+200B ZERO WIDTH SPACE (HTML &#8203;  ·  &#x200B;  ·  &ZeroWidthSpace;  ·  &NegativeVeryThinSpace;).[6]

The TeX representation is \hskip0pt; the LaTeX representation is \hspace{0pt};[7] and the groff representation is \:.[8]

Its semantics and HTML implementation are similar to the soft hyphen, except that soft hyphens display a hyphen character at the point where the line is broken.

See also

References

Citations

  1. ^ "Zones spéciales et caractères de formatage" [Special areas and formatting characters] (PDF). Hapax Quebec (in French). p. 3. Archived from the original (PDF) on 27 December 2005. Retrieved 31 July 2019. Les espaces sans chasse sont conçues pour les langues qui ne séparent pas les mots à l'aide d'espaces visibles, comme le thaï ou le japonais.
  2. ^ The Unicode Standard 6.1, p. 366
  3. ^ Dunae, Alex. "Better Web Typography with Spaces and Hyphens". dunae.ca. Archived from the original on December 14, 2010. Retrieved December 3, 2009.
  4. ^ "Network.IDN.blacklist_chars". mozillaZine. Retrieved 2018-02-07.
  5. ^ "Unicode Character 'Zero Width Space'". FileFormat.Info. Retrieved 2018-02-07.
  6. ^ "General Punctuation – Unicode" (PDF). Retrieved 2013-07-20.
  7. ^ "The LaTeX Companion. Chapter 3: Basic Formatting Tools" (PDF). Retrieved 2019-07-16.
  8. ^ "groff(7) – Linux manual page". Retrieved 2014-02-08.

Sources