0% found this document useful (0 votes)
18 views7 pages

General Punctuation: The Unicode Standard, Version 16.0

This document provides an excerpt from the character code tables and names for The Unicode Standard, Version 16.0, covering general punctuation characters. It includes links to additional resources for errata, character charts, and usage guidelines, along with a disclaimer about the limitations of the charts. The document also outlines terms of use and copyright information related to the Unicode Standard.

Uploaded by

alipezeshk
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views7 pages

General Punctuation: The Unicode Standard, Version 16.0

This document provides an excerpt from the character code tables and names for The Unicode Standard, Version 16.0, covering general punctuation characters. It includes links to additional resources for errata, character charts, and usage guidelines, along with a disclaimer about the limitations of the charts. The document also outlines terms of use and copyright information related to the Unicode Standard.

Uploaded by

alipezeshk
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

General Punctuation

Range: 2000–206F

This file contains an excerpt from the character code tables and list of character names for
The Unicode Standard, Version 16.0

This file may be changed at any time without notice to reflect errata, or other updates to the Unicode Standard.
See https://www.unicode.org/errata/ for an up-to-date list of errata.

See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See
https://www.unicode.org/charts/PDF/Unicode-16.0/ for charts showing only the characters added in Unicode 16.0. See
https://www.unicode.org/Public/16.0.0/charts/ for a complete archived file of character code charts for Unicode 16.0. See
https://www.unicode.org/charts/About.html#Conventions for conventions used in these code charts, and other general
information.

Disclaimer
These charts are provided as the online reference to the character contents of the Unicode Standard, Version 16.0 but do
not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete
understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode
Standard, Version 16.0, online at https://www.unicode.org/versions/Unicode16.0.0/, as well as the Unicode Standard
Annexes, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available
online.
See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/

A thorough understanding of the information contained in these additional sources is required for a successful
implementation.

Fonts
The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be
expected in actual fonts.
See https://www.unicode.org/charts/fonts.html for a list.
Terms of Use
© 1991–2024 Unicode, Inc. This publication is protected by copyright, and permission must be obtained from Unicode,
Inc. prior to any reproduction, modification, or other use not permitted by the Terms of Use
(https://www.unicode.org/copyright.html). Specifically, you may make copies of this publication and may annotate and
translate it solely for personal or internal business purposes and not for public distribution, provided that any such
permitted copies and modifications fully reproduce all copyright and other legal notices contained in the original. You
may not make copies of or modifications to this publication for public distribution, or incorporate it in whole or in part
into any product or publication without the express written permission of Unicode.

The Unicode Consortium specifically grants ISO a license to produce such code charts with their associated character
names list to show the repertoire of characters for that standard, as a normatively referenced, integral part of that
standard.

Unicode uses most fonts under restricted license from the original font owner. You may not extract, copy, modify, or
distribute fonts or font data from any Unicode Products, including this publication, without license from the font owner.

Use of all Unicode Products, including this publication, is governed by the Unicode Terms of Use
(https://www.unicode.org/copyright.html). The authors, contributors, and publishers have taken care in the preparation of
this publication, but make no express or implied representation or warranty of any kind and assume no responsibility or
liability for errors or omissions or for consequential or incidental damages that may arise therefrom. This publication is
provided “AS-IS” without charge as a convenience to users.

Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries.
2000 General Punctuation 206F

200 201 202 203 204 205 206

0  ‐ † ‰ ⁀ q 
2000 2010 2020 2030 2040 2050 2060

1  ‡ ‱ ⁁ r 
2001 2011 2021 2031 2041 2051 2061

2  ‒ • ′ ⁂ s 
2002 2012 2022 2032 2042 2052 2062

3  – ‣ ″ ⁃ ⁓ 
2003 2013 2023 2033 2043 2053 2063

4  — ․ ‴ ⁄ ⁔
2004 2014 2024 2034 2044 2054 2064

5 ― ‥ ‵ ⁅ ⁕
2005 2015 2025 2035 2045 2055

6  ‖ … ‶ ⁆ ⁖ 
2006 2016 2026 2036 2046 2056 2066

7  ‗ ‧ ‷ n t 
2007 2017 2027 2037 2047 2057 2067

8  ‘  ‸ ⁈ ⁘ 
2008 2018 2028 2038 2048 2058 2068

9  ’  ‹ ⁉ ⁙ 
2009 2019 2029 2039 2049 2059 2069

A  ‚  › ⁊ ⁚ 
200A 201A 202A 203A 204A 205A 206A

B  ‛  ※ ⁋ ⁛ 
200B 201B 202B 203B 204B 205B 206B

C  “  ‼ ⁌ ⁜ 
200C 201C 202C 203C 204C 205C 206C

D  ”  ‽ ⁍ ⁝ 
200D 201D 202D 203D 204D 205D 206D

E  „  ‾ o ⁞ 
200E 201E 202E 203E 204E 205E 206E

F  ‟  ‿ p 
200F 201F 202F 203F 204F 205F 206F

216 The Unicode Standard, Version 16.0, Copyright © 1991-2024 Unicode, Inc. All rights reserved.
2000 General Punctuation 201B

For additional general punctuation characters see also Basic Dashes


Latin, Latin-1, Supplemental Punctuation and CJK Symbols 2010 ‐ HYPHEN
and Punctuation. → 002D - hyphen-minus
Spaces → 00AD  soft hyphen
2000  EN QUAD 2011  NON-BREAKING HYPHEN
≡ 2002  en space ≈ <noBreak> 2010 ‐
2001  EM QUAD 2012 ‒ FIGURE DASH
= mutton quad 2013 – EN DASH
≡ 2003  em space 2014 — EM DASH
2002  EN SPACE • may be used in pairs to offset parenthetical text
= nut → 2E3A ⸺ two-em dash
• half an em → 30FC ー katakana-hiragana prolonged sound
≈ 0020  space mark
2003  EM SPACE 2015 ― HORIZONTAL BAR
= mutton = quotation dash
• nominally, a space equal to the type size in • long dash introducing quoted text
points General punctuation
• may scale by the condensation factor of a font 2016 ‖ DOUBLE VERTICAL LINE
≈ 0020  space • used in pairs to indicate norm of a matrix
2004  THREE-PER-EM SPACE → 20E6  combining double vertical stroke
= thick space overlay
≈ 0020  space → 2225 ∥ parallel to
2005  FOUR-PER-EM SPACE → 23F8  double vertical bar
= mid space 2017 ‗ DOUBLE LOW LINE
≈ 0020  space • this is a spacing character
2006  SIX-PER-EM SPACE → 005F _ low line
• in computer typography sometimes equated → 0333 $̳ combining double low line
to thin space
≈ 0020  0333 $̳
≈ 0020  space
2007  FIGURE SPACE Quotation marks and apostrophe
• space equal to tabular width of a font Use of quotation marks differs by language. The character
• this is equivalent to the digit width of fonts names cannot reflect actual usage for all languages.
with fixed-width digits 2018 ‘ LEFT SINGLE QUOTATION MARK
≈ <noBreak> 0020  = single turned comma quotation mark
2008  PUNCTUATION SPACE • this is the preferred character (as opposed to
• space equal to narrow punctuation of a font 201B ‛ )
≈ 0020  space → 0027 ' apostrophe
2009  THIN SPACE → 02BB ʻ modifier letter turned comma
• a fifth of an em (or sometimes a sixth) → 275B ❛ heavy single turned comma quotation
→ 202F  narrow no-break space mark ornament
≈ 0020  space ⁓ 2018 FE00 ‘ non-fullwidth form
200A  HAIR SPACE ⁓ 2018 FE01 ‘ right-justified fullwidth form
• thinner than a thin space 2019 ’ RIGHT SINGLE QUOTATION MARK
• in traditional typography, the thinnest space = single comma quotation mark
available • this is the preferred character to use for
≈ 0020  space apostrophe
→ 0027 ' apostrophe
Format characters
→ 02BC ʼ modifier letter apostrophe
200B  ZERO WIDTH SPACE → 275C ❜ heavy single comma quotation mark
• commonly abbreviated ZWSP ornament
• this character is intended for invisible word ⁓ 2019 FE00 ’ non-fullwidth form
separation and for line break control; it has no ⁓ 2019 FE01 ’ left-justified fullwidth form
width, but its presence between two characters
does not prevent increased letter spacing in 201A ‚ SINGLE LOW-9 QUOTATION MARK
justification = low single comma quotation mark
200C  ZERO WIDTH NON-JOINER • used as opening single quotation mark in some
languages
• commonly abbreviated ZWNJ
201B ‛ SINGLE HIGH-REVERSED-9 QUOTATION MARK
200D  ZERO WIDTH JOINER
= single reversed comma quotation mark
• commonly abbreviated ZWJ • has same semantic as 2018 ‘ , but differs in
200E  LEFT-TO-RIGHT MARK appearance
• commonly abbreviated LRM → 02BD ʽ modifier letter reversed comma
200F  RIGHT-TO-LEFT MARK
• commonly abbreviated RLM
→ 061C  arabic letter mark

The Unicode Standard, Version 16.0, Copyright © 1991-2024 Unicode, Inc. All rights reserved. 217
201C General Punctuation 2035

201C “ LEFT DOUBLE QUOTATION MARK 2027 ‧ HYPHENATION POINT


= double turned comma quotation mark • visible symbol used to indicate correct
• this is the preferred character (as opposed to positions for word breaking, as in dic·tion·ar·ies
201F ‟ ) Separators
→ 0022 " quotation mark
2028  LINE SEPARATOR
→ 275D ❝ heavy double turned comma
quotation mark ornament • may be used to represent this semantic
unambiguously
→ 301D 〝 reversed double prime quotation
mark 2029  PARAGRAPH SEPARATOR
⁓ 201C FE00 “ non-fullwidth form • may be used to represent this semantic
unambiguously
⁓ 201C FE01 “ right-justified fullwidth form
201D ” RIGHT DOUBLE QUOTATION MARK Format characters
= double comma quotation mark 202A  LEFT-TO-RIGHT EMBEDDING
→ 0022 " quotation mark • commonly abbreviated LRE
→ 2033 ″ double prime 202B  RIGHT-TO-LEFT EMBEDDING
→ 275E ❞ heavy double comma quotation mark • commonly abbreviated RLE
ornament 202C  POP DIRECTIONAL FORMATTING
→ 301E 〞 double prime quotation mark • commonly abbreviated PDF
⁓ 201D FE00 ” non-fullwidth form 202D  LEFT-TO-RIGHT OVERRIDE
⁓ 201D FE01 ” left-justified fullwidth form • commonly abbreviated LRO
201E „ DOUBLE LOW-9 QUOTATION MARK 202E  RIGHT-TO-LEFT OVERRIDE
= low double comma quotation mark
• commonly abbreviated RLO
• used as opening double quotation mark in
some languages Space
→ 2E42 ⹂ double low-reversed-9 quotation 202F  NARROW NO-BREAK SPACE
mark • commonly abbreviated NNBSP
→ 301F 〟 low double prime quotation mark • a narrow form of a no-break space, typically the
201F ‟ DOUBLE HIGH-REVERSED-9 QUOTATION MARK width of a thin space or a mid space
= double reversed comma quotation mark → 00A0  no-break space
• has same semantic as 201C “ , but differs in → 2005  four-per-em space
appearance → 2009  thin space
General punctuation ≈ <noBreak> 0020 
2020 † DAGGER General punctuation
= obelisk, long cross, oblong cross 2030 ‰ PER MILLE SIGN
→ 2E38 ⸸ turned dagger = permille, per thousand
2021 ‡ DOUBLE DAGGER • used, for example, in measures of blood alcohol
= diesis, double obelisk content, salinity, etc.
→ 2E4B ⹋ triple dagger → 0025 % percent sign
2022 • BULLET → 0609 ؉ arabic-indic per mille sign
= black small circle 2031 ‱ PER TEN THOUSAND SIGN
→ 00B7 · middle dot = permyriad
→ 2024 ․ one dot leader • percent of a percent, rarely used
→ 2219 ∙ bullet operator → 0025 % percent sign
→ 25D8 ◘ inverse bullet → 060A ؊ arabic-indic per ten thousand sign
→ 25E6 ◦ white bullet 2032 ′ PRIME
2023 ‣ TRIANGULAR BULLET = minutes, feet
→ 220E ∎ end of proof → 0027 ' apostrophe
→ 25B8 ▸ black right-pointing small triangle → 00B4 ´ acute accent
2024 ․ ONE DOT LEADER → 02B9 ʹ modifier letter prime
• also used as an Armenian semicolon (mijaket) 2033 ″ DOUBLE PRIME
→ 00B7 · middle dot = seconds, inches
→ 2022 • bullet → 0022 " quotation mark
→ 2219 ∙ bullet operator → 02BA ʺ modifier letter double prime
≈ 002E . full stop → 201D ” right double quotation mark
2025 ‥ TWO DOT LEADER → 3003 〃 ditto mark
≈ 002E . 002E . → 301E 〞 double prime quotation mark
2026 … HORIZONTAL ELLIPSIS ≈ 2032 ′ 2032 ′
= three dot leader 2034 ‴ TRIPLE PRIME
→ 22EE ⋮ vertical ellipsis = lines (old measure, 1/12 of an inch)
→ FE19 ︙ presentation form for vertical ≈ 2032 ′ 2032 ′ 2032 ′
horizontal ellipsis 2035 ‵ REVERSED PRIME
≈ 002E . 002E . 002E . → 0060 ` grave accent

218 The Unicode Standard, Version 16.0, Copyright © 1991-2024 Unicode, Inc. All rights reserved.
2036 General Punctuation 2056

2036 ‶ REVERSED DOUBLE PRIME Brackets


→ 301D 〝 reversed double prime quotation 2045 ⁅ LEFT SQUARE BRACKET WITH QUILL
mark → 2E20 ⸠ left vertical bar with quill
≈ 2035 ‵ 2035 ‵ → 2E55 ⹕ left square bracket with stroke
2037 ‷ REVERSED TRIPLE PRIME 2046 ⁆ RIGHT SQUARE BRACKET WITH QUILL
≈ 2035 ‵ 2035 ‵ 2035 ‵
2038 ‸ CARET Double punctuation for vertical text
→ 2303 ⌃ up arrowhead 2047 n DOUBLE QUESTION MARK
→ A788 ꞈ modifier letter low circumflex accent ≈ 003F ? 003F ?
2048 ⁈ QUESTION EXCLAMATION MARK
Quotation marks
≈ 003F ? 0021 !
2039 ‹ SINGLE LEFT-POINTING ANGLE QUOTATION 2049 ⁉ EXCLAMATION QUESTION MARK
MARK
= left pointing single guillemet ≈ 0021 ! 003F ?
• usually opening, sometimes closing General punctuation
→ 003C < less-than sign 204A ⁊ TIRONIAN SIGN ET
→ 2329 〈 left-pointing angle bracket • Irish Gaelic, Old English, ...
→ 3008 〈 left angle bracket → 0026 & ampersand
203A › SINGLE RIGHT-POINTING ANGLE QUOTATION → 2E52 ⹒ tironian sign capital et
MARK → 1F670 script ligature et ornament
= right pointing single guillemet 204B ⁋ REVERSED PILCROW SIGN
• usually closing, sometimes opening → 00B6 ¶ pilcrow sign
→ 003E > greater-than sign → 2E4D ⹍ paragraphus mark
→ 232A 〉 right-pointing angle bracket 204C ⁌ BLACK LEFTWARDS BULLET
→ 3009 〉 right angle bracket 204D ⁍ BLACK RIGHTWARDS BULLET
General punctuation 204E o LOW ASTERISK
203B ※ REFERENCE MARK → 002A * asterisk
= Japanese kome → 0359 $ combining asterisk below
= Urdu paragraph separator 204F p REVERSED SEMICOLON
→ 0FBF ē tibetan ku ru kha bzhi mig can • used occasionally in Sindhi when Sindhi is
→ 200AD ď written in the Arabic script
Double punctuation for vertical text → 003B ; semicolon
→ 061B ‫ ؛‬arabic semicolon
203C ‼ DOUBLE EXCLAMATION MARK 2050 q CLOSE UP
→ 0021 ! exclamation mark • editing mark
≈ 0021 ! 0021 ! → AB5B ꭛ modifier breve with inverted breve
General punctuation 2051 r TWO ASTERISKS ALIGNED VERTICALLY
203D ‽ INTERROBANG 2052 s COMMERCIAL MINUS SIGN
→ 0021 ! exclamation mark = abzüglich (German), med avdrag av (Swedish),
→ 003F ? question mark piska (Swedish, "whip")
→ 2E18 ⸘ inverted interrobang • a common glyph variant and fallback
→ 1F679 heavy interrobang ornament representation looks like ./.
203E ‾ OVERLINE • may also be used as a dingbat to indicate
correctness
= spacing overscore
• used in Finno-Ugric Phonetic Alphabet to
≈ 0020  0305 $̅ indicate a related borrowed form with different
203F ‿ UNDERTIE sound
= Greek enotikon → 0025 % percent sign
→ 2323 ⌣ smile → 066A ٪ arabic percent sign
2040 ⁀ CHARACTER TIE → 00F7 ÷ division sign
= z notation sequence concatenation 2053 ⁓ SWUNG DASH
→ 2322 ⌢ frown → 007E ~ tilde
2041 ⁁ CARET INSERTION POINT 2054 ⁔ INVERTED UNDERTIE
• proofreader’s mark: insert here 2055 ⁕ FLOWER PUNCTUATION MARK
→ 22CC ⋌ right semidirect product = phul, puspika
2042 ⁂ ASTERISM • used as a punctuation mark with Syloti Nagri,
2043 ⁃ HYPHEN BULLET Bengali and other Indic scripts
→ 002D - hyphen-minus → 274B ❋ heavy eight teardrop-spoked
2044 ⁄ FRACTION SLASH propeller asterisk
= solidus (in typography) Archaic punctuation
• for composing arbitrary fractions
→ 002F / solidus 2056 ⁖ THREE DOT PUNCTUATION
→ 2215 ∕ division slash → 10FB ჻ georgian paragraph separator

The Unicode Standard, Version 16.0, Copyright © 1991-2024 Unicode, Inc. All rights reserved. 219
2057 General Punctuation 206F

General punctuation 2064  INVISIBLE PLUS


2057 t QUADRUPLE PRIME • contiguity operator indicating addition
≈ 2032 ′ 2032 ′ 2032 ′ 2032 ′ Format characters
Archaic punctuation 2066  LEFT-TO-RIGHT ISOLATE
See also historic punctuation with multiple dots in the range • commonly abbreviated LRI
2E2A-2E2D. 2067  RIGHT-TO-LEFT ISOLATE
2058 ⁘ FOUR DOT PUNCTUATION • commonly abbreviated RLI
2059 ⁙ FIVE DOT PUNCTUATION 2068  FIRST STRONG ISOLATE
= Greek pentonkion • commonly abbreviated FSI
= quincunx 2069  POP DIRECTIONAL ISOLATE
→ 2684 ĺ die face-5 • commonly abbreviated PDI
205A ⁚ TWO DOT PUNCTUATION Deprecated
• historically used to indicate the end of a Use of these characters is strongly discouraged.
sentence or change of speaker
• extends from baseline to cap height 206A  INHIBIT SYMMETRIC SWAPPING
→ FE30 ︰ presentation form for vertical two 206B  ACTIVATE SYMMETRIC SWAPPING
dot leader 206C  INHIBIT ARABIC FORM SHAPING
→ 1015B 𐅛 greek acrophonic epidaurean two 206D  ACTIVATE ARABIC FORM SHAPING
205B ⁛ FOUR DOT MARK 206E  NATIONAL DIGIT SHAPES
• used by scribes in the margin as highlighter 206F  NOMINAL DIGIT SHAPES
mark
• this is centered on the line, but extends beyond
top and bottom of the line
205C ⁜ DOTTED CROSS
• used by scribes in the margin as highlighter
mark
205D ⁝ TRICOLON
= Epidaurean acrophonic symbol three
→ 22EE ⋮ vertical ellipsis
→ 2AF6 ⫶ triple colon operator
→ FE19 ︙ presentation form for vertical
horizontal ellipsis
205E ⁞ VERTICAL FOUR DOTS
• used in dictionaries to indicate legal but
undesirable word break
• glyph extends the whole height of the line
→ 2E3D ⸽ vertical six dots
Space
205F  MEDIUM MATHEMATICAL SPACE
• abbreviated MMSP
• four-eighteenths of an em
≈ 0020  space
Format character
2060  WORD JOINER
• commonly abbreviated WJ
• a zero width non-breaking space (only)
• intended for disambiguation of functions for
byte order mark
→ FEFF  zero width no-break space
Invisible operators
2061  FUNCTION APPLICATION
• contiguity operator indicating application of a
function
2062  INVISIBLE TIMES
• contiguity operator indicating multiplication
2063  INVISIBLE SEPARATOR
= invisible comma
• contiguity operator indicating that adjacent
mathematical symbols form a list, e.g. when no
visible comma is used between multiple
indices

220 The Unicode Standard, Version 16.0, Copyright © 1991-2024 Unicode, Inc. All rights reserved.
2018 General Punctuation 201D

Standardized Variation Sequences


2018
‘ LEFT SINGLE QUOTATION MARK
2018

‘ non-fullwidth form
2018 FE00

‘ right-justified fullwidth form


2018 FE01
2019
’ RIGHT SINGLE QUOTATION MARK
2019

’ non-fullwidth form
2019 FE00

’ left-justified fullwidth form


2019 FE01
201C
“ LEFT DOUBLE QUOTATION MARK
201C

“ non-fullwidth form
201C FE00

“ right-justified fullwidth form


201C FE01
201D
” RIGHT DOUBLE QUOTATION MARK
201D

” non-fullwidth form
201D FE00

” left-justified fullwidth form


201D FE01

The Unicode Standard, Version 16.0, Copyright © 1991-2024 Unicode, Inc. All rights reserved. 221

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy