Skip to content

Commit 738c19f

Browse files
authored
closes bpo-33376: Update to Unicode 12.0.0. (GH-12256)
1 parent 0411411 commit 738c19f

File tree

10 files changed

+28458
-27552
lines changed

10 files changed

+28458
-27552
lines changed

Doc/library/stdtypes.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -351,7 +351,7 @@ Notes:
351351
The numeric literals accepted include the digits ``0`` to ``9`` or any
352352
Unicode equivalent (code points with the ``Nd`` property).
353353

354-
See http://www.unicode.org/Public/10.0.0/ucd/extracted/DerivedNumericType.txt
354+
See http://www.unicode.org/Public/12.0.0/ucd/extracted/DerivedNumericType.txt
355355
for a complete list of code points with the ``Nd`` property.
356356

357357

Doc/library/unicodedata.rst

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -17,8 +17,8 @@
1717

1818
This module provides access to the Unicode Character Database (UCD) which
1919
defines character properties for all Unicode characters. The data contained in
20-
this database is compiled from the `UCD version 11.0.0
21-
<http://www.unicode.org/Public/11.0.0/ucd>`_.
20+
this database is compiled from the `UCD version 12.0.0
21+
<http://www.unicode.org/Public/12.0.0/ucd>`_.
2222

2323
The module uses the same names and symbols as defined by Unicode
2424
Standard Annex #44, `"Unicode Character Database"
@@ -175,6 +175,6 @@ Examples:
175175

176176
.. rubric:: Footnotes
177177

178-
.. [#] http://www.unicode.org/Public/11.0.0/ucd/NameAliases.txt
178+
.. [#] http://www.unicode.org/Public/12.0.0/ucd/NameAliases.txt
179179
180-
.. [#] http://www.unicode.org/Public/11.0.0/ucd/NamedSequences.txt
180+
.. [#] http://www.unicode.org/Public/12.0.0/ucd/NamedSequences.txt

Doc/reference/lexical_analysis.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -316,7 +316,7 @@ The Unicode category codes mentioned above stand for:
316316
* *Nd* - decimal numbers
317317
* *Pc* - connector punctuations
318318
* *Other_ID_Start* - explicit list of characters in `PropList.txt
319-
<http://www.unicode.org/Public/11.0.0/ucd/PropList.txt>`_ to support backwards
319+
<http://www.unicode.org/Public/12.0.0/ucd/PropList.txt>`_ to support backwards
320320
compatibility
321321
* *Other_ID_Continue* - likewise
322322

Doc/whatsnew/3.8.rst

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -157,6 +157,10 @@ Improved Modules
157157
to cast the result to the desired type: ``OrderedDict(nt._asdict())``.
158158
(Contributed by Raymond Hettinger in :issue:`35864`.)
159159

160+
* The :mod:`unicodedata` module has been upgraded to use the `Unicode 12.0.0
161+
<http://blog.unicode.org/2019/03/announcing-unicode-standard-version-120.html>`_
162+
release.
163+
160164

161165
asyncio
162166
-------

Lib/test/test_unicodedata.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@
2020
class UnicodeMethodsTest(unittest.TestCase):
2121

2222
# update this, if the database changes
23-
expectedchecksum = '97a41f208c53d5e08c77c1175187e95386b82b6f'
23+
expectedchecksum = '9129d6f2bdf008a81c2476e5b5127014a62130c1'
2424

2525
def test_method_checksum(self):
2626
h = hashlib.sha1()
@@ -80,7 +80,7 @@ class UnicodeFunctionsTest(UnicodeDatabaseTest):
8080

8181
# Update this if the database changes. Make sure to do a full rebuild
8282
# (e.g. 'make distclean && make') to get the correct checksum.
83-
expectedchecksum = '4f73278b19c2ec3099724c132f0b90a1d25c19e4'
83+
expectedchecksum = '4cb02a243aed7c251067386dd738189146fddf94'
8484
def test_function_checksum(self):
8585
data = []
8686
h = hashlib.sha1()
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
Update Unicode databases to version 12.0.0.

Modules/unicodedata_db.h

Lines changed: 1307 additions & 1226 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

Modules/unicodename_db.h

Lines changed: 26070 additions & 25405 deletions
Large diffs are not rendered by default.

Objects/unicodetype_db.h

Lines changed: 1067 additions & 912 deletions
Large diffs are not rendered by default.

Tools/unicode/makeunicodedata.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -41,7 +41,7 @@
4141
# * Doc/library/stdtypes.rst, and
4242
# * Doc/library/unicodedata.rst
4343
# * Doc/reference/lexical_analysis.rst (two occurrences)
44-
UNIDATA_VERSION = "11.0.0"
44+
UNIDATA_VERSION = "12.0.0"
4545
UNICODE_DATA = "UnicodeData%s.txt"
4646
COMPOSITION_EXCLUSIONS = "CompositionExclusions%s.txt"
4747
EASTASIAN_WIDTH = "EastAsianWidth%s.txt"

0 commit comments

Comments
 (0)
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy