Commit c8c0b9f

Update to Unicode 15.0.0
The DerivedBidiClass.txt format (among others) has changed, with multiple @missing lines for a single property now being used. This means that the text file itself has many fewer explicit entries, and consequently the :bidi-class test (if left alone) would be substantially less rigorous. Adapt the test to keep track of which characters have been checked already; parse the @missing lines; and use a separate test to check the un-tested characters against the specifications in the parsed @missing lines.
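The approach described above can be sketched roughly as follows. This is an illustrative sketch only, not the code from this commit: the names `parse-missing-line`, `missing-value` and `untested-code-points` are hypothetical, and it assumes @missing lines of the form `# @missing: 0590..05FF; Right_To_Left`, with later lines overriding earlier ones where ranges overlap.

```lisp
;;; Hypothetical helpers for illustration; not taken from the SBCL test suite.

(defun parse-missing-line (line)
  "Return (values START END VALUE) for an @missing range line such as
\"# @missing: 0590..05FF; Right_To_Left\", or NIL for any other line."
  (let* ((prefix "# @missing: ")
         (plen (length prefix)))
    (when (and (> (length line) plen)
               (string= prefix line :end2 plen))
      (let ((dots (search ".." line :start2 plen))
            (semi (position #\; line :start plen)))
        ;; Only handle the START..END; VALUE shape assumed above.
        (when (and dots semi (< dots semi))
          (values (parse-integer line :start plen :end dots :radix 16)
                  (parse-integer line :start (+ dots 2) :end semi :radix 16)
                  (string-trim " " (subseq line (1+ semi)))))))))

(defun missing-value (code-point ranges)
  "RANGES is a list of (START END VALUE) in file order.  Search from the
end so that a later range takes precedence over an earlier one."
  (third (find-if (lambda (range)
                    (<= (first range) code-point (second range)))
                  ranges
                  :from-end t)))

(defun untested-code-points (checked)
  "Return the indices of CHECKED (a bit vector) whose bit is still 0,
i.e. the code points that had no explicit entry in the data file."
  (loop for cp from 0 below (length checked)
        when (zerop (bit checked cp))
          collect cp))

;; Usage: two @missing lines, the second narrowing the first.
(defparameter *ranges*
  (loop for line in '("# @missing: 0000..10FFFF; Left_To_Right"
                      "# @missing: 0590..05FF; Right_To_Left")
        collect (multiple-value-list (parse-missing-line line))))

(missing-value #x05D0 *ranges*) ; => "Right_To_Left"
(missing-value #x0041 *ranges*) ; => "Left_To_Right"
```

In a test along these lines, the first pass would set a bit for every code point it checks explicitly, and a second pass would compare each remaining code point against `missing-value`.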
1 parent 558c74c commit c8c0b9f

34 files changed: +34158 -64138 lines changed

NEWS

Lines changed: 3 additions & 3 deletions
@@ -1,9 +1,9 @@
 ;;;; -*- coding: utf-8; fill-column: 78 -*-

 changes relative to sbcl-2.3.4:
-* enhancement: Unicode support has been updated to support version 14.0.0 of
-the Unicode standard, including addition of characters and refinements to
-grapheme-, word- and line-breaking algorithms.
+* enhancement: Unicode support has been updated to support version 15.0.0 of
+the Unicode standard, including addition of characters and their collation
+keys, and refinements to grapheme-, word- and line-breaking algorithms.

 changes in sbcl-2.3.4 relative to sbcl-2.3.3:
 * enhancement: attempts to call non-callable objects can now provide

doc/internals-notes/updating-unicode

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-Updating Unicode versions (notes on the 10.0.0-13.0.0 updates):
+Updating Unicode versions (notes on the 10.0.0-15.0.0 updates):

 1. Download ucd.zip for the version to update to, along with UCA's
 allkeys.txt and CollationTest.zip, and confusables.txt from

src/code/target-unicode.lisp

Lines changed: 1 addition & 1 deletion
@@ -827,7 +827,7 @@ The result is not guaranteed to have the same length as the input."
 ((<= #x1F1E6 cp #x1F1FF) :regional-indicator)
 ((or
 ;; Consonant_Preceding_Repha
-(= cp #x0D4E) (= cp #x11941) (= cp #x11D46)
+(= cp #x0D4E) (= cp #x11941) (= cp #x11D46) (= cp #x11F02)
 ;; Consonant_Prefixed
 (<= #x111C2 cp #x111C3) (= cp #x1193F) (= cp #x11A3A) (<= #x11A84 cp #x11A89)
 (proplist-p char :prepended-concatenation-mark))
