X-Git-Url: https://gitweb.factorcode.org/gitweb.cgi?p=factor.git;a=blobdiff_plain;f=basis%2Funicode%2FUCD%2Fextracted%2FDerivedLineBreak.txt;h=7de7dba87dd755636bd3149fe9ea604c8d79d7c8;hp=2504c3f7e184b7c5af34790e6b7e7b26ac31a5fa;hb=e3f197c3bbd776e9bb83d7fa8598687a8842d0b6;hpb=631f909b7c6544e3391bdecb4139e7e2015ae69e
diff --git a/basis/unicode/UCD/extracted/DerivedLineBreak.txt b/basis/unicode/UCD/extracted/DerivedLineBreak.txt
index 2504c3f7e1..7de7dba87d 100644
--- a/basis/unicode/UCD/extracted/DerivedLineBreak.txt
+++ b/basis/unicode/UCD/extracted/DerivedLineBreak.txt
@@ -1,11 +1,11 @@
-# DerivedLineBreak-14.0.0.txt
-# Date: 2021-07-10, 00:35:09 GMT
-# © 2021 Unicode®, Inc.
+# DerivedLineBreak-15.0.0.txt
+# Date: 2022-08-05, 17:39:33 GMT
+# © 2022 Unicode®, Inc.
 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
-# For terms of use, see http://www.unicode.org/terms_of_use.html
+# For terms of use, see https://www.unicode.org/terms_of_use.html
 #
 # Unicode Character Database
-# For documentation, see http://www.unicode.org/reports/tr44/
+# For documentation, see https://www.unicode.org/reports/tr44/
 # ================================================
@@ -16,6 +16,49 @@
 # @missing: 0000..10FFFF; Unknown
+# 20A0..20CF Currency_Symbols
+# @missing: 20A0..20CF; Prefix_Numeric
+
+# 3400..4DBF CJK_Unified_Ideographs_Extension_A
+# @missing: 3400..4DBF; Ideographic
+
+# 4E00..9FFF CJK_Unified_Ideographs
+# @missing: 4E00..9FFF; Ideographic
+
+# F900..FAFF CJK_Compatibility_Ideographs
+# @missing: F900..FAFF; Ideographic
+
+# 1F000..1F02F Mahjong_Tiles
+# 1F030..1F09F Domino_Tiles
+# 1F0A0..1F0FF Playing_Cards
+# 1F100..1F1FF Enclosed_Alphanumeric_Supplement
+# 1F200..1F2FF Enclosed_Ideographic_Supplement
+# 1F300..1F5FF Miscellaneous_Symbols_And_Pictographs
+# 1F600..1F64F Emoticons
+# 1F650..1F67F Ornamental_Dingbats
+# 1F680..1F6FF Transport_And_Map_Symbols
+# 1F700..1F77F Alchemical_Symbols
+# 1F780..1F7FF Geometric_Shapes_Extended
+# 1F800..1F8FF Supplemental_Arrows_C
+# 1F900..1F9FF Supplemental_Symbols_And_Pictographs
+# 1FA00..1FA6F Chess_Symbols
+# 1FA70..1FAFF Symbols_And_Pictographs_Extended_A
+# @missing: 1F000..1FAFF; Ideographic
+
+# @missing: 1FC00..1FFFD; Ideographic
+
+# 20000..2A6DF CJK_Unified_Ideographs_Extension_B
+# 2A700..2B73F CJK_Unified_Ideographs_Extension_C
+# 2B740..2B81F CJK_Unified_Ideographs_Extension_D
+# 2B820..2CEAF CJK_Unified_Ideographs_Extension_E
+# 2CEB0..2EBEF CJK_Unified_Ideographs_Extension_F
+# 2F800..2FA1F CJK_Compatibility_Ideographs_Supplement
+# @missing: 20000..2FFFD; Ideographic
+
+# 30000..3134F CJK_Unified_Ideographs_Extension_G
+# 31350..323AF CJK_Unified_Ideographs_Extension_H
+# @missing: 30000..3FFFD; Ideographic
+
 # ================================================
 # Line_Break=Unknown
@@ -24,8 +67,8 @@ E000..F8FF ; XX # Co [6400] ..
 F0000..FFFFD ; XX # Co [65534] ..
 100000..10FFFD; XX # Co [65534] ..
-# The above property value applies to 762997 code points not listed here.
-# Total code points: 900465
+# The above property value applies to 762730 code points not listed here.
+# Total code points: 900198
 # ================================================
@@ -118,10 +161,12 @@ FF62 ; OP # Ps HALFWIDTH LEFT CORNER BRACKET
 13288 ; OP # Lo EGYPTIAN HIEROGLYPH O036C
 13379 ; OP # Lo EGYPTIAN HIEROGLYPH V011A
 13437 ; OP # Cf EGYPTIAN HIEROGLYPH BEGIN SEGMENT
+1343C ; OP # Cf EGYPTIAN HIEROGLYPH BEGIN ENCLOSURE
+1343E ; OP # Cf EGYPTIAN HIEROGLYPH BEGIN WALLED ENCLOSURE
 145CE ; OP # Lo ANATOLIAN HIEROGLYPH A410 BEGIN LOGOGRAM MARK
 1E95E..1E95F ; OP # Po [2] ADLAM INITIAL EXCLAMATION MARK..ADLAM INITIAL QUESTION MARK
-# Total code points: 92
+# Total code points: 94
 # ================================================
@@ -215,9 +260,11 @@ FF64 ; CL # Po HALFWIDTH IDEOGRAPHIC COMMA
 13289 ; CL # Lo EGYPTIAN HIEROGLYPH O036D
 1337A..1337B ; CL # Lo [2] EGYPTIAN HIEROGLYPH V011B..EGYPTIAN HIEROGLYPH V011C
 13438 ; CL # Cf EGYPTIAN HIEROGLYPH END SEGMENT
+1343D ; CL # Cf EGYPTIAN HIEROGLYPH END ENCLOSURE
+1343F ; CL # Cf EGYPTIAN HIEROGLYPH END WALLED ENCLOSURE
 145CF ; CL # Lo ANATOLIAN HIEROGLYPH A410A END LOGOGRAM MARK
-# Total code points: 95
+# Total code points: 97
 # ================================================
@@ -266,13 +313,16 @@ FF64 ; CL # Po HALFWIDTH IDEOGRAPHIC COMMA
 0F12 ; GL # Po TIBETAN MARK RGYA GRAM SHAD
 0FD9..0FDA ; GL # Po [2] TIBETAN MARK LEADING MCHAN RTAGS..TIBETAN MARK TRAILING MCHAN RTAGS
 180E ; GL # Cf MONGOLIAN VOWEL SEPARATOR
+1DCD ; GL # Mn COMBINING DOUBLE CIRCUMFLEX ABOVE
+1DFC ; GL # Mn COMBINING DOUBLE INVERTED BREVE BELOW
 2007 ; GL # Zs FIGURE SPACE
 2011 ; GL # Pd NON-BREAKING HYPHEN
 202F ; GL # Zs NARROW NO-BREAK SPACE
 13430..13436 ; GL # Cf [7] EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN HIEROGLYPH OVERLAY MIDDLE
+13439..1343B ; GL # Cf [3] EGYPTIAN HIEROGLYPH INSERT AT MIDDLE..EGYPTIAN HIEROGLYPH INSERT AT BOTTOM
 16FE4 ; GL # Mn KHITAN SMALL SCRIPT FILLER
-# Total code points: 26
+# Total code points: 31
 # ================================================
@@ -380,7 +430,6 @@ FE13..FE14 ; IS # Po [2] PRESENTATION FORM FOR VERTICAL COLON..PRESENTATION
 20B7..20BA ; PR # Sc [4] SPESMILO SIGN..TURKISH LIRA SIGN
 20BC..20BD ; PR # Sc [2] MANAT SIGN..RUBLE SIGN
 20BF ; PR # Sc BITCOIN SIGN
-20C1..20CF ; PR # Cn [15] ..
 2116 ; PR # So NUMERO SIGN
 2212..2213 ; PR # Sm [2] MINUS SIGN..MINUS-OR-PLUS SIGN
 FE69 ; PR # Sc SMALL DOLLAR SIGN
@@ -389,6 +438,7 @@ FFE1 ; PR # Sc FULLWIDTH POUND SIGN
 FFE5..FFE6 ; PR # Sc [2] FULLWIDTH YEN SIGN..FULLWIDTH WON SIGN
 1E2FF ; PR # Sc WANCHO NGUN SIGN
+# The above property value applies to 15 code points not listed here.
 # Total code points: 67
 # ================================================
@@ -405,6 +455,7 @@ FFE5..FFE6 ; PR # Sc [2] FULLWIDTH YEN SIGN..FULLWIDTH WON SIGN
 09F9 ; PO # No BENGALI CURRENCY DENOMINATOR SIXTEEN
 0D79 ; PO # So MALAYALAM DATE MARK
 2030..2037 ; PO # Po [8] PER MILLE SIGN..REVERSED TRIPLE PRIME
+2057 ; PO # Po QUADRUPLE PRIME
 20A7 ; PO # Sc PESETA SIGN
 20B6 ; PO # Sc LIVRE TOURNOIS SIGN
 20BB ; PO # Sc NORDIC MARK SIGN
@@ -421,7 +472,7 @@ FFE0 ; PO # Sc FULLWIDTH CENT SIGN
 1ECAC ; PO # So INDIC SIYAQ PLACEHOLDER
 1ECB0 ; PO # Sc INDIC SIYAQ RUPEE MARK
-# Total code points: 37
+# Total code points: 38
 # ================================================
@@ -481,16 +532,18 @@ ABF0..ABF9 ; NU # Nd [10] MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT NINE
 11C50..11C59 ; NU # Nd [10] BHAIKSUKI DIGIT ZERO..BHAIKSUKI DIGIT NINE
 11D50..11D59 ; NU # Nd [10] MASARAM GONDI DIGIT ZERO..MASARAM GONDI DIGIT NINE
 11DA0..11DA9 ; NU # Nd [10] GUNJALA GONDI DIGIT ZERO..GUNJALA GONDI DIGIT NINE
+11F50..11F59 ; NU # Nd [10] KAWI DIGIT ZERO..KAWI DIGIT NINE
 16A60..16A69 ; NU # Nd [10] MRO DIGIT ZERO..MRO DIGIT NINE
 16AC0..16AC9 ; NU # Nd [10] TANGSA DIGIT ZERO..TANGSA DIGIT NINE
 16B50..16B59 ; NU # Nd [10] PAHAWH HMONG DIGIT ZERO..PAHAWH HMONG DIGIT NINE
 1D7CE..1D7FF ; NU # Nd [50] MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL MONOSPACE DIGIT NINE
 1E140..1E149 ; NU # Nd [10] NYIAKENG PUACHUE HMONG DIGIT ZERO..NYIAKENG PUACHUE HMONG DIGIT NINE
 1E2F0..1E2F9 ; NU # Nd [10] WANCHO DIGIT ZERO..WANCHO DIGIT NINE
+1E4F0..1E4F9 ; NU # Nd [10] NAG MUNDARI DIGIT ZERO..NAG MUNDARI DIGIT NINE
 1E950..1E959 ; NU # Nd [10] ADLAM DIGIT ZERO..ADLAM DIGIT NINE
 1FBF0..1FBF9 ; NU # Nd [10] SEGMENTED DIGIT ZERO..SEGMENTED DIGIT NINE
-# Total code points: 652
+# Total code points: 672
 # ================================================
@@ -855,7 +908,6 @@ ABF0..ABF9 ; NU # Nd [10] MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT NINE
 2053 ; AL # Po SWUNG DASH
 2054 ; AL # Pc INVERTED UNDERTIE
 2055 ; AL # Po FLOWER PUNCTUATION MARK
-2057 ; AL # Po QUADRUPLE PRIME
 205C ; AL # Po DOTTED CROSS
 2061..2064 ; AL # Cf [4] FUNCTION APPLICATION..INVISIBLE PLUS
 2070 ; AL # No SUPERSCRIPT ZERO
@@ -1300,6 +1352,7 @@ FFED..FFEE ; AL # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CIRCLE
 11213..1122B ; AL # Lo [25] KHOJKI LETTER NYA..KHOJKI LETTER LLA
 1123A ; AL # Po KHOJKI WORD SEPARATOR
 1123D ; AL # Po KHOJKI ABBREVIATION SIGN
+1123F..11240 ; AL # Lo [2] KHOJKI LETTER QA..KHOJKI LETTER SHORT I
 11280..11286 ; AL # Lo [7] MULTANI LETTER A..MULTANI LETTER GA
 11288 ; AL # Lo MULTANI LETTER GHA
 1128A..1128D ; AL # Lo [4] MULTANI LETTER CA..MULTANI LETTER JJA
@@ -1372,6 +1425,9 @@ FFED..FFEE ; AL # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CIRCLE
 11D98 ; AL # Lo GUNJALA GONDI OM
 11EE0..11EF2 ; AL # Lo [19] MAKASAR LETTER KA..MAKASAR ANGKA
 11EF7..11EF8 ; AL # Po [2] MAKASAR PASSIMBANG..MAKASAR END OF SECTION
+11F02 ; AL # Lo KAWI SIGN REPHA
+11F04..11F10 ; AL # Lo [13] KAWI LETTER A..KAWI LETTER O
+11F12..11F33 ; AL # Lo [34] KAWI LETTER KA..KAWI LETTER JNYA
 11FB0 ; AL # Lo LISU LETTER YHA
 11FC0..11FD4 ; AL # No [21] TAMIL FRACTION ONE THREE-HUNDRED-AND-TWENTIETH..TAMIL FRACTION DOWNSCALING FACTOR KIIZH
 11FD5..11FDC ; AL # So [8] TAMIL SIGN NEL..TAMIL SIGN MUKKURUNI
@@ -1385,7 +1441,8 @@ FFED..FFEE ; AL # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CIRCLE
 1325E..13281 ; AL # Lo [36] EGYPTIAN HIEROGLYPH O007..EGYPTIAN HIEROGLYPH O033
 13283..13285 ; AL # Lo [3] EGYPTIAN HIEROGLYPH O034..EGYPTIAN HIEROGLYPH O036
 1328A..13378 ; AL # Lo [239] EGYPTIAN HIEROGLYPH O037..EGYPTIAN HIEROGLYPH V011
-1337C..1342E ; AL # Lo [179] EGYPTIAN HIEROGLYPH V012..EGYPTIAN HIEROGLYPH AA032
+1337C..1342F ; AL # Lo [180] EGYPTIAN HIEROGLYPH V012..EGYPTIAN HIEROGLYPH V011D
+13441..13446 ; AL # Lo [6] EGYPTIAN HIEROGLYPH FULL BLANK..EGYPTIAN HIEROGLYPH WIDE LOST SIGN
 14400..145CD ; AL # Lo [462] ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLYPH A409
 145D0..14646 ; AL # Lo [119] ANATOLIAN HIEROGLYPH A411..ANATOLIAN HIEROGLYPH A530
 16800..16A38 ; AL # Lo [569] BAMUM LETTER PHASE-A NGKUE MFON..BAMUM LETTER PHASE-F VUEQ
@@ -1425,6 +1482,7 @@ FFED..FFEE ; AL # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CIRCLE
 1D1AE..1D1EA ; AL # So [61] MUSICAL SYMBOL PEDAL MARK..MUSICAL SYMBOL KORON
 1D200..1D241 ; AL # So [66] GREEK VOCAL NOTATION SYMBOL-1..GREEK INSTRUMENTAL NOTATION SYMBOL-54
 1D245 ; AL # So GREEK MUSICAL LEIMMA
+1D2C0..1D2D3 ; AL # No [20] KAKTOVIK NUMERAL ZERO..KAKTOVIK NUMERAL NINETEEN
 1D2E0..1D2F3 ; AL # No [20] MAYAN NUMERAL ZERO..MAYAN NUMERAL NINETEEN
 1D300..1D356 ; AL # So [87] MONOGRAM FOR EARTH..TETRAGRAM FOR FOSTERING
 1D360..1D378 ; AL # No [25] COUNTING ROD UNIT DIGIT ONE..TALLY MARK FIVE
@@ -1477,12 +1535,16 @@ FFED..FFEE ; AL # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CIRCLE
 1DF00..1DF09 ; AL # L& [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
 1DF0A ; AL # Lo LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK
 1DF0B..1DF1E ; AL # L& [20] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN SMALL LETTER S WITH CURL
+1DF25..1DF2A ; AL # L& [6] LATIN SMALL LETTER D WITH MID-HEIGHT LEFT HOOK..LATIN SMALL LETTER T WITH MID-HEIGHT LEFT HOOK
+1E030..1E06D ; AL # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
 1E100..1E12C ; AL # Lo [45] NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PUACHUE HMONG LETTER W
 1E137..1E13D ; AL # Lm [7] NYIAKENG PUACHUE HMONG SIGN FOR PERSON..NYIAKENG PUACHUE HMONG SYLLABLE LENGTHENER
 1E14E ; AL # Lo NYIAKENG PUACHUE HMONG LOGOGRAM NYAJ
 1E14F ; AL # So NYIAKENG PUACHUE HMONG CIRCLED CA
 1E290..1E2AD ; AL # Lo [30] TOTO LETTER PA..TOTO LETTER A
 1E2C0..1E2EB ; AL # Lo [44] WANCHO LETTER AA..WANCHO LETTER YIH
+1E4D0..1E4EA ; AL # Lo [27] NAG MUNDARI LETTER O..NAG MUNDARI LETTER ELL
+1E4EB ; AL # Lm NAG MUNDARI SIGN OJOD
 1E7E0..1E7E6 ; AL # Lo [7] ETHIOPIC SYLLABLE HHYA..ETHIOPIC SYLLABLE HHYO
 1E7E8..1E7EB ; AL # Lo [4] ETHIOPIC SYLLABLE GURAGE HHWA..ETHIOPIC SYLLABLE HHWE
 1E7ED..1E7EE ; AL # Lo [2] ETHIOPIC SYLLABLE GURAGE MWI..ETHIOPIC SYLLABLE GURAGE MWEE
@@ -1560,7 +1622,7 @@ FFED..FFEE ; AL # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CIRCLE
 1FB00..1FB92 ; AL # So [147] BLOCK SEXTANT-1..UPPER HALF INVERSE MEDIUM SHADE AND LOWER HALF BLOCK
 1FB94..1FBCA ; AL # So [55] LEFT HALF INVERSE MEDIUM SHADE AND RIGHT HALF BLOCK..WHITE UP-POINTING CHEVRON
-# Total code points: 22043
+# Total code points: 22215
 # ================================================
@@ -1652,9 +1714,7 @@ FFED..FFEE ; AL # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CIRCLE
 A016..A48C ; ID # Lo [1143] YI SYLLABLE BIT..YI SYLLABLE YYR
 A490..A4C6 ; ID # So [55] YI RADICAL QOT..YI RADICAL KE
 F900..FA6D ; ID # Lo [366] CJK COMPATIBILITY IDEOGRAPH-F900..CJK COMPATIBILITY IDEOGRAPH-FA6D
-FA6E..FA6F ; ID # Cn [2] ..
 FA70..FAD9 ; ID # Lo [106] CJK COMPATIBILITY IDEOGRAPH-FA70..CJK COMPATIBILITY IDEOGRAPH-FAD9
-FADA..FAFF ; ID # Cn [38] ..
 FE30 ; ID # Po PRESENTATION FORM FOR VERTICAL TWO DOT LEADER
 FE31..FE32 ; ID # Pd [2] PRESENTATION FORM FOR VERTICAL EM DASH..PRESENTATION FORM FOR VERTICAL EN DASH
 FE33..FE34 ; ID # Pc [2] PRESENTATION FORM FOR VERTICAL LOW LINE..PRESENTATION FORM FOR VERTICAL WAVY LOW LINE
@@ -1696,37 +1756,26 @@ FFDA..FFDC ; ID # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL LETTE
 FFE2 ; ID # Sm FULLWIDTH NOT SIGN
 FFE3 ; ID # Sk FULLWIDTH MACRON
 FFE4 ; ID # So FULLWIDTH BROKEN BAR
+11F45..11F4F ; ID # Po [11] KAWI PUNCTUATION SECTION MARKER..KAWI PUNCTUATION CLOSING SPIRAL
 17000..187F7 ; ID # Lo [6136] TANGUT IDEOGRAPH-17000..TANGUT IDEOGRAPH-187F7
 18800..18AFF ; ID # Lo [768] TANGUT COMPONENT-001..TANGUT COMPONENT-768
 18D00..18D08 ; ID # Lo [9] TANGUT IDEOGRAPH-18D00..TANGUT IDEOGRAPH-18D08
 1B000..1B122 ; ID # Lo [291] KATAKANA LETTER ARCHAIC E..KATAKANA LETTER ARCHAIC WU
 1B170..1B2FB ; ID # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
 1F000..1F02B ; ID # So [44] MAHJONG TILE EAST WIND..MAHJONG TILE BACK
-1F02C..1F02F ; ID # Cn [4] ..
 1F030..1F093 ; ID # So [100] DOMINO TILE HORIZONTAL BACK..DOMINO TILE VERTICAL-06-06
-1F094..1F09F ; ID # Cn [12] ..
 1F0A0..1F0AE ; ID # So [15] PLAYING CARD BACK..PLAYING CARD KING OF SPADES
-1F0AF..1F0B0 ; ID # Cn [2] ..
 1F0B1..1F0BF ; ID # So [15] PLAYING CARD ACE OF HEARTS..PLAYING CARD RED JOKER
-1F0C0 ; ID # Cn
 1F0C1..1F0CF ; ID # So [15] PLAYING CARD ACE OF DIAMONDS..PLAYING CARD BLACK JOKER
-1F0D0 ; ID # Cn
 1F0D1..1F0F5 ; ID # So [37] PLAYING CARD ACE OF CLUBS..PLAYING CARD TRUMP-21
-1F0F6..1F0FF ; ID # Cn [10] ..
 1F10D..1F10F ; ID # So [3] CIRCLED ZERO WITH SLASH..CIRCLED DOLLAR SIGN WITH OVERLAID BACKSLASH
 1F16D..1F16F ; ID # So [3] CIRCLED CC..CIRCLED HUMAN FIGURE
 1F1AD ; ID # So MASK WORK SYMBOL
-1F1AE..1F1E5 ; ID # Cn [56] ..
 1F200..1F202 ; ID # So [3] SQUARE HIRAGANA HOKA..SQUARED KATAKANA SA
-1F203..1F20F ; ID # Cn [13] ..
 1F210..1F23B ; ID # So [44] SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED CJK UNIFIED IDEOGRAPH-914D
-1F23C..1F23F ; ID # Cn [4] ..
 1F240..1F248 ; ID # So [9] TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-672C..TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-6557
-1F249..1F24F ; ID # Cn [7] ..
 1F250..1F251 ; ID # So [2] CIRCLED IDEOGRAPH ADVANTAGE..CIRCLED IDEOGRAPH ACCEPT
-1F252..1F25F ; ID # Cn [14] ..
 1F260..1F265 ; ID # So [6] ROUNDED SYMBOL FOR FU..ROUNDED SYMBOL FOR CAI
-1F266..1F2FF ; ID # Cn [154] ..
 1F300..1F384 ; ID # So [133] CYCLONE..CHRISTMAS TREE
 1F386..1F39B ; ID # So [22] FIREWORKS..CONTROL KNOBS
 1F39E..1F3B4 ; ID # So [23] FILM FRAMES..FLOWER PLAYING CARDS
@@ -1765,25 +1814,14 @@ FFE4 ; ID # So FULLWIDTH BROKEN BAR
 1F6B7..1F6BF ; ID # So [9] NO PEDESTRIANS..SHOWER
 1F6C1..1F6CB ; ID # So [11] BATHTUB..COUCH AND LAMP
 1F6CD..1F6D7 ; ID # So [11] SHOPPING BAGS..ELEVATOR
-1F6D8..1F6DC ; ID # Cn [5] ..
-1F6DD..1F6EC ; ID # So [16] PLAYGROUND SLIDE..AIRPLANE ARRIVING
-1F6ED..1F6EF ; ID # Cn [3] ..
+1F6DC..1F6EC ; ID # So [17] WIRELESS..AIRPLANE ARRIVING
 1F6F0..1F6FC ; ID # So [13] SATELLITE..ROLLER SKATE
-1F6FD..1F6FF ; ID # Cn [3] ..
-1F774..1F77F ; ID # Cn [12] ..
-1F7D5..1F7D8 ; ID # So [4] CIRCLED TRIANGLE..NEGATIVE CIRCLED SQUARE
-1F7D9..1F7DF ; ID # Cn [7] ..
+1F774..1F776 ; ID # So [3] LOT OF FORTUNE..LUNAR ECLIPSE
+1F77B..1F77F ; ID # So [5] HAUMEA..ORCUS
+1F7D5..1F7D9 ; ID # So [5] CIRCLED TRIANGLE..NINE POINTED WHITE STAR
 1F7E0..1F7EB ; ID # So [12] LARGE ORANGE CIRCLE..LARGE BROWN SQUARE
-1F7EC..1F7EF ; ID # Cn [4] ..
 1F7F0 ; ID # So HEAVY EQUALS SIGN
-1F7F1..1F7FF ; ID # Cn [15] ..
-1F80C..1F80F ; ID # Cn [4] ..
-1F848..1F84F ; ID # Cn [8] ..
-1F85A..1F85F ; ID # Cn [6] ..
-1F888..1F88F ; ID # Cn [8] ..
-1F8AE..1F8AF ; ID # Cn [2] ..
 1F8B0..1F8B1 ; ID # So [2] ARROW POINTING UPWARDS THEN NORTH WEST..ARROW POINTING RIGHTWARDS THEN CURVING SOUTH WEST
-1F8B2..1F8FF ; ID # Cn [78] ..
 1F90D..1F90E ; ID # So [2] WHITE HEART..BROWN HEART
 1F910..1F917 ; ID # So [8] ZIPPER-MOUTH FACE..HUGGING FACE
 1F920..1F925 ; ID # So [6] FACE WITH COWBOY HAT..LYING FACE
@@ -1796,43 +1834,24 @@ FFE4 ; ID # So FULLWIDTH BROKEN BAR
 1F9BC..1F9CC ; ID # So [17] MOTORIZED WHEELCHAIR..TROLL
 1F9D0 ; ID # So FACE WITH MONOCLE
 1F9DE..1F9FF ; ID # So [34] GENIE..NAZAR AMULET
-1FA54..1FA5F ; ID # Cn [12] ..
 1FA60..1FA6D ; ID # So [14] XIANGQI RED GENERAL..XIANGQI BLACK SOLDIER
-1FA6E..1FA6F ; ID # Cn [2] ..
-1FA70..1FA74 ; ID # So [5] BALLET SHOES..THONG SANDAL
-1FA75..1FA77 ; ID # Cn [3] ..
-1FA78..1FA7C ; ID # So [5] DROP OF BLOOD..CRUTCH
-1FA7D..1FA7F ; ID # Cn [3] ..
-1FA80..1FA86 ; ID # So [7] YO-YO..NESTING DOLLS
-1FA87..1FA8F ; ID # Cn [9] ..
-1FA90..1FAAC ; ID # So [29] RINGED PLANET..HAMSA
-1FAAD..1FAAF ; ID # Cn [3] ..
-1FAB0..1FABA ; ID # So [11] FLY..NEST WITH EGGS
-1FABB..1FABF ; ID # Cn [5] ..
-1FAC0..1FAC2 ; ID # So [3] ANATOMICAL HEART..PEOPLE HUGGING
-1FAC6..1FACF ; ID # Cn [10] ..
-1FAD0..1FAD9 ; ID # So [10] BLUEBERRIES..JAR
-1FADA..1FADF ; ID # Cn [6] ..
-1FAE0..1FAE7 ; ID # So [8] MELTING FACE..BUBBLES
-1FAE8..1FAEF ; ID # Cn [8] ..
-1FAF7..1FAFF ; ID # Cn [9] ..
-1FC00..1FFFD ; ID # Cn [1022] ..
+1FA70..1FA7C ; ID # So [13] BALLET SHOES..CRUTCH
+1FA80..1FA88 ; ID # So [9] YO-YO..FLUTE
+1FA90..1FABD ; ID # So [46] RINGED PLANET..WING
+1FABF..1FAC2 ; ID # So [4] GOOSE..PEOPLE HUGGING
+1FACE..1FADB ; ID # So [14] MOOSE..PEA POD
+1FAE0..1FAE8 ; ID # So [9] MELTING FACE..SHAKING FACE
 20000..2A6DF ; ID # Lo [42720] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6DF
-2A6E0..2A6FF ; ID # Cn [32] ..
-2A700..2B738 ; ID # Lo [4153] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B738
-2B739..2B73F ; ID # Cn [7] ..
+2A700..2B739 ; ID # Lo [4154] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B739
 2B740..2B81D ; ID # Lo [222] CJK UNIFIED IDEOGRAPH-2B740..CJK UNIFIED IDEOGRAPH-2B81D
-2B81E..2B81F ; ID # Cn [2] ..
 2B820..2CEA1 ; ID # Lo [5762] CJK UNIFIED IDEOGRAPH-2B820..CJK UNIFIED IDEOGRAPH-2CEA1
-2CEA2..2CEAF ; ID # Cn [14] ..
 2CEB0..2EBE0 ; ID # Lo [7473] CJK UNIFIED IDEOGRAPH-2CEB0..CJK UNIFIED IDEOGRAPH-2EBE0
-2EBE1..2F7FF ; ID # Cn [3103] ..
 2F800..2FA1D ; ID # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
-2FA1E..2FFFD ; ID # Cn [1504] ..
 30000..3134A ; ID # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
-3134B..3FFFD ; ID # Cn [60595] ..
+31350..323AF ; ID # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
-# Total code points: 172456
+# The above property value applies to 62600 code points not listed here.
+# Total code points: 172465
 # ================================================
@@ -1978,6 +1997,7 @@ FE19 ; IN # Po PRESENTATION FORM FOR VERTICAL HORIZONTAL ELLIPSIS
 0CCC..0CCD ; CM # Mn [2] KANNADA VOWEL SIGN AU..KANNADA SIGN VIRAMA
 0CD5..0CD6 ; CM # Mc [2] KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
 0CE2..0CE3 ; CM # Mn [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
+0CF3 ; CM # Mc KANNADA SIGN COMBINING ANUSVARA ABOVE RIGHT
 0D00..0D01 ; CM # Mn [2] MALAYALAM SIGN COMBINING ANUSVARA ABOVE..MALAYALAM SIGN CANDRABINDU
 0D02..0D03 ; CM # Mc [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
 0D3B..0D3C ; CM # Mn [2] MALAYALAM SIGN VERTICAL BAR VIRAMA..MALAYALAM SIGN CIRCULAR VIRAMA
@@ -2072,7 +2092,9 @@ FE19 ; IN # Po PRESENTATION FORM FOR VERTICAL HORIZONTAL ELLIPSIS
 1CF4 ; CM # Mn VEDIC TONE CANDRA ABOVE
 1CF7 ; CM # Mc VEDIC SIGN ATIKRAMA
 1CF8..1CF9 ; CM # Mn [2] VEDIC TONE RING ABOVE..VEDIC TONE DOUBLE RING ABOVE
-1DC0..1DFF ; CM # Mn [64] COMBINING DOTTED GRAVE ACCENT..COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
+1DC0..1DCC ; CM # Mn [13] COMBINING DOTTED GRAVE ACCENT..COMBINING MACRON-BREVE
+1DCE..1DFB ; CM # Mn [46] COMBINING OGONEK ABOVE..COMBINING DELETION MARK
+1DFD..1DFF ; CM # Mn [3] COMBINING ALMOST EQUAL TO BELOW..COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
 200C ; CM # Cf ZERO WIDTH NON-JOINER
 200E..200F ; CM # Cf [2] LEFT-TO-RIGHT MARK..RIGHT-TO-LEFT MARK
 202A..202E ; CM # Cf [5] LEFT-TO-RIGHT EMBEDDING..RIGHT-TO-LEFT OVERRIDE
@@ -2152,6 +2174,7 @@ FFF9..FFFB ; CM # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTAT
 10AE5..10AE6 ; CM # Mn [2] MANICHAEAN ABBREVIATION MARK ABOVE..MANICHAEAN ABBREVIATION MARK BELOW
 10D24..10D27 ; CM # Mn [4] HANIFI ROHINGYA SIGN HARBAHAY..HANIFI ROHINGYA SIGN TASSI
 10EAB..10EAC ; CM # Mn [2] YEZIDI COMBINING HAMZA MARK..YEZIDI COMBINING MADDA MARK
+10EFD..10EFF ; CM # Mn [3] ARABIC SMALL LOW WORD SAKTA..ARABIC SMALL LOW WORD MADDA
 10F46..10F50 ; CM # Mn [11] SOGDIAN COMBINING DOT BELOW..SOGDIAN COMBINING STROKE BELOW
 10F82..10F85 ; CM # Mn [4] OLD UYGHUR COMBINING DOT ABOVE..OLD UYGHUR COMBINING TWO DOTS BELOW
 11000 ; CM # Mc BRAHMI SIGN CANDRABINDU
@@ -2188,6 +2211,7 @@ FFF9..FFFB ; CM # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTAT
 11235 ; CM # Mc KHOJKI SIGN VIRAMA
 11236..11237 ; CM # Mn [2] KHOJKI SIGN NUKTA..KHOJKI SIGN SHADDA
 1123E ; CM # Mn KHOJKI SIGN SUKUN
+11241 ; CM # Mn KHOJKI VOWEL SIGN VOCALIC R
 112DF ; CM # Mn KHUDAWADI SIGN ANUSVARA
 112E0..112E2 ; CM # Mc [3] KHUDAWADI VOWEL SIGN AA..KHUDAWADI VOWEL SIGN II
 112E3..112EA ; CM # Mn [8] KHUDAWADI VOWEL SIGN U..KHUDAWADI SIGN VIRAMA
@@ -2292,6 +2316,16 @@ FFF9..FFFB ; CM # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTAT
 11D97 ; CM # Mn GUNJALA GONDI VIRAMA
 11EF3..11EF4 ; CM # Mn [2] MAKASAR VOWEL SIGN I..MAKASAR VOWEL SIGN U
 11EF5..11EF6 ; CM # Mc [2] MAKASAR VOWEL SIGN E..MAKASAR VOWEL SIGN O
+11F00..11F01 ; CM # Mn [2] KAWI SIGN CANDRABINDU..KAWI SIGN ANUSVARA
+11F03 ; CM # Mc KAWI SIGN VISARGA
+11F34..11F35 ; CM # Mc [2] KAWI VOWEL SIGN AA..KAWI VOWEL SIGN ALTERNATE AA
+11F36..11F3A ; CM # Mn [5] KAWI VOWEL SIGN I..KAWI VOWEL SIGN VOCALIC R
+11F3E..11F3F ; CM # Mc [2] KAWI VOWEL SIGN E..KAWI VOWEL SIGN AI
+11F40 ; CM # Mn KAWI VOWEL SIGN EU
+11F41 ; CM # Mc KAWI SIGN KILLER
+11F42 ; CM # Mn KAWI CONJOINER
+13440 ; CM # Mn EGYPTIAN HIEROGLYPH MIRROR HORIZONTALLY
+13447..13455 ; CM # Mn [15] EGYPTIAN HIEROGLYPH MODIFIER DAMAGED AT TOP START..EGYPTIAN HIEROGLYPH MODIFIER DAMAGED
 16AF0..16AF4 ; CM # Mn [5] BASSA VAH COMBINING HIGH TONE..BASSA VAH COMBINING HIGH-LOW TONE
 16B30..16B36 ; CM # Mn [7] PAHAWH HMONG MARK CIM TUB..PAHAWH HMONG MARK CIM TAUM
 16F4F ; CM # Mn MIAO SIGN CONSONANT MODIFIER BAR
@@ -2321,16 +2355,18 @@ FFF9..FFFB ; CM # Cf [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTAT
 1E01B..1E021 ; CM # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI
 1E023..1E024 ; CM # Mn [2] COMBINING GLAGOLITIC LETTER YU..COMBINING GLAGOLITIC LETTER SMALL YUS
 1E026..1E02A ; CM # Mn [5] COMBINING GLAGOLITIC LETTER YO..COMBINING GLAGOLITIC LETTER FITA
+1E08F ; CM # Mn COMBINING CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
 1E130..1E136 ; CM # Mn [7] NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACHUE HMONG TONE-D
 1E2AE ; CM # Mn TOTO SIGN RISING TONE
 1E2EC..1E2EF ; CM # Mn [4] WANCHO TONE TUP..WANCHO TONE KOINI
+1E4EC..1E4EF ; CM # Mn [4] NAG MUNDARI SIGN MUHOR..NAG MUNDARI SIGN SUTUH
 1E8D0..1E8D6 ; CM # Mn [7] MENDE KIKAKUI COMBINING NUMBER TEENS..MENDE KIKAKUI COMBINING NUMBER MILLIONS
 1E944..1E94A ; CM # Mn [7] ADLAM ALIF LENGTHENER..ADLAM NUKTA
 E0001 ; CM # Cf LANGUAGE TAG
 E0020..E007F ; CM # Cf [96] TAG SPACE..CANCEL TAG
 E0100..E01EF ; CM # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
-# Total code points: 2399
+# Total code points: 2438
 # ================================================
@@ -2360,9 +2396,10 @@ A8FC ; BB # Po DEVANAGARI SIGN SIDDHAM
 11A3F ; BB # Po ZANABAZAR SQUARE INITIAL HEAD MARK
 11A45 ; BB # Po ZANABAZAR SQUARE INITIAL DOUBLE-LINED HEAD MARK
 11A9E..11AA0 ; BB # Po [3] SOYOMBO HEAD MARK WITH MOON AND SUN AND TRIPLE FLAME..SOYOMBO HEAD MARK WITH MOON AND SUN
+11B00..11B09 ; BB # Po [10] DEVANAGARI HEAD MARK..DEVANAGARI SIGN MINDU
 11C70 ; BB # Po MARCHEN HEAD MARK
-# Total code points: 45
+# Total code points: 55
 # ================================================
@@ -2461,6 +2498,7 @@ ABEB ; BA # Po MEETEI MAYEK CHEIKHEI
 11A9A..11A9C ; BA # Po [3] SOYOMBO MARK TSHEG..SOYOMBO MARK DOUBLE SHAD
 11AA1..11AA2 ; BA # Po [2] SOYOMBO TERMINAL MARK-1..SOYOMBO TERMINAL MARK-2
 11C41..11C45 ; BA # Po [5] BHAIKSUKI DANDA..BHAIKSUKI GAP FILLER-2
+11F43..11F44 ; BA # Po [2] KAWI DANDA..KAWI DOUBLE DANDA
 11FFF ; BA # Po TAMIL PUNCTUATION END OF TEXT
 12470..12474 ; BA # Po [5] CUNEIFORM PUNCTUATION SIGN OLD ASSYRIAN WORD DIVIDER..CUNEIFORM PUNCTUATION SIGN DIAGONAL QUADCOLON
 16A6E..16A6F ; BA # Po [2] MRO DANDA..MRO DOUBLE DANDA
@@ -2471,7 +2509,7 @@ ABEB ; BA # Po MEETEI MAYEK CHEIKHEI
 1BC9F ; BA # Po DUPLOYAN PUNCTUATION CHINOOK FULL STOP
 1DA87..1DA8A ; BA # Po [4] SIGNWRITING COMMA..SIGNWRITING COLON
-# Total code points: 247
+# Total code points: 249
 # ================================================
@@ -2538,7 +2576,7 @@ FFFC ; CB # So OBJECT REPLACEMENT CHARACTER
 0EBD ; SA # Lo LAO SEMIVOWEL SIGN NYO
 0EC0..0EC4 ; SA # Lo [5] LAO VOWEL SIGN E..LAO VOWEL SIGN AI
 0EC6 ; SA # Lm LAO KO LA
-0EC8..0ECD ; SA # Mn [6] LAO TONE MAI EK..LAO NIGGAHITA
+0EC8..0ECE ; SA # Mn [7] LAO TONE MAI EK..LAO YAMAKKAN
 0EDC..0EDF ; SA # Lo [4] LAO HO NO..LAO LETTER KHMU NYO
 1000..102A ; SA # Lo [43] MYANMAR LETTER KA..MYANMAR LETTER AU
 102B..102C ; SA # Mc [2] MYANMAR VOWEL SIGN TALL AA..MYANMAR VOWEL SIGN AA
@@ -2641,7 +2679,7 @@ AADE..AADF ; SA # Po [2] TAI VIET SYMBOL HO HOI..TAI VIET SYMBOL KOI KOI
 1173F ; SA # So AHOM SYMBOL VI
 11740..11746 ; SA # Lo [7] AHOM LETTER CA..AHOM LETTER LLA
-# Total code points: 757
+# Total code points: 758
 # ================================================
@@ -3706,10 +3744,12 @@ FB46..FB4F ; HL # Lo [10] HEBREW LETTER TSADI WITH DAGESH..HEBREW LIGATURE A
 31F0..31FF ; CJ # Lo [16] KATAKANA LETTER SMALL KU..KATAKANA LETTER SMALL RO
 FF67..FF6F ; CJ # Lo [9] HALFWIDTH KATAKANA LETTER SMALL A..HALFWIDTH KATAKANA LETTER SMALL TU
 FF70 ; CJ # Lm HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK
+1B132 ; CJ # Lo HIRAGANA LETTER SMALL KO
 1B150..1B152 ; CJ # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
+1B155 ; CJ # Lo KATAKANA LETTER SMALL KO
 1B164..1B167 ; CJ # Lo [4] KATAKANA LETTER SMALL WI..KATAKANA LETTER SMALL N
-# Total code points: 58
+# Total code points: 60
 # ================================================
@@ -3762,9 +3802,9 @@ FF70 ; CJ # Lm HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK
 1F9CD..1F9CF ; EB # So [3] STANDING PERSON..DEAF PERSON
 1F9D1..1F9DD ; EB # So [13] ADULT..ELF
 1FAC3..1FAC5 ; EB # So [3] PREGNANT MAN..PERSON WITH CROWN
-1FAF0..1FAF6 ; EB # So [7] HAND WITH INDEX FINGER AND THUMB CROSSED..HEART HANDS
+1FAF0..1FAF8 ; EB # So [9] HAND WITH INDEX FINGER AND THUMB CROSSED..RIGHTWARDS PUSHING HAND
-# Total code points: 132
+# Total code points: 134
 # ================================================