path: root/lib/unicode
Commit message  Author  Age  Files  Lines
* More 2-stage lookup tables  Thomas Voss  2024-05-04  13  -23078/+61051
|
* Go all in on string views, and fix manuals  Thomas Voss  2024-05-04  10  -57/+53
|
* Add tests for u8title() and fix ‘ς’ bug  Thomas Voss  2024-05-04  1  -1/+1
|
* Add tests for u8lower() and fix ‘ς’ bug  Thomas Voss  2024-05-04  1  -2/+4
|
* Add more 2-stage lookup tables  Thomas Voss  2024-05-03  11  -7240/+31423
|
* Don’t use binary search  Thomas Voss  2024-05-03  1  -1986/+1980
|
* Add more 2-stage lookup tables  Thomas Voss  2024-05-03  3  -2849/+8051
|
* Add even more 2-stage lookup tables  Thomas Voss  2024-04-30  4  -2158/+6392
|
* Fix algorithm to determine optimal blksize  Thomas Voss  2024-04-30  1  -625/+785
|
* Add more 2-stage lookup tables  Thomas Voss  2024-04-30  4  -1749/+29789
|
* Use 2-stage lookup table in uprop_get_bc()  Thomas Voss  2024-04-30  1  -1526/+3523
|
* Use a 2-stage lookup table for uprop_get_age()  Thomas Voss  2024-04-28  1  -1701/+9316
|
* Use a 2-stage lookup table for uprop_get_gc()  Thomas Voss  2024-04-28  1  -3285/+4627
|
* Implement boolean props using 2-stage lookup  Thomas Voss  2024-04-28  50  -12161/+20192
|
* Add Dutch titlecase tailoring (ijssel → IJssel)  Thomas Voss  2024-04-27  1  -7/+15
|
* Fully comply with Unicode 15.1 § 3.13 in u8title()  Thomas Voss  2024-04-26  1  -53/+90
|
* Only compute before_dot or more_above for AZ/TR/LT  Thomas Voss  2024-04-26  1  -17/+19
|
* Fully comply with Unicode 15.1 § 3.13 in u8lower()  Thomas Voss  2024-04-26  3  -25/+72
|
* Support titlecasing Dutch ‘IJ’ properly  Thomas Voss  2024-04-24  1  -3/+11
|
* Properly upper- and titlecase ‘i’ and ‘j’ in Lithuanian  Thomas Voss  2024-04-24  4  -14/+41
|
* Add u8title()  Thomas Voss  2024-04-23  1  -0/+58
|
* Properly lowercase ‘Σ’ to ‘ς’ when at end of word  Thomas Voss  2024-04-22  1  -2/+5
|
* Add u8wcnt_human() and u8wnext_human()  Thomas Voss  2024-04-22  2  -0/+37
|
* Fix various bugs in word segmentation  Thomas Voss  2024-04-22  1  -126/+214
|
* Add explicit underlying type for enums  Thomas Voss  2024-04-22  1  -1/+1
|     Due to the size of the lookup tables, we want to use the smallest possible underlying type for the Unicode property enumerations in order to reduce the binary size as much as possible.
* Support word-segmentation  Thomas Voss  2024-04-21  2  -0/+171
|
* Fix typo (extpic -> extpict)  Thomas Voss  2024-04-21  1  -1/+1
|
* Rename u8glen() to u8gcnt()  Thomas Voss  2024-04-21  1  -1/+1
|
* Add uprop_get_sb()  Thomas Voss  2024-04-21  1  -0/+2447
|
* Add uprop_get_wb()  Thomas Voss  2024-04-21  1  -0/+1117
|
* Add uprop_get_scx()  Thomas Voss  2024-04-21  1  -0/+180
|
* Add uprop_get_nfkc_cf()  Thomas Voss  2024-04-21  2  -6053/+12094
|
* Add uprop_get_nfkc_scf()  Thomas Voss  2024-04-20  1  -0/+6075
|
* Add uprop_get_nf{,k}{c,d}_qc()  Thomas Voss  2024-04-20  4  -0/+1204
|
* Add uprop_get_jg()  Thomas Voss  2024-04-18  1  -0/+230
|
* Add uprop_get_jt()  Thomas Voss  2024-04-18  1  -0/+560
|
* Add uprop_get_gcb()  Thomas Voss  2024-04-16  1  -0/+1419
|
* Add uprop_get_equideo()  Thomas Voss  2024-04-16  1  -0/+369
|
* Add uprop_get_ccc()  Thomas Voss  2024-04-16  1  -0/+408
|
* Remove blank line  Thomas Voss  2024-04-16  1  -1/+0
|
* Add uprop_get_insc()  Thomas Voss  2024-04-16  1  -0/+808
|
* Add uprop_get_inpc()  Thomas Voss  2024-04-16  1  -0/+641
|
* Check lookup[0].lo instead of a magic number  Thomas Voss  2024-04-16  1  -1/+1
|
* Add uprop_get_hst()  Thomas Voss  2024-04-16  1  -0/+823
|
* Add uprop_get_vo()  Thomas Voss  2024-04-16  1  -0/+224
|
* Change private symbol prefix from ‘__’ to ‘_’  Thomas Voss  2024-04-16  72  -144/+144
|
* Add uprop_get_bmg()  Thomas Voss  2024-04-15  1  -0/+472
|
* Add uprop_get_bc()  Thomas Voss  2024-04-15  1  -0/+1538
|
* Implement casefolding  Thomas Voss  2024-04-15  1  -0/+24
|
* Add uprop_get_cf()  Thomas Voss  2024-04-15  1  -0/+130
|