From c0a983a29af17415ef29058d72f1a9cd99ddd83f Mon Sep 17 00:00:00 2001 From: Thomas Voss Date: Mon, 22 Apr 2024 21:06:52 +0200 Subject: Fix various bugs in word segmentation --- README | 9 ++++----- 1 file changed, 4 insertions(+), 5 deletions(-) (limited to 'README') diff --git a/README b/README index fc6e5de..a158aac 100644 --- a/README +++ b/README @@ -105,7 +105,7 @@ FEATURES: • Properties related to case-mapping are context-aware (via a context-struct argument) • unicode/string.h - • Grapheme iteration and -counting + • Iteration and counting of graphemes and words in a string • Unicode-aware case-mapping of strings with truncation checking • Case-mapping supports optional language-specific quirks (Azeri, Lithuanian, German, etc.) @@ -113,10 +113,9 @@ FEATURES: PLANNED FEATURES: - • Missing Unicode Properties (unicode/prop.h) - • String Case Conversions (unicode/string.h) - • Unicode Normalization (unicode/string.h) - • Word- and Line Segmentation (unicode/string.h) + • String Case Conversions (unicode/string.h) + • Unicode Normalization (unicode/string.h) + • Line- and Sentence Segmentation (unicode/string.h) BUGS: -- cgit v1.2.3