aboutsummaryrefslogtreecommitdiff
path: root/man/grab.1
diff options
context:
space:
mode:
authorThomas Voss <mail@thomasvoss.com> 2024-01-23 01:57:39 +0100
committerThomas Voss <mail@thomasvoss.com> 2024-01-23 01:57:39 +0100
commit1256660e1f0cea877b6d453704343f07d73d6224 (patch)
treefff760729e37626fc3749e587a29be5e94184db2 /man/grab.1
parent52cde82ece54db9796e43f9ee7497d50414a216f (diff)
Properly support UTF-8 in patterns
Diffstat (limited to 'man/grab.1')
-rw-r--r--man/grab.117
1 files changed, 13 insertions, 4 deletions
diff --git a/man/grab.1 b/man/grab.1
index cdcacfa..5e36c53 100644
--- a/man/grab.1
+++ b/man/grab.1
@@ -1,4 +1,4 @@
-.Dd 22 January, 2024
+.Dd 23 January, 2024
.Dt GRAB 1
.Os Grab 2.0.1
.Sh NAME
@@ -206,9 +206,17 @@ and
.Sq G/^1337$/
filters out the specific number 1337.
.Pp
-As you may use whichever delimiter you like, the following is also valid:
+The delimiter used for each given operator can be any valid UTF-8
+codepoint.
+As a result,
+the following pattern using the delimiters
+.Sq | ,
+.Sq \&. ,
+and
+.Sq ä
+is well-formed:
.Pp
-.Dl x|[0\-9]+| g.3. G#^1337#
+.Dl x|[0\-9]+| g.3. Gä^1337ä
.Pp
Operators are not allowed to take empty regular expression arguments with
one exception:
@@ -337,6 +345,7 @@ the newline will
be matched by
.Ql [^a] .
.Sh BUGS
-Input files must be encoded as UTF-8.
+The pattern string provided as a command-line argument as well as the
+provided input files must be encoded as UTF-8.
No other encodings are supported unless they are UTF-8 compatible,
such as ASCII.