diff options
author | Thomas Voss <mail@thomasvoss.com> | 2024-01-23 01:57:39 +0100 |
---|---|---|
committer | Thomas Voss <mail@thomasvoss.com> | 2024-01-23 01:57:39 +0100 |
commit | 1256660e1f0cea877b6d453704343f07d73d6224 (patch) | |
tree | fff760729e37626fc3749e587a29be5e94184db2 /man | |
parent | 52cde82ece54db9796e43f9ee7497d50414a216f (diff) |
Properly support UTF-8 in patterns
Diffstat (limited to 'man')
-rw-r--r-- | man/grab.1 | 17 |
1 files changed, 13 insertions, 4 deletions
@@ -1,4 +1,4 @@ -.Dd 22 January, 2024 +.Dd 23 January, 2024 .Dt GRAB 1 .Os Grab 2.0.1 .Sh NAME @@ -206,9 +206,17 @@ and .Sq G/^1337$/ filters out the specific number 1337. .Pp -As you may use whichever delimiter you like, the following is also valid: +The delimiter used for each given operator can be any valid UTF-8 +codepoint. +As a result, +the following pattern using the delimiters +.Sq | , +.Sq \&. , +and +.Sq ä +is well-formed: .Pp -.Dl x|[0\-9]+| g.3. G#^1337# +.Dl x|[0\-9]+| g.3. Gä^1337ä .Pp Operators are not allowed to take empty regular expression arguments with one exception: @@ -337,6 +345,7 @@ the newline will be matched by .Ql [^a] . .Sh BUGS -Input files must be encoded as UTF-8. +The pattern string provided as a command-line argument as well as the +provided input files must be encoded as UTF-8. No other encodings are supported unless they are UTF-8 compatible, such as ASCII. |