Date: Fri, 23 Nov 2018 19:02:40 +0300
From: Yuri Pankov <yuripv@yuripv.net>
To: freebsd-hackers <freebsd-hackers@freebsd.org>
Subject: regex, multibyte locales, and word boundaries
Message-ID: <5166f3c9-d587-a245-df21-8e50f075a8cc@yuripv.net>
Hi,

We have the following note in the BUGS section of regcomp(3):

----------------------------------------------------------------------
Word-boundary matching does not work properly in multibyte locales.
----------------------------------------------------------------------

It was added ages ago along with multibyte support in our regex
implementation, though I can't think of any positive test case to see
that the problem is real, and eventually fix it.

I'm wondering if anyone has real life examples showing the bug?
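For anyone who wants to hunt for a test case, a minimal probe might look like the sketch below. It is only an illustration and not part of the original message: the helper name try_match is hypothetical, and it assumes the locale comes from the environment (for example LC_ALL=en_US.UTF-8). It compiles an extended regex and reports whether it matches, so word-boundary patterns can be compared against multibyte input.

```c
#include <assert.h>
#include <locale.h>
#include <regex.h>
#include <stddef.h>

/*
 * Hypothetical helper (not from the original message): compile `pattern`
 * as a POSIX extended regex and try it against `text`.
 * Returns 1 on a match, 0 on no match (or a run-time regexec error),
 * and -1 if the pattern does not compile.
 */
static int
try_match(const char *pattern, const char *text)
{
	regex_t re;
	int rc;

	assert(pattern != NULL && text != NULL);

	/* Assumption: the caller selects the locale through the environment. */
	(void)setlocale(LC_ALL, "");

	if (regcomp(&re, pattern, REG_EXTENDED | REG_NOSUB) != 0)
		return (-1);

	rc = regexec(&re, text, 0, NULL, 0);
	regfree(&re);

	return (rc == 0 ? 1 : 0);
}
```

A candidate probe would be something like try_match("[[:<:]]мир[[:>:]]", "привет мир") run under a UTF-8 locale, using the [[:<:]] and [[:>:]] word-boundary markers described in re_format(7). Whether any such input actually misbehaves is exactly the open question here, and regex libraries that lack those markers would simply return -1.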
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?5166f3c9-d587-a245-df21-8e50f075a8cc>