From owner-freebsd-bugs Tue Sep 28 1: 0: 4 1999 Delivered-To: freebsd-bugs@freebsd.org Received: from freefall.freebsd.org (freefall.FreeBSD.ORG [204.216.27.21]) by hub.freebsd.org (Postfix) with ESMTP id F0FC514D21 for ; Tue, 28 Sep 1999 01:00:01 -0700 (PDT) (envelope-from gnats@FreeBSD.org) Received: (from gnats@localhost) by freefall.freebsd.org (8.9.3/8.9.2) id BAA42964; Tue, 28 Sep 1999 01:00:01 -0700 (PDT) (envelope-from gnats@FreeBSD.org) Received: from sibptus.tomsk.ru (sibptus.tomsk.ru [212.176.23.5]) by hub.freebsd.org (Postfix) with ESMTP id 447F014A0D for ; Tue, 28 Sep 1999 00:57:08 -0700 (PDT) (envelope-from tolik@sibptus.tomsk.ru) Received: (from tolik@localhost) by sibptus.tomsk.ru (8.8.8/8.8.8) id PAA28332; Tue, 28 Sep 1999 15:56:40 +0800 (TSD) (envelope-from tolik) Message-Id: <199909280756.PAA28332@sibptus.tomsk.ru> Date: Tue, 28 Sep 1999 15:56:40 +0800 (TSD) From: tolik@mpeks.tomsk.su Reply-To: tolik@mpeks.tomsk.su To: FreeBSD-gnats-submit@freebsd.org X-Send-Pr-Version: 3.2 Subject: gnu/14011: Unpredictable grep(1) output for 8-bit characters Sender: owner-freebsd-bugs@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org >Number: 14011 >Category: gnu >Synopsis: grep(1) returned unpredictable output for 8-bit characters >Confidential: no >Severity: non-critical >Priority: medium >Responsible: freebsd-bugs >State: open >Quarter: >Keywords: >Date-Required: >Class: sw-bug >Submitter-Id: current-users >Arrival-Date: Tue Sep 28 01:00:00 PDT 1999 >Closed-Date: >Last-Modified: >Originator: Anatoly A. Orehovsky >Release: FreeBSD 2.2.8-RELEASE i386 and others >Organization: SibPTUS >Environment: FreeBSD box where russian (for example) locale is set. >Description: If you wanted to search simple russian phrase with grep(1), then grep(1) returned unpredictable output. >How-To-Repeat: $ export LANG=ru_RU.KOI8-R $ echo '27 ΣΕΞ' | grep '27 ΣΕΞ' # search '27 Sep' by russian and grep's output is nothing If input file contain many strings '27 ΣΕΞ', then grep returned unpredictable output or nothing. >Fix: Workaround: Using grep's -i option (if case sensitive not needed): $ export LANG=ru_RU.KOI8-R $ echo '27 ΣΕΞ' | grep -i '27 ΣΕΞ' 27 ΣΕΞ Patch to fix problem: patch.092899: --- grep/kwset.c.orig Tue Sep 28 15:19:49 1999 +++ grep/kwset.c Tue Sep 28 15:20:42 1999 @@ -527,7 +527,7 @@ { struct kwset *kwset; register unsigned char *d1; - register char *ep, *sp, *tp; + register unsigned char *ep, *sp, *tp; register int d, gc, i, len, md2; kwset = (struct kwset *) kws; __ Anatoly A. Orehovsky. AO9-RIPE. AAO1-RIPN http://www.tekmetrics.com/transcript.shtml?pid=6064 >Release-Note: >Audit-Trail: >Unformatted: To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-bugs" in the body of the message