From owner-soc-status@FreeBSD.ORG Sun Aug 7 19:25:16 2011 Return-Path: Delivered-To: soc-status@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id B0DFC106566B for ; Sun, 7 Aug 2011 19:25:16 +0000 (UTC) (envelope-from gabor@FreeBSD.org) Received: from server.mypc.hu (server.mypc.hu [87.229.73.95]) by mx1.freebsd.org (Postfix) with ESMTP id 6E3C98FC0C for ; Sun, 7 Aug 2011 19:25:16 +0000 (UTC) Received: from server.mypc.hu (localhost [127.0.0.1]) by server.mypc.hu (Postfix) with ESMTP id 912E014E5C8A for ; Sun, 7 Aug 2011 21:25:14 +0200 (CEST) X-Virus-Scanned: amavisd-new at server.mypc.hu Received: from server.mypc.hu ([127.0.0.1]) by server.mypc.hu (server.mypc.hu [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 0YySZ6yWSyuX for ; Sun, 7 Aug 2011 21:25:12 +0200 (CEST) Received: from [192.168.1.106] (catv-80-98-232-12.catv.broadband.hu [80.98.232.12]) (using TLSv1 with cipher DHE-RSA-CAMELLIA256-SHA (256/256 bits)) (No client certificate requested) by server.mypc.hu (Postfix) with ESMTPSA id 5DE0014E5C67 for ; Sun, 7 Aug 2011 21:25:12 +0200 (CEST) Message-ID: <4E3EE697.8070502@FreeBSD.org> Date: Sun, 07 Aug 2011 21:25:11 +0200 From: Gabor Kovesdan User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; pt-PT; rv:1.9.2.8) Gecko/20100802 Thunderbird/3.1.2 MIME-Version: 1.0 To: soc-status@freebsd.org Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Subject: regex status report #11 X-BeenThere: soc-status@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Summer of Code Status Reports and Discussion List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 07 Aug 2011 19:25:16 -0000 Hi, during the week I implemented the Boyer-Moore algorithm and I couldn't measure any speed up neither in bsdgrep nor in sed. Probably, bsdgrep has other bottlenecks and sed runs so fast even with big files that the more efficient algorithm is not measurable. So for now, I decided to clean this code up a bit and move on to the heuristical matching and also take a look at how to optimize bsdgrep. It is more convenient to work on the actual bottlenecks and I can get back to this code later when the rest performs well. I just noticed that the suggested pencils down date is coming. (Aug 15) I will work until the hard deadline (Aug 22) to compensate the hours that I missed because of moving and of course, I plan to continue working on my project after the summer to finally replace the regex code and also get rid of GNU grep. Gabor