From owner-freebsd-questions@FreeBSD.ORG Sat May 7 01:13:49 2011 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 8FF20106566C for ; Sat, 7 May 2011 01:13:49 +0000 (UTC) (envelope-from listreader@lazlarlyricon.com) Received: from mailgw4.surf-town.net (mail7.surf-town.net [212.97.132.47]) by mx1.freebsd.org (Postfix) with ESMTP id 4ABC38FC0C for ; Sat, 7 May 2011 01:13:48 +0000 (UTC) Received: by mailgw4.surf-town.net (Postfix, from userid 65534) id 9A811220E6C; Sat, 7 May 2011 03:13:47 +0200 (CEST) Received: from localhost (localhost [127.0.0.1]) by mailgw4.surf-town.net (Postfix) with ESMTP id 6C7BE220E6C; Sat, 7 May 2011 03:13:47 +0200 (CEST) X-Virus-Scanned: Debian amavisd-new at mailgw4.surf-town.net X-Spam-Flag: NO X-Spam-Score: -1.44 X-Spam-Level: X-Spam-Status: No, score=-1.44 tagged_above=-999 required=7 tests=[ALL_TRUSTED=-1.44] Received: from mailgw4.surf-town.net ([127.0.0.1]) by localhost (mailgw4.surf-town.net [127.0.0.1]) (amavisd-new, port 10024) with LMTP id eTgoHvsj53+9; Sat, 7 May 2011 03:13:42 +0200 (CEST) Received: from lazlar.kicks-ass.net (c-0987e355.09-42-6e6b7010.cust.bredbandsbolaget.se [85.227.135.9]) by mailgw4.surf-town.net (Postfix) with ESMTPA id D6A861D14F2; Sat, 7 May 2011 03:13:40 +0200 (CEST) Message-ID: <4DC49CC3.8000103@lazlarlyricon.com> Date: Sat, 07 May 2011 03:13:39 +0200 From: Rolf Nielsen User-Agent: Mozilla/5.0 (X11; U; FreeBSD amd64; sv-SE; rv:1.9.2.17) Gecko/20110502 Lightning/1.0b2 Thunderbird/3.1.10 MIME-Version: 1.0 To: Robert Bonomi References: <201105070054.p470sgYR092690@mail.r-bonomi.com> In-Reply-To: <201105070054.p470sgYR092690@mail.r-bonomi.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: freebsd-questions@freebsd.org Subject: Re: Comparing two lists X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 07 May 2011 01:13:49 -0000 2011-05-07 02:54, Robert Bonomi skrev: >> From owner-freebsd-questions@freebsd.org Fri May 6 19:27:54 2011 >> Date: Sat, 07 May 2011 02:09:26 +0200 >> From: Rolf Nielsen >> To: FreeBSD >> Subject: Comparing two lists >> >> Hello all, >> >> I have two text files, quite extensive ones. They have some lines in >> common and some lines are unique to one of the files. The lines that do >> exist in both files are not necessarily in the same location. Now I need >> to compare the files and output a list of lines that exist in both >> files. Is there a simple way to do this? diff? awk? sed? cmp? Or a >> combination of two or more of them? > > > If the files have only 'minor' differences -- i.e. no long runs of lines > that are in only one fie -- *and* the common lines are in the same order > in each file, you can use diff(1), without any other shennigans. > > If the above is -not- true, and If you need _only_ the common lines, AND > order is not important, then sort(1) both files, and use diff(1) on the > two sorted versions. > > > Beyond that it depends on what you mean by 'extensive' ones. megabytes? > Gigabytes? or what?? > > > Some 10,000 to 20,000 lines each. I do need only the common lines. Order is not essential, but would make life easier. I've tried a little with uniq, as suggested by Polyptron, but I guess 3am is not quite the right time to do these things. Anyway, thanks.