From owner-freebsd-questions@FreeBSD.ORG Mon Apr 27 15:23:34 2009 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id A01E4106566B for ; Mon, 27 Apr 2009 15:23:34 +0000 (UTC) (envelope-from lists@jnielsen.net) Received: from ns1.jnielsen.net (ns1.jnielsen.net [69.55.238.237]) by mx1.freebsd.org (Postfix) with ESMTP id 6906F8FC16 for ; Mon, 27 Apr 2009 15:23:34 +0000 (UTC) (envelope-from lists@jnielsen.net) Received: from [172.17.2.19] (jn@stealth.jnielsen.net [74.218.226.254]) (authenticated bits=0) by ns1.jnielsen.net (8.12.9p2/8.12.9) with ESMTP id n3RFNXCx098574; Mon, 27 Apr 2009 11:23:33 -0400 (EDT) (envelope-from lists@jnielsen.net) From: John Nielsen To: Giorgos Keramidas Date: Mon, 27 Apr 2009 11:23:32 -0400 User-Agent: KMail/1.9.10 References: <200904240535.37460.lists@jnielsen.net> <874owc8bfx.fsf@kobe.laptop> In-Reply-To: <874owc8bfx.fsf@kobe.laptop> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200904271123.32647.lists@jnielsen.net> X-Virus-Scanned: ClamAV version 0.88.4, clamav-milter version 0.88.4 on ns1.jnielsen.net X-Virus-Status: Clean Cc: freebsd-questions@freebsd.org Subject: Re: CVS history access? X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 27 Apr 2009 15:23:34 -0000 On Saturday 25 April 2009 09:12:50 pm Giorgos Keramidas wrote: > On Fri, 24 Apr 2009 05:35:34 -0400, John Nielsen wrote: > > I'm working on a machine learning project and I'd like to use the > > FreeBSD src CVS commit history as a datasource. Is there a > > resource-friendly way for me to download some or all of it? Format > > isn't too big an issue. > > > > I tried a few "cvs history" commands against the anoncvs servers but > > get this: cvs [history aborted]: cannot open history file: > > /home/ncvs/CVSROOT/history: No such file or directory > > Do you really want just the `CVSROOT/history' file? We allow mirroring > of the entire repository, which you can then use to extract any sort of > historical commit data. (Well, _almost_ anything. Some things like > repo-copies and renames of raw repository files have been done without > any sort of record, so it may be impossible to recover *those* > particular bits.) I'm basically looking for a list of all commits over the past N (>2) years with committer, timestamp, affected file(s) and/or subsystems and possibly diff size information, etc. I don't know anything about the "history" file in particular other than that's what cvs complained about when I tried the "cvs history" commands against anoncvs. It looks like the /pub/FreeBSD/development/FreeBSD-CVS/src ftp path may have what I'm looking for (though it may be scattered through the individual files). I'll probably (try to) set up a local CVS repo and source it from there and see where that gets me. My CVS-fu is weak so I'm still open to pointers. > We also have a Subversion repository now, that you can use to grab > commit information. It takes slightly more disk space than the CVS > repository, but subversion can export XML formatted commit logs, which > may be slightly more useful if you plan to automate parts of the > parsing and info-gathering. Yes, I'll definitely be automating the parsing, etc. Is it safe to assume that the cvs2svn migration went successfully? XML logs do sound appealing and aggregated (same time, multiple files) commits would be more useful than per-file. Can I just check everything out from svn://svn.freebsd.org/base/? Thanks! JN