From owner-freebsd-fs@FreeBSD.ORG  Wed Oct 25 21:42:25 2006
Return-Path: <owner-freebsd-fs@FreeBSD.ORG>
X-Original-To: freebsd-fs@freebsd.org
Delivered-To: freebsd-fs@freebsd.org
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
	by hub.freebsd.org (Postfix) with ESMTP id A44C416A407
	for <freebsd-fs@freebsd.org>; Wed, 25 Oct 2006 21:42:25 +0000 (UTC)
	(envelope-from rwatson@FreeBSD.org)
Received: from cyrus.watson.org (cyrus.watson.org [209.31.154.42])
	by mx1.FreeBSD.org (Postfix) with ESMTP id 5582443D45
	for <freebsd-fs@freebsd.org>; Wed, 25 Oct 2006 21:42:25 +0000 (GMT)
	(envelope-from rwatson@FreeBSD.org)
Received: from fledge.watson.org (fledge.watson.org [209.31.154.41])
	by cyrus.watson.org (Postfix) with ESMTP id 0A25C46D96;
	Wed, 25 Oct 2006 17:42:25 -0400 (EDT)
Date: Wed, 25 Oct 2006 22:42:24 +0100 (BST)
From: Robert Watson <rwatson@FreeBSD.org>
X-X-Sender: robert@fledge.watson.org
To: Scott Long <scottl@samsco.org>
In-Reply-To: <453FBCDF.5050807@samsco.org>
Message-ID: <20061025223246.G33725@fledge.watson.org>
References: <200610251642.k9PGgr4t054536@amd64.ott.parse.com>
	<20061025180112.P33725@fledge.watson.org> <453FBCDF.5050807@samsco.org>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed
Cc: freebsd-fs@freebsd.org, Robert Krten <root@parse.com>
Subject: Re: Naive question about encrypted disks
X-BeenThere: freebsd-fs@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Filesystems <freebsd-fs.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-fs>,
	<mailto:freebsd-fs-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-fs>
List-Post: <mailto:freebsd-fs@freebsd.org>
List-Help: <mailto:freebsd-fs-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-fs>,
	<mailto:freebsd-fs-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Wed, 25 Oct 2006 21:42:25 -0000


On Wed, 25 Oct 2006, Scott Long wrote:

>> Deriving the key when you have examples of plaintext and ciphertext for 
>> that plaintext is known as a "known-plaintext attack".  Resistence to 
>> known-plaintext attacks is one of the most important properties required of 
>> modern crypto algorithms.  Other examples of cases where resistance to 
>> known-plaintext attacks is critical include:
>> 
>> - IPSEC, where it's often the case that a potential attacker can trigger 
>> known
>>   plaintext to appear in the plaintext, and also through a packet sniffer 
>> gain
>>   access to the ciphertext, but is not permitted to know the secret key.
>> 
>> - SSL web servers, where a customer of an ISP may be able to provide 
>> content
>>   delivered using SSL, and can gain access to the ciphertext, but should 
>> not
>>   be able to derive the key.
>> 
>> There are attacks that reduce the computational cost of deriving keying 
>> materials against known crypto algorithms; however, those attacks typically 
>> do not signifcantly weaken the cipher.  Where they do, we have a special 
>> term we can use to describe the algorithm: "broken".
>> 
>> Many crypto protocols (that is to say, conventions involving the use of 
>> crypto) include "salt" or "initial vectors" (IVs) to limit the 
>> effectiveness of dictionary attacks and known-plaintext attacks by causing 
>> the same plaintext to be encrypted differently each time it is encrypted. 
>> These are typically pseudo-random values, or in the case of chained crypto 
>> modes, earlier data in the ciphertext or cleartext, or in the case of 
>> counter mode, a incrementing counter.
>
> So, if you know that multiple superblock copies are going to be written at 
> predictable places on the disk, and you know that the these copies are 
> identical, unchanging, and have predictable contents, does that give you a 
> starting point for a known-plaintext attack?  I believe that is the question 
> here.  Even if the block granularity of GBDE ensures that the superblocks 
> will be encrypted with other less-predictable data, could you still predict 
> that the outter cylinder groups of the disk might be unused, and therefore 
> have lots of predictable data on them?

A detailed description of the gbde design can be found here:

   http://phk.freebsd.dk/pubs/bsdcon-03.gbde.paper.pdf

While I don't know too much about the details, the paper indicates that there 
are a few elements of the gbde design to counter the known plaintext nature of 
the superblocks, cylinder groups, etc:

(1) Random sector relocation.  Without the keying material necessary to unlock
     the master key sector, you can't trivially locate the block in question.
     A portion of the master key is used to cryptographically generate the
     actual physical sector storage location for the logical sector.

(2) A generated per-sector key, stored in key storage sectors, is used to
     encrypt each sector.  So while the IV paramater to the crypto algorithm is
     zero, the keying material is different, so the same superblock contents
     will be encrypted to different ciphertet.

The first portion addresses an initial concern about locating sectors believed 
to have predictable content, and the second portion addresses the encryption 
of the same data to the same plaintext.  In the administrative directions in 
the paper, it's also recommended that the disk surface be completely 
randomized before beginning so that there isn't a usefully identifiable 
difference between "never written" and "in use or deleted" blocks when 
initialized on a disk with arbitrary (but possibly non-random) contents.  A 
quick look at the gbde(8) admin tool suggests that it does not randomize disk 
contents, but I've not read closely.

Robert N M Watson
Computer Laboratory
University of Cambridge