From owner-freebsd-questions@FreeBSD.ORG Thu Apr 26 16:16:02 2007 Return-Path: X-Original-To: freebsd-questions@freebsd.org Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 90D0616A407 for ; Thu, 26 Apr 2007 16:16:02 +0000 (UTC) (envelope-from garrisot@otc.edu) Received: from MX1.otc.edu (mx1.otc.edu [198.209.160.201]) by mx1.freebsd.org (Postfix) with ESMTP id 5447213C4BD for ; Thu, 26 Apr 2007 16:16:02 +0000 (UTC) (envelope-from garrisot@otc.edu) X-ASG-Debug-ID: 1177604161-2e3b002e0000-jLrpzn X-Barracuda-URL: http://198.209.160.201:8000/cgi-bin/mark.cgi X-ASG-Whitelist: Sender Received: from EXM1.otc.edu (exn1.otc.edu [172.16.2.131]) by MX1.otc.edu (Spam Firewall) with ESMTP id 5922D3A42D for ; Thu, 26 Apr 2007 11:16:01 -0500 (CDT) Received: from EXM1.otc.edu (exn1.otc.edu [172.16.2.131]) by MX1.otc.edu with ESMTP id wHXC0DrZ5jjwBrHv for ; Thu, 26 Apr 2007 11:16:01 -0500 (CDT) X-MimeOLE: Produced By Microsoft Exchange V6.5 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable X-ASG-Orig-Subj: RE: Single Instance Service Date: Thu, 26 Apr 2007 11:16:00 -0500 Message-ID: <06D1B6D4926222458F803D0D3EDCCB7E01D0A790@EXM1.otc.edu> In-Reply-To: <200704260635.l3Q6ZZhL090019@banyan.cs.ait.ac.th> X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: Single Instance Service Thread-Index: AceHzR6oh+AWJv48QlyL0X7r3Z+8CwAUEYwg From: "GARRISON, TRAVIS J." To: X-Barracuda-Connect: exn1.otc.edu[172.16.2.131] X-Barracuda-Start-Time: 1177604161 X-Barracuda-Virus-Scanned: by OTC E-Mail System at otc.edu Subject: RE: Single Instance Service X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 26 Apr 2007 16:16:02 -0000 > -----Original Message----- > From: owner-freebsd-questions@freebsd.org [mailto:owner-freebsd- > questions@freebsd.org] On Behalf Of Olivier Nicole > Sent: Thursday, April 26, 2007 1:36 AM > To: m.seaman@infracaninophile.co.uk > Cc: GARRISON, TRAVIS J.; freebsd-questions@freebsd.org > Subject: Re: Single Instance Service >=20 > > Sure it is. You will need to write a small shell script to scan > > your disk volume and calculate the checksum of each file. When > > ever it finds a duplicated checksum, then it copies the file into > > the central store and replaces the on-disk copies with symbolic > > links. That's fairly trivial to write. >=20 > Beside, what should be the behaviour when one wishes to modify his own > copy of a document? How does Single Instance acts in that case? >=20 > If you establish a link, there is only one version of the file, once > and forever (unless you go and unlink it manually), so when one > modifies the file, modification applies for everyone. >=20 > Olivier [GARRISON, TRAVIS J.]=20 I know with Windows Storage Server, if a user modifies the file, it will then create the user their own copy of the file. This happens automatically. Exchange Server is another example of this type of storage. When someone sends an attachment to several people, the server saved one copy of the file. I am currently managing 7TB worth of data with roughly 1 to 2TB of duplicate files. This gets fairly expensive with a fiber channel san backend. I know it can be done in the windows world automatically, just wondered if it could be done automatically in the Unix world also.