From owner-freebsd-fs@freebsd.org Wed Sep 9 12:28:44 2015 Return-Path: Delivered-To: freebsd-fs@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 54E87A00E9E for ; Wed, 9 Sep 2015 12:28:44 +0000 (UTC) (envelope-from rmacklem@uoguelph.ca) Received: from esa-annu.net.uoguelph.ca (esa-annu.mail.uoguelph.ca [131.104.91.36]) by mx1.freebsd.org (Postfix) with ESMTP id D1DA11996 for ; Wed, 9 Sep 2015 12:28:43 +0000 (UTC) (envelope-from rmacklem@uoguelph.ca) IronPort-PHdr: 9a23:j0JndBICXHC9zZlXUdmcpTZWNBhigK39O0sv0rFitYgVKfXxwZ3uMQTl6Ol3ixeRBMOAu64C0rad7/CocFdDyKjCmUhKSIZLWR4BhJdetC0bK+nBN3fGKuX3ZTcxBsVIWQwt1Xi6NU9IBJS2PAWK8TWM5DIfUi/yKRBybrysXNWC04Lui6vuq9X6WEZhunmUWftKNhK4rAHc5IE9oLBJDeIP8CbPuWZCYO9MxGlldhq5lhf44dqsrtY4q3wD89pozcNLUL37cqIkVvQYSW1+ayFmrPDtrgTJGAuT+mMHACJRlhtTHxOD4gv3U53qvm39rOU63SCbOcj/S/cwWC++7qFlT1jmkioKPSU1tW/M2fB32ZhSowmhpgB/i7DZZoKcKPdlfuuJY8kdTmkbDu5eUiVABsW3aI5ZXMQbOuMNlYj2pBMrpBC9AQSpTLf1zzZDhXv72IUn1Os8HAXe3EorFoRd4zzvsNzpOfJKAqiOx67SwGCGNqsO1A== X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: A2DBBAD3JPBV/61jaINdg3dpBoMeu3cKhS9KAoFyEQEBAQEBAQEBgQmCHYIGAQEBAwEBAQEgKyALBQsCAQgOBAYCAg0ZAgIhBgEJGAENAgQIBwQBCBQEh3gDCggNtVOPQg2EcAEBAQEGAQEBAQEdgSKFUYN2gQWCT4FrAQEFAhUBMweCLjsSgTEFlRg+hQqFDnWDOYQzjT2DUINqAiaEHCIzB4Z7CBcjgQUBAQE X-IronPort-AV: E=Sophos;i="5.17,496,1437451200"; d="scan'208";a="237419475" Received: from nipigon.cs.uoguelph.ca (HELO zcs1.mail.uoguelph.ca) ([131.104.99.173]) by esa-annu.net.uoguelph.ca with ESMTP; 09 Sep 2015 08:28:42 -0400 Received: from localhost (localhost [127.0.0.1]) by zcs1.mail.uoguelph.ca (Postfix) with ESMTP id 2D2E815F565; Wed, 9 Sep 2015 08:28:42 -0400 (EDT) Received: from zcs1.mail.uoguelph.ca ([127.0.0.1]) by localhost (zcs1.mail.uoguelph.ca [127.0.0.1]) (amavisd-new, port 10032) with ESMTP id KZ39HIe-bpKM; Wed, 9 Sep 2015 08:28:40 -0400 (EDT) Received: from localhost (localhost [127.0.0.1]) by zcs1.mail.uoguelph.ca (Postfix) with ESMTP id C726C15F56D; Wed, 9 Sep 2015 08:28:40 -0400 (EDT) X-Virus-Scanned: amavisd-new at zcs1.mail.uoguelph.ca Received: from zcs1.mail.uoguelph.ca ([127.0.0.1]) by localhost (zcs1.mail.uoguelph.ca [127.0.0.1]) (amavisd-new, port 10026) with ESMTP id wIaoUjDU2DRl; Wed, 9 Sep 2015 08:28:40 -0400 (EDT) Received: from zcs1.mail.uoguelph.ca (zcs1.mail.uoguelph.ca [172.17.95.18]) by zcs1.mail.uoguelph.ca (Postfix) with ESMTP id A243A15F565; Wed, 9 Sep 2015 08:28:40 -0400 (EDT) Date: Wed, 9 Sep 2015 08:28:40 -0400 (EDT) From: Rick Macklem To: Rakshith Venkatesh Cc: Mark Saad , freebsd-fs@freebsd.org, Jordan Hubbard , Outback Dingo Message-ID: <1276471646.4354662.1441801720640.JavaMail.zimbra@uoguelph.ca> In-Reply-To: References: <100306673.40344407.1441279047901.JavaMail.zimbra@uoguelph.ca> <1564D4FA-9BE1-4E37-8E91-F14A009D6B62@icloud.com> <838814506.1858817.1441577912291.JavaMail.zimbra@uoguelph.ca> <8551C101-0E4B-4619-BF52-1B87BC6565D8@longcount.org> Subject: Re: CEPH + FreeBSD MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Originating-IP: [172.17.95.12] X-Mailer: Zimbra 8.0.9_GA_6191 (ZimbraWebClient - FF34 (Win)/8.0.9_GA_6191) Thread-Topic: CEPH + FreeBSD Thread-Index: QzgDbtFzb3x+SYm6lo9C4wIMZtp4Kg== X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 09 Sep 2015 12:28:44 -0000 Rakshith Venkatesh wrote: > If there is a need to support cifs along with NFS then locking surely is > needed to maintain data sanity and avoid problems due to concurrent acces= s > to files. For my case, this doesn't affect choice of cluster file system, since the l= ocking is taken care of by the NFSv4.1 server. You are, however, correct that it w= ould be really nice to have the NFSv4.1 server working more closely with Samba (or other o= pen source CIFS implementations) for the Windows op locks, etc. Right now, the NFSv4.1 server can optionally grab POSIX style advisory lock= s on the underlying file system so that Samba (and local syscalls) can see them. It does nothing w.r.t. the Opens (which are basically Windows op locks, or = whatever Samba calls them? I'll admit I've avoided this, since I know nothing about Samba. (I'm only o= ne retired guy doing this in his spare time.) Soneday, it would be nice if someone who knows the Samb= a side could work on this? rick > On 09-Sep-2015 7:55 am, "Mark Saad" wrote: >=20 > > > > On Sep 8, 2015, at 9:59 PM, Outback Dingo wrot= e: > > > > > > > > On Wed, Sep 9, 2015 at 11:31 AM, Mark Saad wro= te: > > > >> All > >> What about leofs. It's in ports has and s3 obj store and NFS support > >> out of the box > >> > >> > >> http://www.freshports.org/databases/leofs/ > >> http://leo-project.net > > > > > > LeoFS supperts NFSv3 and does not have a lock manager.... > > > > > > Well, nlm it's not a hard requirement for most NFS things . NFS is ther= e > > for direct access of the data . So in its current form it's not a pnfs > > setup , but there is no reason it could not be extended . > > > > Well it could be worse , manta, joyent's compute and storage platform = has > > a NFS server written in node.js . Yes nfs in JavaScript , what else wou= ld > > you expect from them . :) > > > > > > --- > > Mark Saad | nonesuch@longcount.org > > > > > > > >> > >> > >> > >> --- > >> Mark Saad | nonesuch@longcount.org > >> > >> > On Sep 6, 2015, at 6:18 PM, Rick Macklem wrot= e: > >> > > >> > Jordan Hubbard wrote: > >> >> > >> >>> On Sep 3, 2015, at 4:17 AM, Rick Macklem > >> wrote: > >> >>> > >> >>> Slightly off topic but, btw, there is a port of GLusterFS and thos= e > >> folks > >> >>> do seem > >> >>> interested in seeing it brought "up to speed". I am not sure how > >> mature it > >> >>> is at > >> >>> this point, but it has been known to build on amd64. (I don't have= an > >> amd64 > >> >>> machine, > >> >>> so I haven't gotten around to building/testing it, but I do plan t= o > >> try and > >> >>> use > >> >>> it as a basis for a pNFS server, if I can figure out how to get th= e > >> FH info > >> >>> out of it. > >> >>> I'm working on that;-) > >> >> > >> >> There are at least two distributed (multi-node) object stores for > >> FreeBSD > >> >> that I know of. > >> >> > >> >> One is glusterfs, for which I=E2=80=99m not even really clear on th= e status of > >> the > >> >> ports for. I don=E2=80=99t see any glusterfs port in the master br= anch of > >> >> https://github.com/freebsd/freebsd-ports (or > >> >> https://github.com/freebsd/freebsd-ports/tree/branches/2015Q3 for t= hat > >> >> matter). > >> >> > >> >> Our FreeNAS ports tree (https://github.com/freenas/ports), in which > >> we have a > >> >> bit more latitude to add and curate our own ports, has both a > >> net/glusterfs > >> >> and sysutils/glusterfs, from separate sources (looks like we need t= o > >> clean > >> >> things up) - net/glusterfs lists craig001@lerwick.hopto.org as the > >> >> MAINTAINER and is at version 3.6.2. The sysutils/glusterfs port li= sts > >> >> bapt@FreeBSD.org as the MAINTAINER and is at version 20140811. > >> >> > >> >> I=E2=80=99m not really sure about the provenance since we were simp= ly > >> evaluating > >> >> glusterfs for awhile and may have pulled in interim versions from t= hose > >> >> sources, but obviously it would be best to have an official maintai= ner > >> and > >> >> someone in the FreeBSD project actually curating a glusterfs port s= o > >> that > >> >> all users of FreeBSD can use it. It would also be fairly key to yo= ur > >> own > >> >> efforts, assuming you decide to pursue glusterfs as a foundation > >> technology > >> >> for pNFS. > >> >> > >> >> The other object store, which is pretty mature and is currently > >> leading the > >> >> pack (of two :) ) for inclusion into FreeNAS is RiakCS from Basho. > >> There is > >> >> a port currently in databases/riak but it=E2=80=99s pretty out of d= ate at > >> version > >> >> 1.4.12 (the current version is 2.0.1, with 2.0 being a major upgrad= e of > >> >> RiakCS). > >> >> > >> >> We are very interested in investigating various ways of shimming > >> RiakCS to > >> >> NFS, using RiakCS a back-end store. Is that something you=E2=80= =99d be > >> amenable to > >> >> discussing? I=E2=80=99d be happy to send you an amd64 architectur= e machine to > >> >> develop on. :) > >> > Hmm. From a quick look at their web page (I looked once before as > >> well), I don't > >> > think RiakCS has what I need to do pNFS in a reasonable (for me) amo= unt > >> of effort. > >> > Two things that glusterFS has that I am hoping to use (and I don't > >> think RiakCS has > >> > either of these) are: > >> > - A Fuse file system interface which allows the kernel nfsd to acces= s > >> the store as > >> > a file system, so that it can provide the metadata services (NFS > >> without the reads/writes). > >> > - A userland NFSv3 server in each node which will allow the node to = act > >> as a data server. > >> > > >> > If I am wrong and RiakCS does support a VFS file system interface (v= ia > >> Fuse or ???), then > >> > please correct me. With that, it might be a reasonable alternative. > >> > I'll admit I've spent a little time looking at the glusterFS sources > >> and haven't yet > >> > solved the problem of how to generate the file handles I need, but t= hat > >> sounds trivial > >> > compared with an entire Fuse and/or VFS file system interface, I thi= nk? > >> > > >> > In general, using a cloud object store to implement a pNFS server is= a > >> *mis*use of > >> > the technology, imho. I think it may be possible with glusterFS, sin= ce > >> that technology > >> > seems to be based on a cluster file system, which is what a pNFS ser= ver > >> can also use. > >> > > >> > I think there would be a lot of work involved in mapping a POSIX fil= e > >> system onto the > >> > Riak database and then exporting that via NFS, etc. It might also be > >> more practical to > >> > do this via a userland NFS service than the kernel based one current= ly > >> in FreeBSD. > >> > (glusterFS is starting to use the NFS-ganesha server, but I believe = it > >> is pretty Linux specific, > >> > so I doubt it would be useful for Riak running on FreeBSD?) > >> > > >> > rick > >> > > >> >> - Jordan > >> > _______________________________________________ > >> > freebsd-fs@freebsd.org mailing list > >> > https://lists.freebsd.org/mailman/listinfo/freebsd-fs > >> > To unsubscribe, send any mail to "freebsd-fs-unsubscribe@freebsd.org= " > >> _______________________________________________ > >> freebsd-fs@freebsd.org mailing list > >> https://lists.freebsd.org/mailman/listinfo/freebsd-fs > >> To unsubscribe, send any mail to "freebsd-fs-unsubscribe@freebsd.org" > >> > > > > >=20