From: Joey Mingrone
To: freebsd-cluster@freebsd.org
Cc: "J. P. Bielawski", Katherine Dunn
Date: Tue, 15 Dec 2009 00:38:30 -0400
Subject: Sun Grid Engine: submitting a lot of jobs results in occasional errors

We are using FreeBSD 8.0 / sge-6.2.2.1_1 on a computing cluster (http://awarnach.mathstat.dal.ca/). It has been working well; however, we see the error below when many jobs are submitted in quick succession using "qsub":

error: commlib error: can't connect to service (Address already in use)
Unable to run job: failed sending gdi request.
Exiting.

Cheers,

Joey Mingrone