From: Sossi Andrej
Organization: DotCom Information technology
Date: Mon, 17 Nov 2008 18:15:49 +0100
To: "Paul B. Mahol"
Cc: freebsd-stable@freebsd.org
Subject: Re: swap_pager: indefinite wait
Message-ID: <4921A6C5.6000107@dotcom.ts.it>
In-Reply-To: <3a142e750811170859o226d3d91r45ac4106821e6fae@mail.gmail.com>
References: <49219959.8040209@dotcom.ts.it> <3a142e750811170859o226d3d91r45ac4106821e6fae@mail.gmail.com>
List-Id: Production branch of FreeBSD source code

Paul B. Mahol wrote:
> On 11/17/08, Sossi Andrej wrote:
>
>> Hello,
>> I manage a www (apache), database (postgresql) and e-mail (qmail) server
>> with about 100 Internet domains.
>> We recently purchased a DELL MD3000i server with data access through
>> the iSCSI protocol.
>> I installed a new server with FreeBSD 7.0, with a kernel compiled with
>> iSCSI (v. 2.1) and the SCHED_ULE scheduler.
>> The server's local disk configuration is:
>>    512 MB   /
>>    1024 MB  swap
>>    254 MB   /var
>>    2048 MB  /var/qmail
>>    512 MB   /tmp
>>    ~70 GB   /usr
>>
>> The apache data (/usr/local/www), postgresql data (/usr/local/pgsql) and
>> qmail data (/usr/local/vpopmail) are instead on the iSCSI partition. After
>> installing all the necessary software I tested the machine as a mail
>> server for one domain, and everything worked fine for 2-3 days.
>> I wanted to test the correct functioning of qmail and of
>> reading/writing on iSCSI. At the end of testing I decided to transfer
>> all of the data (www, DB and e-mail) to the FreeBSD 7.0 server. I
>> compressed all the data on the original server (in tar.gz format),
>> transferred it to the new server and started to decompress the files.
>> The www data were decompressed correctly onto the iSCSI disk. But while
>> the messages were being decompressed (~80GB of tar.gz) and all the DBs
>> were being restored, the server crashed.
>>
>
> Which application was used for decompression?
>

I used these commands:

    tar -xvzf filename.tar.gz        (file decompression, as root)
    psql -f filename.sql postgres    (to restore all databases)

>> The error shown was:
>>
>> swap_pager: indefinite wait buffer: bufobj: 0, blkno: 31, size: 4096
>>
>
> I believe this can happen any time too much of the swap partition is in use.
> If this sort of thing happens often, review the memory management of the
> application(s) causing it. If it is the application's fault, you have
> two choices:
> get more RAM (this may not always help) or use another application.
> Increasing the swap partition will not help much ....
>

During decompression I sometimes monitored the system with top and did not
see it use swap at all, but I did not monitor constantly.
If the problem is caused by tar, what options do I have for moving the files
from the original server to the new one quickly?

>> This message repeated every 30 seconds.
>> After restarting in single user mode I checked /var/log/messages and
>> there was no message after the crash.
>>
>> I think 1GB of swap should be enough, since with this configuration on
>> FreeBSD 5.5 everything worked for over 270 days without a reboot.
>> The local disks are 2 of ~80GB in RAID 1 (mirror) with gmirror.
>> Before installing FreeBSD I tested the 2 disks with the badblocks
>> software on Slackware, which didn't find any r/w errors.
>>
>> Can somebody help me understand why the system crashed this way, or how
>> to avoid future crashes?
>> It's a production server and I can't afford downtime or data loss.
>>
>> Thank you very much and kind regards

Thanks for your reply.

-- 
Sossi Andrej
-------------------------
DotCom Information technology
Via Biancospino, 9
34151 - Opicina (TS)
Italy
tel: +39 040 2158191
fax: +39 040 0641954
E-mail: asossi@dotcom.ts.it
----------------------------
This message, for the D.lgs n. 196 / 30.06.03 (Privacy Code), may contain
confidential and/or privileged information. If you are not the addressee
or authorized to receive this for the addressee, you must not use, copy,
disclose or take any action based on this message or any information
herein. If you have received this message in error, please advise the
sender immediately by reply e-mail and delete this message.
Thank you for your cooperation.
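
The reply above notes that top was only checked occasionally during the extraction. As a minimal sketch (not something from the thread), swap usage could instead be logged at a fixed interval while tar and psql run. It assumes that FreeBSD's swapinfo -k prints a header line followed by one line per swap device, with the used 1K blocks in the third column; the function names, the log path and the 10-second interval are hypothetical.

```shell
#!/bin/sh
# Sketch only: helper names, log path and interval are illustrative assumptions.

# Sum the "Used" column (third field, in 1K blocks) of `swapinfo -k`-style
# output read from stdin. The header line and any "Total" summary line are
# skipped so that nothing is counted twice.
swap_used_kb() {
    awk 'NR > 1 && $1 != "Total" { used += $3 } END { print used + 0 }'
}

# Append one timestamped sample to the log file named in $1.
# swapinfo is FreeBSD-specific; this function is not expected to work elsewhere.
log_swap_sample() {
    printf '%s swap_used_kb=%s\n' \
        "$(date '+%Y-%m-%d %H:%M:%S')" \
        "$(swapinfo -k | swap_used_kb)" >> "$1"
}

# Possible usage (not run automatically):
#   while :; do log_swap_sample /var/tmp/swap-usage.log; sleep 10; done &
#   tar -xvzf filename.tar.gz
#   kill $!
```

Because each sample is appended to a file on disk, the samples written before a hang should still be readable after a reboot, which may show whether swap was actually filling up before the swap_pager message appeared.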