Date: Wed, 22 Sep 2010 21:23:28 +0200
From: Ralf Gross
To: Emmanuel Florac
Cc: xfs@oss.sgi.com
Subject: Re: XFS internal error xfs_da_do_buf(2)
Message-ID: <20100922192328.GC23326@pirx.askja.de>
References: <20100922072653.GA23326@pirx.askja.de> <20100922103134.4e652651@galadriel.home>
In-Reply-To: <20100922103134.4e652651@galadriel.home>

Emmanuel Florac wrote:
> On Wed, 22 Sep 2010 09:26:53 +0200 you wrote:
>
> > Infortrend RAID with BBU -> DRBD -> LVM -> XFS
>
> I'm not sure I understand; DRBD is used to mirror devices, are you
> using 2 Infortrend arrays, or one split into two volumes?

Yes, 2 DRBD/heartbeat nodes and 2 Infortrend systems.

> In any case, battery protected write cache shouldn't create any
> corruption (OTOH it will enhance performance dramatically). AFAIK all
> RAID arrays mirror the drives' cache in the array cache until it hits
> the platters. There may be some "cheating" hard drives (I heard that WD
> isn't always totally clear about cache policies) but Seagate and Hitachi
> professional grade drives definitely are OK. Hint: don't use desktop
> hard drives in any case because they probably cheat about cache
> synchronisation.

The drives are 15000 RPM SAS drives, so no desktop drives. As far as I
understand the documentation, when the "Delayed Drive Write" RAID option
is enabled, there is the possibility of data loss.

> I see that you may have been running rsync at the time of failure; there
> are quite a lot of quite unclear (to me at least) cases of XFS failures
> when running rsync to copy large amounts of data. Is it a fully 64-bit
> system?

There were many more lines with this error in the logs, triggered by
smbd, rsync, bacula...

Ralf