Received: from oss.sgi.com (localhost [127.0.0.1]) by oss.sgi.com (8.12.3/8.12.3) with ESMTP id g4L7LYnC027696 for ; Tue, 21 May 2002 00:21:34 -0700 Received: (from majordomo@localhost) by oss.sgi.com (8.12.3/8.12.3/Submit) id g4L7LYMu027695 for linux-xfs-outgoing; Tue, 21 May 2002 00:21:34 -0700 X-Authentication-Warning: oss.sgi.com: majordomo set sender to owner-linux-xfs@oss.sgi.com using -f Received: from smtpzilla1.xs4all.nl (smtpzilla1.xs4all.nl [194.109.127.137]) by oss.sgi.com (8.12.3/8.12.3) with SMTP id g4L7LPnC027669 for ; Tue, 21 May 2002 00:21:26 -0700 Received: from auto-nb1.xs4all.nl (213-84-127-28.adsl.xs4all.nl [213.84.127.28]) by smtpzilla1.xs4all.nl (8.12.0/8.12.0) with ESMTP id g4L7MBvZ046955; Tue, 21 May 2002 09:22:12 +0200 (CEST) Message-Id: <4.3.2.7.2.20020521085022.03905aa8@pop.xs4all.nl> X-Sender: knuffie@pop.xs4all.nl X-Mailer: QUALCOMM Windows Eudora Version 4.3.2 Date: Tue, 21 May 2002 09:24:05 +0200 To: Sidik Isani , Steve Lord From: Seth Mos Subject: Re: "Corruption of in-memory data" Cc: Sidik Isani , linux-xfs@oss.sgi.com In-Reply-To: <20020520102853.C18897@cfht.hawaii.edu> References: <1021923461.4832.335.camel@jen.americas.sgi.com> <20020520090515.B18897@cfht.hawaii.edu> <1021923461.4832.335.camel@jen.americas.sgi.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; format=flowed Sender: owner-linux-xfs@oss.sgi.com Precedence: bulk At 10:28 20-5-2002 -1000, Sidik Isani wrote: > It's a software raid5, and only one of 6 disks failed. To XFS, > the device should have been completely functional. That's the whole idea :-) > > looks something like that. Destruction in these areas also appears > > pretty drastic. > > Oh, there's no worry about the contents of this filesystem. I was > actually trying to do the benchmarks on performance that Seth > mentioned had not been done. If I get them done, I'll post the > results here. I'd just really like to understand what happened. > If raid5 is to blame here, there isn't much point in using it! > Any suggestions on the best way to narrow it down? > > Maybe it is worth starting over, with the bad disk still in there, > to see if it happens again. The sequence was: I installed my software raid5 with xfs in degraded mode. It consisted of a 3 disk array with one disk as a failed (which still contained data). Formatted the degraded array with XFS. Copied the data to the degraded raid5 array. Added the old data disk into the raid 5 array and started reconstruction. This worked for me. It might be that the error during resyncing upset the raid/xfs/ide layer for reasons which can not be explained. Cheers -- Seth It might just be your lucky day, if you only knew.