xfs
[Top] [All Lists]

Re: 2.6.18-rc3-git3 - XFS - BUG: unable to handle kernel NULL pointer de

To: "Nathan Scott" <nathans@xxxxxxx>
Subject: Re: 2.6.18-rc3-git3 - XFS - BUG: unable to handle kernel NULL pointer dereference at virtual address 00000078
From: "Jesper Juhl" <jesper.juhl@xxxxxxxxx>
Date: Tue, 15 Aug 2006 13:42:27 +0200
Cc: linux-kernel@xxxxxxxxxxxxxxx, xfs@xxxxxxxxxxx
Domainkey-signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=KdjBohtkCs50dRtlHHkx4xnI2+Znc5UNRohDQfNFu7sQxol/ju2W4yX8fHhSDNDJXidNqRnYiaGQx0KFpcf3fALHaM+aNyArNM+/+GP2OvJVr63G8woUankKW5BH+1nRFmgKIxLbs4v98tTbvvjU1cWcTtwPWiDID6pvVIjxyjA=
In-reply-to: <20060815190343.A2743401@xxxxxxxxxxxxxxxxxxxxxxxx>
References: <9a8748490608080137k596a6290r3567096668449a64@xxxxxxxxxxxxxx> <9a8748490608100431m244207b1v9c9c5087233fcf3a@xxxxxxxxxxxxxx> <20060811083546.B2596458@xxxxxxxxxxxxxxxxxxxxxxxx> <9a8748490608101544n29f863e7o7584ac64f1d4c210@xxxxxxxxxxxxxx> <9a8748490608101552w12822fa6m415a5fb5537c744d@xxxxxxxxxxxxxx> <9a8748490608110133v5f973cf6w1af340f59bb229ec@xxxxxxxxxxxxxx> <9a8748490608110325k25c340e2yac925eb226d1fe4f@xxxxxxxxxxxxxx> <20060814120032.E2698880@xxxxxxxxxxxxxxxxxxxxxxxx> <9a8748490608140049t492742cx7f826a9f40835d71@xxxxxxxxxxxxxx> <20060815190343.A2743401@xxxxxxxxxxxxxxxxxxxxxxxx>
Sender: xfs-bounce@xxxxxxxxxxx
On 15/08/06, Nathan Scott <nathans@xxxxxxx> wrote:
On Mon, Aug 14, 2006 at 09:49:10AM +0200, Jesper Juhl wrote:
> On 14/08/06, Nathan Scott <nathans@xxxxxxx> wrote:
> > > LEAFN node level is 1 inode 412035424 bno = 8388608
> >
> > Ooh.  Can you describe this test case you're using?
> ...
> These filesystems vary in size from 50G to 3.5T
>
> The XFS filesystems contain rsync copies of filesystems on as many servers.
> The workload that triggers the problem is when all the servers start
> updating their copy via rsync - Then within a few hours the problem
> triggers.
>
> So to recreate the same scenario you'll want 28 servers doing rsync of
> filesystems of various sizes between 50G & 3.5T to a central server
> running 2.6.18-rc4 with 28 XFS filesystems.
> ...
> There are millions of files. The data the server recievs is copies of
> websites. Each server that sends data to the server with the 28 XFS
> filesystems hosts between 1800 and 2600 websites, so there are lots of
> files and every concievable strange filename.

Wow, a special kind of hell for a filesystem developer...! ;-)

Its not clear to me where the rename operation happens in all of
this - does rsync create a local, temporary copy of the file and
then rename it?

I'm not sure. I'll investigate and see if I can work out what exactely
rsync does.

Is it that central server going down or one of
those 28 other server machines?  (I assume it is, I can't see an
opportunity for renaming out there...).

It's the central server with all the xfs filesystems that dies. The
one recieving all the rsync data.


When you hit it again, could you grab the contents of the inode
(you'll get that from xfs_repair -n, e.g. 412035424 above) with
xfs_db (see last entry in the XFS FAQ which describes how to do
that), then mail that to me please?

Sure, I'll read up on that and make sure to grab that info next time.


If you can get the source
and target names in the rename that'll help alot too... I can
explain how to use KDB to get that, but maybe you have another
debugger handy already?

An explanation of how exactely to do that would be greatly appreciated.

--
Jesper Juhl <jesper.juhl@xxxxxxxxx>
Don't top-post  http://www.catb.org/~esr/jargon/html/T/top-post.html
Plain text mails only, please      http://www.expita.com/nomime.html


<Prev in Thread] Current Thread [Next in Thread>