xfs
[Top] [All Lists]

RE: Problems with many processes copying large directories across an XF

To: Simon Matter <simon.matter@xxxxxxxxxxxxxxxx>, Adrian Head <adrian.head@xxxxxxxxxxxxxxx>
Subject: RE: Problems with many processes copying large directories across an XFS volume.
From: Adrian Head <adrian.head@xxxxxxxxxxxxxxx>
Date: Mon, 10 Sep 2001 18:29:55 +1000
Cc: linux-xfs@xxxxxxxxxxx
Sender: owner-linux-xfs@xxxxxxxxxxx
Thanks for your reply Simon

Yes the softraid was fully synced before I started any test.

The XFS patch I used to obtain these errors was
patch-2.4.9-xfs-2001-08-19 and the errors were:
Sep 9 05:13:46 ATLAS kernel: 02:86: rw=0, want=156092516, limit=360
Sep 9 05:13:46 ATLAS kernel: attempt to access beyond end of device

When I used a later version of the XFS patch I had more descriptive
errors written to /var/log/messages:
Sep 10 10:14:57 ATLAS kernel: I/O error in filesystem ("md(9,0)")
meta-data  dev 0x900 block 0x9802bdc 
Sep 10 10:14:57 ATLAS kernel: (xlog_iodone") error  5 buf count 32768 
Sep 10 10:14:57 ATLAS kernel:  xfs_force_shutdown(md(9,0),0x2) called
from line 940 of file xfs_log.c.  Return address - 0xd8cb66f8 
Sep 10 10:14:57 ATLAS kernel: Log I/O Error  Detected. Shutting down
filesystem: md(9,0) 
Sep 10 10:14:57 ATLAS kernel:  Please umount the filesystem, and rectify
the problem(s) 
Sep 10 10:14:57 ATLAS kernel: xfs_force_shutdown(md(9,0),0x2) called
from line 714 of file  xfs_log.c. Return address = 0xd8cb65d3 
Sep 10 10:14:57 ATLAS kernel: attempt  to access beyond end of device 
Sep 10 10:14:57 ATLAS kernel: 02:82: rw=0,  want=1602235696, limit=4

I did think at the time that it may have been issues with XFS stomping
all over raid code or raid code stomping all over XFS.  Although I not
sure now as the 2.4.10-pre2-xfs-2001-09-02 patch never wrote any errors
out at all. (please see my 2nd post for more info)

Thanks for taking the time to test this on your own machine.

Adrian Head
Bytecomm P/L

> -----Original Message-----
> From: Simon Matter [SMTP:simon.matter@xxxxxxxxxxxxxxxx]
> Sent: Monday, 10 September 2001 17:45
> To:   adrian.head@xxxxxxxxxxxxxxx
> Cc:   linux-xfs@xxxxxxxxxxx
> Subject:      Re: Problems with many processes copying large
> directories across an XFS  volume.
> 
> Hi Adrian
> 
> I did similar tests two months ago. I was having problems as well but
> ufurtunately I don't remember what is was exactly.
> First question: You created Softraid5, was the raid synced when you
> started the tests?
> 
> > In the /var/log/messages log around the same time as the copy test I
> get
> > entries like:
> > Sep 9 05:13:46 ATLAS kernel: 02:86: rw=0, want=156092516, limit=360
> > Sep 9 05:13:46 ATLAS kernel: attempt to access beyond end of device
> 
> This looks interesting. I don't know what this means exactly but it
> looks to me like you managed to create a filesystem bigger than the
> raid
> volume was? I got the very same error when I tried to restore data
> with
> xfsrestore from DAT (xfsrestore from DLT was fine). The issue is still
> open.
> 
> I have a test system here with SoftRAID5 on 4 U160 SCSI disks. I'll
> try
> to kill it today with cp jobs.
> 
> -Simon
> 


<Prev in Thread] Current Thread [Next in Thread>