Date: Wed, 20 Mar 2002 08:27:46 +0100
From: Martin Apel
To: Steve Lord
cc: "ZINKEVICIUS,MATT (HP-Loveland,ex1)"
Subject: Re: TDB corruption with Samba 2.2.3a
In-Reply-To: <1016567299.1770.128.camel@jen.americas.sgi.com>

On 19 Mar 2002, Steve Lord wrote:

> On Tue, 2002-03-19 at 01:03, Martin Apel wrote:
> > On Wed, 13 Mar 2002, Jeremy Allison wrote:
> >
> > > On Wed, Mar 13, 2002 at 05:25:13AM -0500, ZINKEVICIUS,MATT (HP-Loveland,ex1) wrote:
> > > > We are also experiencing the TDB corruptions, as reported in the samba log
> > > > files. At first there didn't seem to be any consequence, but we are now
> > > > beginning to see users not being able to log in to the machine. As a
> > > > workaround we have been just deleting the secrets.tdb file, restarting
> > > > samba, and rejoining the domain.
> > > >
> > > > Our server is very similar to Martin's (Samba 2.2.3a + Linux 2.4.17 + XFS).
> > > > I'll post more details soon.
> > >
> > > Can you try using the tdbbackup utility periodically
> > > to determine when the corruption may be occurring ?
> >
> > I have let tdbbackup run for a few days now. The TDB corruption seems to
> > happen at the time when Amanda (a great backup tool) starts to run.
> > I have moved the Amanda start time back and forth, and the corruption
> > starts within 10 minutes after starting Amanda.
> > I don't think it's Amanda's fault; I assume that Amanda puts a heavy load
> > on the filesystem layer during the first minutes, when it does its estimates.
> > A reminder: all partitions on this system are XFS partitions, including
> > /var, where Samba stores the TDB files.
> > I could try to reformat the /var filesystem with ext2 to see if this has
> > any influence. But this will probably need a server reboot, so I cannot
> > do this before the weekend.
> >
> > Martin
>
> Can you try a kernel from the 2.4 xfs cvs tree - I just pushed some
> changes out there which fix a corruption problem under heavy memory
> pressure.

Hi Steve,

since this is our production server, with 50 people depending on it,
I would rather not try the current CVS version. Is it possible to isolate
your patch relative to XFS 1.0.2?
For the record: I put the Samba lock directory onto an ext2 partition
yesterday and have had no problems since then.

Martin

________________________________________________________________________
Martin Apel, Dipl.-Inform.                t e c m a t h   A G
Group Manager Software Development        Human Solutions Division
phone +49 (0)631 303-5600                 Europaallee 10, 67657 Kaiserslautern
fax   +49 (0)631 303-5700                 Germany
apel@hs.tecmath.com                       http://www.tecmath.com
________________________________________________________________________