On Thu, Feb 19, 2009 at 05:19:25PM +1100, Dave Chinner wrote:
> On Wed, Feb 18, 2009 at 10:36:59AM +0100, Carsten Aulbert wrote:
> > >> plus a few more nodes showing the same characteristics
> > >
> > > Hmmmm. Did this show up in 126.96.36.199? Or did it start occurring only
> > > after you upgraded from .10 to .14?
> > As far as I can see this only happened after the upgrade about 14 days
> > ago. What strikes me odd is that we only had this occurring massively on
> > Monday and Tuesday this week.
> > I don't know if a certain access pattern could trigger this somehow.
> I suspect so. We've already had XFS trigger one bug in the new
> lockless pagecache code, and the fix for that went in 188.8.131.52 -
> between the good version and the version that you've been seeing
> these memory corruptions on. I'm wondering if that fix exposed or
> introduced another bug that you've hit....
Highly unlikely. It only introduces constraints on how the
compiler may generate code, so it would have to be a compiler
bug to cause a bug I think.
I wonder how long you've been running with 2.6.27 based kernels