[Dovecot] Better to use a single large storage server or multiple smaller for mdbox?

Ed W lists at wildgooses.com
Sat Apr 14 13:22:37 EEST 2012


On 14/04/2012 04:31, Stan Hoeppner wrote:
> On 4/13/2012 10:31 AM, Ed W wrote:
>> On 13/04/2012 13:33, Stan Hoeppner wrote:
>>> In closing, I'll simply say this:  If hardware, whether a mobo-down SATA
>>> chip, or a $100K SGI SAN RAID controller, allowed silent data corruption
>>> or transmission to occur, there would be no storage industry, and we'll
>>> all still be using pen and paper.  The questions you're asking were
>>> solved by hardware and software engineers decades ago.  You're fretting
>>> and asking about things that were solved decades ago.
>> So why are so many people getting excited about it now?
> "So many"?  I know of one person "getting excited" about it.

You love being vague don't you?  Go on, I'll bite again, do you mean 
yourself?

:-)

> Data densities and overall storage sizes and complexity at the top end
> of the spectrum are increasing at a faster rate than the
> consistency/validation mechanisms.  That's the entire point of the
> various academic studies on the issue.

Again, you love being vague.  By your dismissive "academic studies" 
phrase, do you mean studies done on a major industrial player, ie NetApp 
in this case?  Or do you mean that it's rubbish because they asked 
someone with some background in statistics to do the work, rather than 
asking someone sitting nearby in the office to do it?

I don't think the researcher broke into NetApp to do this research, so 
we have to conclude that the industrial partner was onboard.  NetApp 
seem to do a bunch of engineering of their own (got enough patents..) 
that I think we can safely assume they very much do their own research 
on this and it's not just "academic"...  I doubt they publish all their 
own internal research, be thankful you got to see some of the results 
this way...

>    Note that the one study required
> a sample set of 1.5 million disk drives.  If the phenomenon were a
> regular occurrence as you would have everyone here believe, they could
> have used a much smaller sample set.

Sigh... You could criticise the study if it had a small number of drives 
as being under-representive and now you criticise a large study for 
having too many observations...

You cannot have "too many" observations when measuring a small and 
unpredictable phenomena...

Where does it say that they could NOT have reproduced this study with 
just 10 drives?  If you have 1.5 million available, why not use all the 
results??


> Ed, this is an academic exercise.  Academia leads industry.  Almost
> always has.  Academia blows the whistle and waves hands, prompting
> industry to take action.

Sigh... We are back to the start of the email thread again... Gosh you 
seem to love arguing and muddying the water for zero reason but to have 
the last word?

It's *trivial* to do a google search and hit *lots* of reports of 
corruptions in various parts of the system, from corrupting drivers, to 
hardware which writes incorrectly, to operating system flaws.  I just 
found a bunch more in the Redhat database today while looking for 
something else.  You yourself are very vocal on avoiding certain brands 
of HD controller which have been rumoured to cause corrupted data... 
(and thankyou for revealing that kind of thing - it's very helpful)

Don't veer off at a tangent now: The *original* email this has spawned 
is about a VERY specific point.  RAID1 appears to offer less protection 
against a class of error conditions than does RAID6.  Nothing more, 
nothing less.  Don't veer off and talk about the minutiae of testing 
studies at universities, this is a straightforward claim that you have 
been jumping around and avoiding answering with claims of needing to 
educate me on SCSI protocols and other fatuous responses. Nor deviate 
and discuss that RAID6 is inappropriate for many situations - we all get 
that...



> There is nothing normal users need to do to address this problem.

...except sit tight and hope they don't loose anything important!

:-)


> Having the prestigious degree that you do, you should already understand
> the relationship between academic research and industry, and the
> considerable lead times involved.

I'm guessing you haven't attended higher education then?  You are 
confusing graduate and post-graduate systems...

Byee

Ed W



More information about the dovecot mailing list