Hi
HA, Consolidated Backup, and a couple of other technologies are what really make this an enterprise solution, providing near 24x7x365 uptime and rapid redeployment of an infrastructure after catastrophic loss of the datacenter.
Can you tell me exactly what "Consolidated Backup" means with respect to ESX please? From the brief description on the website I'm not quite sure how it differs from, say, backing up the raw storage using some kind of snapshot method?
GlusterFS isn't designed as a primary storage system for servers or server clusters. A good description of it would be "cloud storage". It is designed to mask, or make irrelevant, the location of data storage devices and the distance to them. Server and datacenter architects need to know the latency characteristics and bandwidth of storage devices backing the servers. GlusterFS is the antithesis of this.
I can't disagree in terms of achieved performance because I haven't tested, but in terms of theoretical design it is supposed to differ from how you describe it?
GlusterFS has a growing number of translators and eventually is likely to have native NFS & CIFS support straight into the cluster. So *in theory* (difference between theory and practice? In theory nothing, in practice everything.) you are getting parallel NFS performance as you add nodes, with the option of also adding redundancy and HA for free... I get the impression the current implementation deviates somewhat from theory, but long term that's the goal...
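To make the translator idea a bit more concrete, here's a toy Python sketch (not real GlusterFS code, and the host names are invented) of roughly how a distribute translator stacked on top of a replicate translator could decide where a file lives - adding replica sets grows capacity and parallelism, adding members to a set adds redundancy:

    import hashlib

    # made-up brick layout: two replica pairs (hypothetical hosts)
    BRICKS = [
        ["serverA:/export/brick", "serverB:/export/brick"],   # replica pair 1
        ["serverC:/export/brick", "serverD:/export/brick"],   # replica pair 2
    ]

    def place(filename):
        # "distribute" layer: hash the file name to pick one replica set
        h = int(hashlib.md5(filename.encode()).hexdigest(), 16)
        replica_set = BRICKS[h % len(BRICKS)]
        # "replicate" layer: a write would then go to every brick in that set
        return replica_set

    print(place("vm-disk-001.img"))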
I was giving this some thought - essentially the whole problem comes down to either some kind of filesharing system which offers up individual files, or some kind of block-level sharing where you then have to run your own filesystem over the block device.
Now, if latency were zero and the fileserver had infinite CPU/bandwidth, then it would seem like the filesharing system wins, because it centralises the locking and all the other problems and leaves relatively thin clients.
On the flip side, since latency/bandwidth very much deviates from that ideal, block-level storage initially seems more attractive to me, because the client can be given "intelligence" about the constraints and make appropriate choices about fetching blocks, ordering, caching, flushing, etc. However, if we assume active/active clusters are required then we need GFS or similar, and we have just added a whole heap of latency and locking management. This, plus the latency of translating a disk-based protocol (SCSI/ATA) into network packets, suddenly makes the block-level option look a lot less attractive...
So the final conclusion seems to be that it's a hard problem and the "best" solution is going to come down to an engineering decision - i.e. where theory and practice deviate, and which one actually gets the job done fastest in practice?
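To put some (entirely made-up) numbers on that trade-off, here's a toy Python sketch contrasting a single file-server round trip against a block-level read that also has to talk to a distributed lock manager - the point is just that round trips, not raw bandwidth, start to dominate once cluster locking enters the picture:

    RTT_MS = 0.5   # assumed LAN round-trip time, invented for illustration

    def file_level_read():
        # file-sharing protocol: one request/response; locking handled server-side
        return 1 * RTT_MS

    def block_level_read(blocks=4, lock_trips=2):
        # shared block device + cluster FS: DLM lock traffic plus a fetch per block
        return (lock_trips + blocks) * RTT_MS

    print("file-level read:  %.1f ms" % file_level_read())    # 0.5 ms
    print("block-level read: %.1f ms" % block_level_read())   # 3.0 ms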
At least in theory it seems like Gluster should be able to rival the speed of a high-end iSCSI SAN - whether the practical engineering problems are ever solved is a different matter... (Random quote - http://www.voicesofit.com/blogs/blog1.php/2009/12/29/gluster-the-red-hat-of-...
- Gluster claims 131,000 IOPS on some random benchmark using 8 servers and 18TB of storage...)
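Just dividing their own figures through for scale (toy Python, nothing clever):

    iops, servers = 131000, 8
    print(iops / servers)   # ~16,375 IOPS per server on that benchmark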
Interesting seeing how this stuff is maturing though! Sounds like the SAN is still king for people who just want something fast, reliable and off the shelf today...
Ed W