Zimbra offers Open Source email server software and shared calendar for Linux and the Mac
Go Back   Zimbra :: Forums > Zimbra Collaboration Suite > Administrators

Welcome to the Zimbra :: Forums!
Welcome, if you would like to post a comment please register. We also encourage you to explore all things Zimbra with our team and members of the community.

Reply
 
LinkBack Thread Tools Search this Thread Display Modes
  #1 (permalink)  
Old 03-06-2009, 10:55 AM
Intermediate Member
 
Posts: 17
Default High I/O wait times leads to a slow system

We recently went through a major upgrade in our email system (about 3500 active users) where we changed quite a few aspects of the system, all presumably helpful changes that would make the system work better and/or give us greater flexibility to make changes in the future. Now, we are experiencing significant I/O problems, especially when a large volume of messages get delivered to the server all at once, like our daily campus announcements. Briefly, here are the changes that we made:

ZCS 5.0.9 --> ZCS 5.0.12
RHEL5 32-bit --> RHEL5 64-bit
2TB SATA DAS, RAID 1-0 via fibre channel --> 5TB SATA SAN, RAID 5-0 via iSCSI
OS on a dedicated server --> OS installed on VMWare
Single server running ZCS --> One server running MTA, other server running the rest (both virtual servers)

There are three things that I think may be causing this problem, but I'm not sure which is the real culprit: iSCSI, the RAID 5-0, or VMWare. My first guess is the RAID 5-0, but our hope was that going from 4 disks in a RAID0 mirrored to another 4 disks and moving to 14 active disks in a RAID 5-0 would provide an increase in I/O speed since it was writing to many more disks at once and was managed by a SAN which should be able to run a RAID 5-0 quickly and efficiently. Before we wait for a maintenance time to implement a change that may not even work, I'm wondering if anyone has any suggestions as to what we might do to optimize our setup and make this situation better. If more detailed information on any piece of our setup would help, please let me know and I will post it. Thanks in advance for any suggestions!
Reply With Quote
  #2 (permalink)  
Old 03-06-2009, 11:09 AM
Zimbra Consultant & Moderator
 
Posts: 20,312
Default

I believe that parity based raid level will impact your performance and a RAID10 would be more beneficial. Have a look at this wiki article and check the settings for the file system and recommendations for RAID level.
__________________
Regards


Bill
Reply With Quote
  #3 (permalink)  
Old 03-06-2009, 08:55 PM
Moderator
 
Posts: 1,554
Default

have you run like iometer on the vm to see that you're actually getting the performance from the san that you're expecting? we encountered an issue when implimenting vmware esx with iscsi where vms would have horrible io performance unless the vmdk was zero'd out first it had to do with writing to clean sectors.
Reply With Quote
  #4 (permalink)  
Old 03-07-2009, 12:14 AM
Moderator
 
Posts: 7,928
Default

And how much memory is allocated to the VM ?
__________________
Reply With Quote
  #5 (permalink)  
Old 03-09-2009, 09:20 AM
Intermediate Member
 
Posts: 17
Default

phoenix -
I was afraid the RAID level was what was doing it. We're looking towards a second box for our SAN with SAS drives and putting it in a RAID 10.

bdial -
We are working on getting IOMeter up and running so we can get some good data on I/O speeds.

uxbod -
We have 16GB of memory allocated for the VM where Zimbra lives.

Thanks to all three of you for your responses. We're hoping to get these issues all worked out by this Sunday for our maintenance time.
Reply With Quote
  #6 (permalink)  
Old 03-09-2009, 09:52 AM
Former Zimbran
 
Posts: 5,606
Default

Install sar (if not already installed)

and run it. It runs a cron job every 10 minutes, and should give you an idea what's causing it.

We don't recommend RAID5, but many people run it without issue.
Reply With Quote
  #7 (permalink)  
Old 03-09-2009, 09:56 AM
Moderator
 
Posts: 7,928
Default

Is your iSCSI installation using a separate switch from everything else ?
__________________
Reply With Quote
  #8 (permalink)  
Old 03-09-2009, 01:23 PM
Intermediate Member
 
Posts: 17
Default

sar seems to be giving us the I/O data that we need. I'll wait a full day to get a real feel for how the disks behave under different loads at different times of day.

The iSCSI installation is using a dedicated switch independent of our main network.
Reply With Quote
  #9 (permalink)  
Old 03-09-2009, 01:50 PM
Former Zimbran
 
Posts: 5,606
Default

Quote:
Originally Posted by cpalsgrove View Post
sar seems to be giving us the I/O data that we need. I'll wait a full day to get a real feel for how the disks behave under different loads at different times of day.

The iSCSI installation is using a dedicated switch independent of our main network.
Use sar -B for more detailed IO info
Reply With Quote
  #10 (permalink)  
Old 03-26-2009, 01:31 PM
Intermediate Member
 
Posts: 17
Default

Two weeks ago we switched our storage over from the previously mentioned RAID-50 configuration to a RAID-10 configuration and we're still experiencing incredibly high I/O wait times which are leading to great frustration from our users. I'm really not sure what else might be causing this issue. Are there any other parts of our configuration that I should check or anything else that I've previously mentioned that might be causing such a lag in the system?
Reply With Quote
Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes


Similar Threads

Why Join?

Registering let's you ask questions, makes it easier to search, displays any files attached to posts, and notifies you about replies.

blog.zimbra.com




 

SEO by vBSEO ©2011, Crawlability, Inc.