Hello, all. After weeks of troubleshooting, we have discovered a conflict between the zimbra mysql/innodb process and the vserver hashify process. We are running zimbra on CentOS 5.3 on VServer with reasonable success other than this major issue.
Whenever we run vserver <zimbra server name> hashify, we see odd
behaviors:
* The mysql and zimbra database directories are time stamped with
the time hashify ran as if mysql was restarted but there is no
record of mysql being restarted.
* MOST IMPORTANTLY, the innodb log stops recording but it appears
that mysql is still functioning. Mail is displayed, events
(e.g., calendar, tasks) are recorded and are fully functional
The disaster occurs when mysql is restarted, e.g., when restarting the
zimbra service. Since the innodb log has recorded no data since hashify
ran, mysql thinks the system has crashed and backs out all data since
the last hashify! At least, I believe that's what is happening in my
MySQL ignorance.
As some background on hashify, the hashify process is not an essential process but it is the single VServer feature which tipped us to choose VServer over OpenVZ and is a life saver in the managed service provider market.
There may be hundreds of vservers on a host. This density is a result
of the excellent resource utilization of VServer. Since many if not most of the systems may duplicate file, e.g., the majority of the distribution binaries, hashify is used to eliminate the redundancy both on disk and in RAM.
The vserver files are hashed. Any files with the same hash are replaced
with immutable but unlinkable hard links except for one base copy. As a result, one hundred instances of the file will take little more space than a single instance on both disk and in RAM. Explanations can be found here
util-vserver:Vhashify - Linux-VServer and here
Frequently Asked Questions - Linux-VServer
Any ideas on why this scanning/linking process would cause this behavior in innodb? Any suggestions on where to start troubleshooting? Thanks - John