
Thread: [SOLVED] Message Queue getting stuck

  1. #1
    jefft@iri.columbia.edu is offline Senior Member
    Join Date
    Aug 2007
    Location
    New York
    Posts
    56
    Rep Power
    8

    [SOLVED] Message Queue getting stuck

    Over the past two days, I have been seeing a problem where the Deferred mail queue starts filling up and no new mail is delivered locally.

    A restart of Zimbra (zmcontrol stop, zmcontrol start) solves the problem, and the queue empties out immediately after the restart.
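
    To see whether the queue is filling up again before users notice, the size can be read from the summary that `postqueue -p` prints. This is only a rough sketch: the function name `queue_request_count` is made up for illustration, and the postqueue path in the comment is an assumption about a typical Zimbra install.

    ```shell
    # Read `postqueue -p` output on stdin and print the number of queued messages.
    # Exits non-zero if no summary line is found.
    queue_request_count() {
        awk '
            /^Mail queue is empty/         { print 0; found = 1; exit }
            /^-- .* in [0-9]+ Requests?\./ { print $(NF-1); found = 1; exit }
            END { if (!found) exit 1 }
        '
    }

    # Example (path is an assumption; adjust to where postqueue lives on your system):
    #   /opt/zimbra/postfix/sbin/postqueue -p | queue_request_count
    ```

    Running this from cron and alerting above some threshold would at least give an early warning while the root cause is still unknown.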

    In the log files (zimbra.log) I see the following (with names changed to protect the innocent, of course):

    >Jul 9 10:43:43 XXXXXX postfix/qmgr[10896]: 242AD90BE2D: to=<name@mail.host>, relay=none, delay=0.03, delays=0.01/0.02/0/0, dsn=4.4.2, status=deferred (delivery temporarily suspended: conversation with mail.xxx.xxx.xxx[nnn.nnn.nnn.nnn] timed out while receiving the initial server greeting

    I'm not sure where to look for the problem. Can anyone point me to what I should be looking for? I don't think it's a network problem because, as I said, a Zimbra restart fixes it, and it only started happening over the past two days.

    Has anyone run into this before?

    Thanks for any help,
    Jeffrey Turmelle
    International Research Institute for Climate and Society
    Earth Institute at Columbia University


    Release 7.2.4_GA_2900.RHEL5_64_20130523110956 RHEL5_64 NETWORK edition.

  2. #2
    phoenix is online now Zimbra Consultant & Moderator
    Join Date
    Sep 2005
    Location
    Vannes, France
    Posts
    23,505
    Rep Power
    57


    Have you done any updates to the system in the past few days? Are there any other errors in the log files around the time you get the error above? Can you also look in the /etc/security/limits.conf file and see if you have the following entries:

    Code:
    zimbra soft nofile 524288
    zimbra hard nofile 524288
    If they're not set to that, could you change them and restart your system?
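
    A quick way to check those two entries without reading the file by eye is sketched below. The helper name `has_nofile_limits` is hypothetical; it only looks at the file you pass in, so anything set under /etc/security/limits.d/ would not be seen.

    ```shell
    # Succeed only if FILE contains both a soft and a hard nofile entry
    # for USER with the value WANT. Commented-out lines do not match.
    has_nofile_limits() {
        file=$1 user=$2 want=$3
        for kind in soft hard; do
            awk -v u="$user" -v k="$kind" -v w="$want" '
                $1 == u && $2 == k && $3 == "nofile" && $4 == w { ok = 1 }
                END { exit !ok }
            ' "$file" || return 1
        done
    }

    # Example:
    #   has_nofile_limits /etc/security/limits.conf zimbra 524288 && echo "limits OK"
    ```

    The file only tells you what is configured. To see the limit a fresh zimbra login actually gets, `su - zimbra -c 'ulimit -n'` run as root should print it.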
    Regards


    Bill


    Acompli: A new adventure for Co-Founder KevinH.

  3. #3
    jefft@iri.columbia.edu is offline Senior Member
    Join Date
    Aug 2007
    Location
    New York
    Posts
    56
    Rep Power
    8

    No other errors that I can see

    The only system change was the recent Zimbra security fix 30754, which I installed early last week.

    I checked /var/log/messages, /var/log/maillog, and /opt/zimbra/log/mailbox.log in addition to /var/log/zimbra.log, and found nothing else out of the ordinary except for:
    > Jul 9 10:18:12 XXXXXX postfix/qmgr[10896]: warning: connect to transport retry: No such file or directory
    ...
    ...
    > Jul 9 10:34:55 XXXXXX postfix/qmgr[10896]: warning: connect to transport retry: No such file or directory

    This warning is logged continuously during the email outage (probably on every new incoming email until I restart Zimbra). I guess this means that qmgr is losing its connection to postfix and can't recover, but how would that happen?

    zmcontrol status returns the all/ok messages (as does the GUI).

    The /etc/security/limits.conf file is correct.

    Any other log files I might check for clues?

    Thanks again
    Last edited by jefft@iri.columbia.edu; 07-09-2009 at 01:43 PM. Reason: forgot something

  4. #4
    phoenix is online now Zimbra Consultant & Moderator
    Join Date
    Sep 2005
    Location
    Vannes, France
    Posts
    23,505
    Rep Power
    57


    Is your DNS server on the Zimbra server or another machine? My guess is that it's not able to resolve the DNS records when this problem happens. Does the problem recur after a while? Is there any likelihood that the hard disk is getting full on this server?
    Regards


    Bill



  5. #5
    veronica is offline Outstanding Member
    Join Date
    Jun 2008
    Posts
    594
    Rep Power
    8


    Do you have transports defined? Your master.cf is also needed for further analysis.
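
    The qmgr warning names a transport called "retry", and as far as I know Postfix looks transports up as services in master.cf, so one thing worth ruling out is whether a service by that name is defined at all. The sketch below is only a quick check under that assumption; the function name `has_master_service` is made up, and the master.cf path in the comment is a guess at the usual Zimbra location.

    ```shell
    # Succeed if the master.cf-format FILE defines a service called NAME.
    # Comment lines and indented continuation lines are ignored.
    has_master_service() {
        awk -v name="$2" '
            /^[[:space:]]*#/ || /^[[:space:]]/ { next }
            $1 == name { found = 1 }
            END { exit !found }
        ' "$1"
    }

    # Example (path is an assumption):
    #   has_master_service /opt/zimbra/postfix/conf/master.cf retry || echo "no retry service"
    ```

    If the service is present, the cause is somewhere else and the full master.cf is still needed.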

  6. #6
    jefft@iri.columbia.edu is offline Senior Member
    Join Date
    Aug 2007
    Location
    New York
    Posts
    56
    Rep Power
    8

    DNS is not on the same server

    The DNS server is external to our mail server, but we constantly monitor it (our DNS server) for reply times, and haven't seen anything excessive recently.

    Disk space is plentiful.

    I've attached my master.cf, but I think it's the default from the Zimbra release.

    We don't have any transports. This server handles all mail, although we do have a front-end spam/virus gateway that delivers mail to Zimbra. But again, we monitor that machine for reply times, and it has been fine.

    Since this is intermittent, I don't think it's a 'Zimbra' problem per se. More likely, some Zimbra process is failing to connect to a service, or is losing a socket connection after a timeout. I'm not very good at debugging Zimbra, though, and wonder what flags I might turn on to increase logging so I can trace the actual problem.

    Thanks again for your help.
    Attached Files

  7. #7
    jefft@iri.columbia.edu is offline Senior Member
    Join Date
    Aug 2007
    Location
    New York
    Posts
    56
    Rep Power
    8


    If it can't resolve the DNS records (even if only momentarily), will that cause it to stay disconnected? I am starting to think that this may be the problem.

  8. #8
    phoenix is online now Zimbra Consultant & Moderator
    Join Date
    Sep 2005
    Location
    Vannes, France
    Posts
    23,505
    Rep Power
    57


    Quote: Originally Posted by jefft@iri.columbia.edu
    If it can't resolve the DNS records (even if only momentarily), will that cause it to stay disconnected? I am starting to think that this may be the problem.
    It's quite possible. Is this happening on a regular basis? When this problem occurs, are you able to ssh into the Zimbra server? If you can, check whether DNS resolution is still working.
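
    To test resolution against the hosts that are actually failing rather than some arbitrary name, the relay names can be pulled out of the deferred lines in zimbra.log. A rough sketch follows; `timed_out_relays` is a made-up helper name, and it only matches the "conversation with host[ip] timed out" wording quoted earlier in this thread.

    ```shell
    # Read log lines on stdin and print each distinct relay hostname that
    # appears in a "conversation with host[ip] timed out" message.
    timed_out_relays() {
        sed -n 's/.*conversation with \([^ ]*\)\[[^]]*\] timed out.*/\1/p' | sort -u
    }

    # Example: try to resolve every relay seen in the log while the queue is stuck.
    #   timed_out_relays < /var/log/zimbra.log | while read -r h; do
    #       getent hosts "$h" || echo "cannot resolve $h"
    #   done
    ```

    If every name resolves while mail is still being deferred, DNS is probably not the culprit at that moment.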
    Regards


    Bill



  9. #9
    jefft@iri.columbia.edu is offline Senior Member
    Join Date
    Aug 2007
    Location
    New York
    Posts
    56
    Rep Power
    8


    Yes, I usually log in immediately and DNS is fine. But I was wondering whether a small hiccup on the DNS server might cause this. I believe this is probably a postfix problem, so I am going to head over to those forums and ask there too.

  10. #10
    veronica is offline Outstanding Member
    Join Date
    Jun 2008
    Posts
    594
    Rep Power
    8


    > Jul 9 10:18:12 XXXXXX postfix/qmgr[10896]: warning: connect to transport retry: No such file or directory
    ...
    ...
    > Jul 9 10:34:55 XXXXXX postfix/qmgr[10896]: warning: connect to transport retry: No such file or directory

    Can you paste the log lines that come before these, please? This is only half of the error information.
