Zimbra offers Open Source email server software and shared calendar for Linux and the Mac
Go Back   Zimbra :: Forums > Zimbra Collaboration Suite > Administrators

Welcome to the Zimbra :: Forums!
Welcome, if you would like to post a comment please register. We also encourage you to explore all things Zimbra with our team and members of the community.

Reply
 
LinkBack Thread Tools Search this Thread Display Modes
  #1 (permalink)  
Old 06-08-2011, 11:33 AM
fyd fyd is offline
Elite Member
 
Posts: 373
Default [SOLVED] prevent google from indexing server

Has anyone done the 'robots.txt' way of preventing google from indexing links and pages on a ZCS server? I am trying to figure out a way to prevent this. The closest i could get on this topic is this thread post [SOLVED] robots.txt?
Reply With Quote
  #2 (permalink)  
Old 06-10-2011, 08:41 AM
Senior Member
 
Posts: 60
Default

You need to set the domain attribute 'zimbraMailKeepOutWebCrawlers' to TRUE, i.e.
zmprov ms zimbra.example.com zimbraMailKeepOutWebCrawlers TRUE
Anything listed in
/opt/zimbra/conf/robots.txt
will be appended to the robots.txt file that crawlers see.
__________________
Christopher Lindsey, Technical Program Manager
National Center for Supercomputing Applications
Reply With Quote
  #3 (permalink)  
Old 06-11-2011, 01:59 AM
fyd fyd is offline
Elite Member
 
Posts: 373
Default

Quote:
Originally Posted by lindsey View Post
You need to set the domain attribute 'zimbraMailKeepOutWebCrawlers' to TRUE, i.e.
zmprov ms zimbra.example.com zimbraMailKeepOutWebCrawlers TRUE
Anything listed in
/opt/zimbra/conf/robots.txt
will be appended to the robots.txt file that crawlers see.
Thanks lindsey for the reply , but I read this attribute is from 7.0.1 onwards so I guess I am not lucky with that. ZCS 7.0.1 Is Live!

Any other way I can get this done? Rich Graves says in the first thread that,

"You can add <META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW"> to the <head> section of /opt/zimbra/jetty/webapps/zimbra/public/login.jsp; zmmailboxd stop; flush the /opt/zimbra/jetty/work/zimbra/jsp/org/apache/jsp/public_/ directory; zmmailboxd start"

Will this help to prevent ANY file from getting indexed by Google?
Reply With Quote
  #4 (permalink)  
Old 06-13-2011, 05:49 AM
Senior Member
 
Posts: 60
Default

Dang. It's never that easy, is it?

Unfortunately, that change that you list will only affect the root file. If a spider hits a page at anything but the root level they won't see the META tag and will happily index away.
__________________
Christopher Lindsey, Technical Program Manager
National Center for Supercomputing Applications
Reply With Quote
  #5 (permalink)  
Old 06-13-2011, 06:44 AM
fyd fyd is offline
Elite Member
 
Posts: 373
Default

Quote:
Originally Posted by lindsey View Post
Dang. It's never that easy, is it?
Doh .. yeah right and these days getting someone to reply here is not easy too .. desperate times!

Quote:
Originally Posted by lindsey View Post
Unfortunately, that change that you list will only affect the root file. If a spider hits a page at anything but the root level they won't see the META tag and will happily index away.
Okay you mean that will do just the login page huh? I need to keep away one user's calendar from showing up in google search.

hey thanks again!
Reply With Quote
  #6 (permalink)  
Old 06-24-2011, 11:56 PM
fyd fyd is offline
Elite Member
 
Posts: 373
Default

Zimbra has this feature by default since 6.0.11. A file 'robots.txt' will be placed in /opt/zimbra/jetty/webapps/zimbra/ directory of mailbox server with this content.

User-agent: *
Disallow: /

I had to do it manually on my 6.0.10. Placed the file /opt/zimbra/jetty/webapps/zimbra/robots.txt with ownership zimbra:zimbra on mailstore servers. Mailbox should be restarted after that with 'zmmailboxdctl restart'.
Reply With Quote
  #7 (permalink)  
Old 06-27-2011, 03:05 AM
fyd fyd is offline
Elite Member
 
Posts: 373
Default

Any thoughts on how long it will take to do its thing? Its been 5 days and I still see the results. Perhaps displaying from cache?
Reply With Quote
  #8 (permalink)  
Old 06-27-2011, 07:12 AM
fyd fyd is offline
Elite Member
 
Posts: 373
Default

Okay, It will be like this till google crawlers do the next lookup. Any details of crawling should be in jetty logs.
Reply With Quote
Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes


Similar Threads

Why Join?

Registering let's you ask questions, makes it easier to search, displays any files attached to posts, and notifies you about replies.

blog.zimbra.com




 

SEO by vBSEO ©2011, Crawlability, Inc.