User-agent: *
Disallow: /fax/
Disallow: /geo/
Disallow: /invest/
Disallow: /project/
Disallow: /releases/
Disallow: /infoTarget/
Disallow: /email.htm
Disallow: /infocus.htm
Disallow: /languages.htm
Disallow: /mc.htm
Disallow: /narrow_tips.htm
Disallow: /newman.htm
Disallow: /sql.htm
Disallow: /timeline.htm
Disallow: /wandfeed.htm
Disallow: /wandinfo.htm
Disallow: /whorwi.htm

User-agent: Sosospider
Disallow: /

User-agent: SemrushBot
Disallow: /

User-agent: SemrushBot-SA
Disallow: /

User-agent: SiteBot
Disallow: /

User-agent: AhrefsBot
Disallow: /

User-agent: spbot
Disallow: /

User-agent: sogou spider
Disallow: /

User-agent: Sogou+web+spider
Disallow: /

User-agent: Sogou blog
Disallow: /

User-agent: Sogou inst spider
Disallow: /

User-agent: Sogou News Spider
Disallow: /

User-agent: Sogou Orion spider
Disallow: /

User-agent: Sogou spider2
Disallow: /

User-agent: Sogou web spider
Disallow: /

User-agent: Baiduspider
Disallow: /

User-agent: Baiduspider-video
Disallow: /

User-agent: Baiduspider-image
Disallow: /

Options +FollowSymlinks
RewriteEngine On
RewriteBase /
SetEnvIfNoCase User-Agent "SemrushBot" bad_user
SetEnvIfNoCase User-Agent "AhrefsBot" bad_user
Deny from env=bad_user

# ezooms.com - an absolute must to block in every way you can from spying on you !!!
# IP 208.115.113.82 Ezooms.com Mozilla/5.0 (compatible; Ezooms/1.0; ezooms.bot@gmail.com)
# 208.115.111.66 208.115.111.67 208.115.111.68 208.115.111.70 208.115.111.71 208.115.111.74 208.115.111.75
# IP-range: 208.115.96.0 - 208.115.127.255, i.e. 208.115.96.0/19 (they don't give out the bot name!). The 208.115.111.x addresses listed above fall within the CIDR block 208.115.111.64/28.
# wowrack dot com says that the ezooms.com IP belongs to one of their clients, dotnetdotcom.org, and that the main purpose of this machine is to crawl/index content just like Googlebot.
# The spider from ezooms.com visits robots.txt frequently but ignores the rules written in it.
# Therefore the only way to stop this secret spider is to block the IP-range.
# One theory is that the spider belongs to http://www.seomoz.org/ (an anagram of ezooms), which tries to hide its bot in this way.
# The email they give out is fake, just as their web site obviously is !!!
# Ezooms is a parasite and they are definitely up to no good !!!
User-agent: ezooms
Disallow: /

# sistrix (IP 5.9.112.64 - 5.9.112.95)
User-agent: sistrix
Disallow: /

# Yandex bot - a rule breaker, just like the Baidu spiders
User-agent: Yandex
Disallow: /

# proximic.com/info/spider.php
User-agent: proximic
Disallow: /php/
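
# Sketch for the Ezooms note above (blocking by IP-range rather than by robots.txt rule).
# This is only an illustration, kept commented out because robots.txt cannot block IPs.
# It assumes an Apache 2.2-style .htaccess, matching the "Deny from env=bad_user" line
# already used in this file; 208.115.96.0/19 covers 208.115.96.0 - 208.115.127.255.
#   Order Allow,Deny
#   Allow from all
#   Deny from 208.115.96.0/19
# To block only the 208.115.111.x addresses seen so far, the narrower
# 208.115.111.64/28 could be used in the Deny line instead.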