Crawler triggering Spam Throttle and creating 4xx errors
-
Hey Folks,
We have a client with an experience I want to ask about.
The Moz crawler is showing 4xx errors. These are happening because the crawler is triggering my client's spam throttling. They could increase from 240 to 480 page loads per minute but this could open the door for spam as well.
Any thoughts on how to proceed?
Thanks! Kirk
-
Thank you Dave!
-
Hey Kirk! We built our crawler to obey robots.txt crawl-delay directives. In the future, if this is ever an issue, you can use the crawl delay to slow Rogerbot down to a more reasonable speed. However, we don't recommend adding a crawl delay larger than 10 or Rogerbot might not be able to finish the crawl of your site.
Just add a crawl delay directive to your robots.txt file like this:
User-agent: rogerbot
Crawl-delay: 10Here's a good article that explains more about this technique: https://moz.com/learn/seo/robotstxt. I hope this helps, feel free to reach out if you have any other questions!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
I bought a domain name ez1.in and it has high MOZ spam score. Althought i
I bought a domain name https://ez1.in and it has a high MOZ spam score. Although it is because of the previous webmaster who used this domain as a URL shortening website. But now I am using it as a service delivery website. I wanted to know once a domain is deleted and ready to resell MOZ doesn't update its Spam score? Or it has periodic updates that are scheduled? How does the whole process work?
Moz Bar | | vadir9690 -
Unusually high Spam Score
Hi! My Spam Score is indicating 39% on MOZ and I can't identify that many signals from the 27-signal list that could be actually raised in order for the score to show such a high value like this. My website is Ronaldo7.net, Is there anyone who could help me understanding what signals exactly I should pay more attention and eventually fix them so the spam score reduces drastically? Are all these Spam Score signals weighed equally? (for example, I don't have a LinkedIn profile/link for my site, so should I assume that for not having it my spam score increases 3-4% (27/100)? I don't think that's the case, so I would really appreciate if anyone could point me in the right directions in order to help me reducing the spam score. One final question, is this Spam Score updated on a daily/weekly or monthly basis? Regards, Guilherme
Moz Bar | | guineto0 -
804 : HTTPS (SSL) error encountered when requesting page.
Hi there, I am attempting a crawl test and am getting "804 : HTTPS (SSL) error encountered when requesting page." with no further information to debug the issue. Site seems to be fine as far as SSL configuration goes. How can we get more debugging information? Thanks,
Moz Bar | | RickEH
Rick0 -
Crawl report shows that it gets 4xx errors for pages that work fine. Why?
On the crawl report it has all these "Critical Crawler Issues". They all say "4xx Error", yet when i click on the link from the crawler report, it goes to a perfectly functioning page, not a 404 page or anything. If i click in it actually says it's a 403 error. It's all for pages generated by the IDX solution for our real estate website. Is Moz broken or am i missing something? Here are a couple examples: <dl class="crawl-page-details-list"> <dd class="crawl-page-details-list-emphasis">https://teamvivi.com/homes-for-sale-map-search/</dd> <dd class="crawl-page-details-list-emphasis"> <dl class="crawl-page-details-list"> <dd class="crawl-page-details-list-emphasis">https://teamvivi.com/email-alerts/</dd> </dl> </dd> </dl>
Moz Bar | | TeamViviRealEstate0 -
Http:// https:// google search console crawl errors
How to direct http:// to https:// to get rid of 404 errors in google webmaster search console (http:// crawl errors)
Moz Bar | | O.D.0 -
MOZ crawler 404 errors on wordpress
Hi all, I've got hundreds of issues coming up on the MOZ crawler with 404 errors, I don't know what these URL's are. Here's a couple of examples; http://www.theswagbagco.co.uk/category/watford/http%3A%2F%2Fwww.theswagbagco.co.uk%2F2015%2F10%2F15%2Fnew-products-2%2F
Moz Bar | | vaineh
http://www.theswagbagco.co.uk/2015/10/01/thank-you-epsom/http%3A%2F%2Fwww.theswagbagco.co.uk%2F2015%2F10%2F01%2Fthank-you-epsom%2F See the first one is one page with a different url appended, the second is the same thank-you-epsom url. How would I find out where these are even being linked from?0 -
Rogerbot will not crawl my site! Site URL is https but keep getting and error that homepage (http) can not be accessed. I set up a second campaign to alter the target url to the newer https version but still getting the same error! What can I do?
Site URL is https but keep getting and error that homepage (http://www.flogas.co.uk/) can not be accessed. I set up a second campaign to alter the target url to the newer https://www.flogas.co.uk/ version but still getting the same error! What can I do? I want to use Moz for everything rather than continuing to use a separate auditing tool!
Moz Bar | | digitalascend0 -
Spam score 9/17 and redirect Question
I sat on a .com domain, which name become increasing popular (xyzselfie) for 2 years ... 4 months ago I hired a VA to do a task. A miscommunication made this person submit my domain to the spammiest directories the internet has to offer. Also because of the domain name and the .com a lot of asian or weird sites/things posted links to my site. I have worked on my site for the last 4 months trying to lower my spam score from a 9. I have:
Moz Bar | | onlinegusto
-Disavowed all the sites that pointed to my site.
-Made more internal links
-Tried to make my content thicker
-Included my email and social profiles to the site In the process my competitors site with exact domain name but .net and more authority came on auction, I bought it and I pointed it with a permanent redirect to my site (hoping my site would in time lose its spam score). This site will generate and income by appearing in search and adsense ads. After months of work I'm at a loss what to do. Does the spam score generally take long to drop? Should i try and stop the permanent redirect and direct my .com to the .net domain? Are there experts who can lower my score? Should I look for non spammy directories in its niche and submit my site to them to increase link authority and nofollow links ? Any feedback or insight would be highly appreciated. fFtTOFk0