20 x '400' errors in site but URLs work fine in browser...
-
Hi, I have a new client set-up in SEOmoz and the crawl completed this morning... I am picking up 20 x '400' errors, but the pages listed in the crawl report load fine... any ideas?
example -
-
Most major robots obey crawl delays. You could check your errors in Google Webmaster Tools to see if your site is serving a lot of error pages when Google crawls.
I suspect Google is pretty smart about slowing down its crawl rate when it encounters too many errors, so it's probably safe to not include a crawl delay for Google.
-
Sorry, one last question.
Do I need to add a similar delay for Google Bots, or is this issue specifically a Roger Bot problem?
Thanks
-
Fantastic, thanks, Cyrus and Tampa, prevented many more hours of scratching head!!!
-
Hi Justin,
Sometimes when rogerbot crawls a site, the servers and/or the content management system can get overwhelmed if roger is going too fast, and this causes your site to deliver error pages as roger crawls.
If the problem persists, you might consider installing a crawl delay for roger in your robots.txt file. It would look something like this:
User-agent: rogerbot
Crawl-delay: 5
This would cause the SEOmoz crawlers to wait 5 seconds before fetching each page. Then, if the problem still persists, feel free to contact the help team at help@seomoz.org
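If you want to confirm the rule reads the way you expect before deploying it, here is a minimal sketch using Python's standard-library robots.txt parser. It only shows how a standards-following parser interprets the two lines above; it is not how rogerbot itself reads the file, and the variable names are just for illustration.

```python
from urllib.robotparser import RobotFileParser

# The same two directives suggested above, as they would appear in robots.txt.
robots_txt = """\
User-agent: rogerbot
Crawl-delay: 5
"""

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())

# Delay that applies to rogerbot (in seconds).
roger_delay = parser.crawl_delay("rogerbot")

# No rule mentions Googlebot and there is no "User-agent: *" block,
# so no delay applies to it.
other_delay = parser.crawl_delay("Googlebot")

print("rogerbot delay:", roger_delay)
print("Googlebot delay:", other_delay)
```

Because the rule is scoped to rogerbot, other crawlers are unaffected unless you add their own block or a wildcard one.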
Hope this helps! Best of luck with your SEO!
-
Thanks Tampa SEO, good advice.
Interestingly, the URL listed in SEOmoz is as follows:
www.morethansport.co.uk/brand/adidas?sortDirection=ascending&sortField=Price&category=sport and leisure
But when I look at the link in the referring page it is as follows:
/brand/adidas?sortDirection=ascending&sortField=Price&category=sport%20and%20leisure
Notice the "%20" in place of each space.
The actual URL is the one listed in SEOmoz, but even if I copy and paste the %20 version, the browser converts the "%20" back to spaces and the page loads fine.
I still can't get the site to throw up a 400.
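For anyone comparing the two forms, here is a small sketch (Python standard library only) showing that they are the same URL, one with literal spaces and one percent-encoded. The `safe` characters passed to `quote` are my own choice so that the query-string punctuation is left alone; this says nothing about what rogerbot actually sends.

```python
from urllib.parse import quote, unquote

# Path and query as listed in the crawl report (literal spaces).
raw = "/brand/adidas?sortDirection=ascending&sortField=Price&category=sport and leisure"

# The same link as it appears in the referring page (spaces encoded as %20).
encoded = "/brand/adidas?sortDirection=ascending&sortField=Price&category=sport%20and%20leisure"

# Decoding the %20 version gives back the version with spaces.
decoded = unquote(encoded)

# Encoding the spaced version, leaving / ? = & untouched, gives the %20 version.
reencoded = quote(raw, safe="/?=&")

print(decoded == raw)
print(reencoded == encoded)
```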
-
Just ran the example link that you provided through two independent HTTP response code checkers, and both are giving me a 200 response, i.e. the site is OK.
This question has been asked before on here; you're definitely not the first person to run into the issue.
One way to diagnose what's going on is to dig a little deeper into the crawl report that SEOmoz generated. Download the CSV file and look at the referring link, i.e. the page on which Roger found the link. Then go to that page and check whether your CMS is doing anything weird with the way it outputs the links that you create. I recall someone back in December having the same issue; he eventually resolved it by noticing that his CMS put all sorts of weird slashes (i.e. /.../...) into the link.
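If the report is long, a short script can group the error URLs by referring page. This is only a sketch: the column names "URL", "HTTP Status Code" and "Referrer" and the sample rows are assumptions made for illustration, so check the header row of your own export and adjust them.

```python
import csv
import io

# Hypothetical sample standing in for the downloaded crawl CSV.
# Column names are assumed, not taken from a real export.
sample_csv = """\
URL,HTTP Status Code,Referrer
http://www.example.com/a,200,http://www.example.com/
http://www.example.com/b c,400,http://www.example.com/list
http://www.example.com/d,400,http://www.example.com/list
"""

def referrers_for_status(csv_file, status="400"):
    """Map each referring page to the URLs that returned the given status."""
    by_referrer = {}
    for row in csv.DictReader(csv_file):
        if row["HTTP Status Code"].strip() == status:
            by_referrer.setdefault(row["Referrer"], []).append(row["URL"])
    return by_referrer

result = referrers_for_status(io.StringIO(sample_csv))
for referrer, urls in result.items():
    print(referrer, urls)
```

With a real export you would pass `open("crawl.csv", newline="")` instead of the `io.StringIO` sample. If most of the errors trace back to one or two referring pages, those are the templates to inspect first.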
Good luck!