404s effecting crawl rate?
-
We made a change to our site where we all of a sudden we are creating a large number of 404 pages. Is this effecting the crawl/indexing rate?
Currently we've submitted 3.4 million pages, have over 834K indexed but have over and 330K pages not found. Since the large increase in 404s we've noticed a decrease in pages crawled per day. I found this Q & A in Webmasters (http://googlewebmastercentral.blogspot.com/2011/05/do-404s-hurt-my-site.html) but it seems like the 404s should not have an effect. Is this article out of date?
What do you think fellow Moz-ers? Is this a problem?
-
It's not a problem, just fix those as soon as you can. And yes, it does affect crawl rate from what I've seen.
-
That article you mention is very up to date.
but if you got "hit" by Google bot several times a day for those pages that now you return a 404 response code you will see a decrease in pages crawled per day since once Google sees a 404 response code it will not visit / hit that page that often aftre that...
-
Yes i've seen this numerous of times. Is it just 404's are are there also things like DNS playing along?
But if the amount of 404's jump up really high then for sure google turns down the speed. i guess this gives you some air to fix it in time.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Content Showing up on Moz Crawl | www. vs. no-www.
Hello Moz Community! I am new to SEO, Moz and this is my first question. My questions; I have a client that is getting flagged for Duplicate Content. He is getting flagged for having two domains that have the same content i.e. www.mysite.com & mysite.com. I read into this and set up a 301 redirect through my hosting site. I evaluated which site had a stronger Page Authority and had the weaker site redirect to the stronger site. However, I am still getting hit for Duplicate pages caused by the www.mysite.com & mysite.com being duplicates. How should I go about resolving this? Is this an example of a Canonical tag needed in the head of the HTML? Any direction is appreciated. Thank You. B/R Will H.
Technical SEO | | MarketingChimp100 -
Bingbot appears to be crawling a large site extremely frequently?
Hi All! What constitutes a normal crawl rate for daily bingbot server requests for large sites? Are any of you noticing spikes in Bingbot crawl activity? I did find a "mildly" useful thread at Black Hat World containing this quote: "The reason BingBot seems to be terrorizing your site is because of your site's architecture; it has to be misaligned. If you are like most people, you paid no attention to setting up your website to avoid this glitch. In the article referenced by Oxonbeef, the author's issue was that he was engaging in dynamic linking, which pretty much put the BingBot in a constant loop. You may have the same type or similar issue particularly if you set up a WP blog without setting the parameters for noindex from the get go." However, my gut instinct says this isn't it and that it's more likely that someone or something is spoofing bingbot. I'd love to hear what you guys think! Dana
Technical SEO | | danatanseo1 -
Roger bot taking a long time to crawl site
Hi all, I've noticed Roger bot is taking a long time to crawl my new site. It started on the 28th Feb 2013 and is still going. There aren't many pages at the moment. Any ideas please? thanks a lot, Mark.
Technical SEO | | caterfor1 -
Numerous 404 errors on crawl diagnostics (non existent pages)..
As new as them come to SEO so please be gentle.... I have a wordpress site setup for my photography business. Looking at my crawl diagnostics I see several 4xx (client error) alerts. These all show up to non existent pages on my site IE: | http://www.robertswanigan.com/happy-birthday-sara/109,97,105,108,116,111,58,104,116,116,112,58,47,47,109,97,105,108,116,111,58,105,110,102,111,64,114,111,98,101,114,116,115,119,97,110,105,103,97,110,46,99,111,109 | Totally lost on what could be causing this. Thanks in advance for any help!
Technical SEO | | Swanny8110 -
Alternatives to SEOmoz's Crawl Diagnistics
I really like SEOmoz's Crawl diagnostics reports, it goes through the pages and finds all sorts of valuable information, I wanted to know if there are any other services that compete against this specific service, to test the accuracy of their crawl diagnistics. Thanks
Technical SEO | | BestOdds0 -
Keyword Variants - Low conversion rates - Is a site redesign required?
Hi Guys, I'm Chris & i'm a noob to SEO. Thanks for taking the time to read this and for offering your support! It is greatly appreciated! I work for a small family run company called custom designed cables and we manufacture bespoke cable and bespoke retractable cables. I posted a couple of days ago about how to move forward with our site. Our website has been targeting specific keywords and their variants across the relevant pages to somewhat reasonable effect. Since signing up to SEOMOZ, i have been trying to narrow down the keywords that we target per page and also tried to improve the content on the pages with the help of the on page optimisation tool. I have a few questions that i would like somebody to help me with please if it isn't to much trouble. Firstly, for our keywords, there are a lot of variants. For example, we braid cables. Here are some of the variants of this keyword: cable braid, cable braiding, cable screen, cable screening, cable shielding etc etc. The way i have gone in the past has been to try and target all of those keywords on the relevant page. Now, writing content has been the issue as whilst trying to write good informative information, squeezing in all of the keywords has been a problem and more than likely affected the readability of the page. From what i have learnt since i signed up, this is not a good practice and therefore i have tried to narrow it down somewhat yet i don't really want to lose potential customers finding us by only targeting one or two of the keywords. This is a similar situation for most of our keywords on the majority of our pages. What is the best way to approach this? If i was to write a page per keyword, i don't believe that it would look very good as we could end up with over 100 pages with say 5 or 6 of them talking about the same subject which then leads onto the problem of writing good content. It would be difficult to write good content for pretty much the same thing across such a wide number of pages. If this is the solution though then i would more than happily tackle it. What would be the best way to move forward? The next issue that I have is that since i have been modifying the pages with the help of SEOMOZ, the number of enquries that we have received have fell off the charts. We used to average around 50 enquiries a month from the website and since i have modified the site, i'd say we've had probably 20 since the end of February. The funny thing is though, we have also averaged around 200 hits more per month since the changes. So the hits have increased yet the enquiries have died. I was wondering if anybody was willing to take a look at our site: http://www.customdesignedcable.co.uk to give me some general feedback as to why this may be happening and what your overall opinions of the site may be with regards to the layout, look and general feel etc because i am beginning to wonder whether or not the design is what could be causing us to convert so little of our new found visitors. If anyone could also provide me with actionable feedback with regards to our keyword targeting for our pages that would also really really help! I am currently considering re-designing the site in it's entirety and i am interested in your opinions on whether or not this would be a good way to go. Any help that you can give me would be greatly appreciated guys! Thanks again! Chris (Just another website/seo noob desperately attempting to avoid the sack)
Technical SEO | | Chris_CDC0 -
404 crawl errors from "tel:" link?
I am seeing thousands of 404 errors. Each of the urls is like this: abc.com/abc123/tel:1231231234 Everything is normal about that url except the "/tel:1231231234" these urls are bad with the tel: extension, they are good without it. The only place I can find this character string is on each page we have this code which is used for Iphones and such. What are we doing wrong? Code: Phone: <a href="[tel:1231231234](tel:7858411943)"> (123) 123-1234a>
Technical SEO | | EugeneF0 -
Magento - Google Webmaster Crawl Errors
Hi guys, Started my free trial - very impressed - just thought I'd ask a question or two while I can. I've set up the website for http://www.worldofbooks.com (large bookseller in the UK), using Magento. I'm getting a huge amount of not found crawl errors (27,808), I think this is due to URL rewrites, all the errors are in this format (non search friendly): http://www.worldofbooks.com/search_inventory.php?search_text=&category=&tag=Ure&gift_code=&dd_sort_by=price_desc&dd_records_per_page=40&dd_page_number=1 As oppose to this format: http://www.worldofbooks.com/arts-books/history-of-art-design-styles/the-art-book-by-phaidon.html (the re-written URL). This doesn't seem to really be affecting our rankings, we targeted 'cheap books' and 'bargain books' heavily - we're up to 2nd for Cheap Books and 3rd for Bargain Books. So my question is - are these large amount of Crawl errors cause for concern or is it something that will work itself out? And secondly - if it is cause for concern will it be affecting our rankings negatively in any way and what could we do to resolve this issue? Any points in the right direction much appreciated. If you need any more clarification regarding any points I've raised just let me know. Benjamin Edwards
Technical SEO | | Benj250