Magento - Google Webmaster Crawl Errors
-
Hi guys,
Started my free trial - very impressed - just thought I'd ask a question or two while I can.
I've set up the website for http://www.worldofbooks.com (large bookseller in the UK), using Magento.
I'm getting a huge amount of not found crawl errors (27,808), I think this is due to URL rewrites, all the errors are in this format (non search friendly): http://www.worldofbooks.com/search_inventory.php?search_text=&category=&tag=Ure&gift_code=&dd_sort_by=price_desc&dd_records_per_page=40&dd_page_number=1
As oppose to this format: http://www.worldofbooks.com/arts-books/history-of-art-design-styles/the-art-book-by-phaidon.html (the re-written URL).
This doesn't seem to really be affecting our rankings, we targeted 'cheap books' and 'bargain books' heavily - we're up to 2nd for Cheap Books and 3rd for Bargain Books.
So my question is - are these large amount of Crawl errors cause for concern or is it something that will work itself out? And secondly - if it is cause for concern will it be affecting our rankings negatively in any way and what could we do to resolve this issue?
Any points in the right direction much appreciated.
If you need any more clarification regarding any points I've raised just let me know.
Benjamin Edwards
-
I've added a picture of my crawl errors in SEOMoz
-
Keep in mind that it sounds like you're bloating your index. If anything about your URL is different (even one character) then spiders see that as a new page entirely. Even if you used canonical tags to reduce your index down to just single pages, you're still going to have bots spider all your pages to realize that you don't need that many indexed.
Did you look at the specific crawl errors in seomoz or are these in Google Webmaster?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google Search Console "Text too small to read" Errors
What are the guidelines / best practices for clearing these errors? Google has some pretty vague documentation on how to handle this sort of error. User behavior metrics in GA are pretty much in line with desktop usage and don't show anything concerning Any input is appreciated! Thanks m3F3uOI
Technical SEO | | Digital_Reach2 -
Fetch as Google issues
HI all, Recently, well a couple of months back, I finally got around to switching our sites over to HTTPS://. In terms of rankings etc all looks fine and we have not move about much, only the usual fluctuations of a place or two on a daily basis in a competitive niche. All links have been updated, redirects in place, the usual https domain migration stuff. I am however, troubled by one thing! I cannot for love nor money get Google to fetch my site in GSC. No matter what I have tried it continues to display "Temporarily unreachable". I have checked the robots.txt and it is on a new https:// profile in GSC. Has anyone got a clue as I am stumped! Have I simply become blinded by looking too much??? Site in Q. caravanguard co uk. Cheers and looking forward to your comments.... Tim
Technical SEO | | TimHolmes0 -
Homepage is deindexed in Google
Happened sometime on the 12th or 13th of Feb (is there a way to tell exactly besides referring to GA?).
Technical SEO | | Shinosky
I've been on the Google Webmasters Tools forums trying to nail this down - https://productforums.google.com/forum/?utm_medium=email&utm_source=footer#!msg/webmasters/OgpmNCc3IFA/mmtgUilyXUUJ I can only think that Google is viewing this as duplicate content from an internal page for example: http://mudlifeled.com/shop Very frustrating because we were moving up on the first page for some good brand key words and traffic was climbing. Now I've got my hands up and am at a loss to what I can do.0 -
Google Webmaster Structured Data Error
In google webmaster tool in Structured data it is showing me 396 items with errors i.e. Data Type - Product, Source - Markup:schema.org, Pages -351, Items -351, Items with Errors - 351 When i click on the 351 in that it is showing Missing:Price but when i click on that product i can see the price 2) Data Type - searchresultspage, Source - Markup:schema.org, Pages- 47, Items - 47 Items with errors -45 When i click on the 47 in that it is showing Missing:Price but when i click on that product i can see the price So i am not getting what is the actual error?
Technical SEO | | jackinmathis10 -
CDN Being Crawled and Indexed by Google
I'm doing a SEO site audit, and I've discovered that the site uses a Content Delivery Network (CDN) that's being crawled and indexed by Google. There are two sub-domains from the CDN that are being crawled and indexed. A small number of organic search visitors have come through these two sub domains. So the CDN based content is out-ranking the root domain, in a small number of cases. It's a huge duplicate content issue (tens of thousands of URLs being crawled) - what's the best way to prevent the crawling and indexing of a CDN like this? Exclude via robots.txt? Additionally, the use of relative canonical tags (instead of absolute) appear to be contributing to this problem as well. As I understand it, these canonical tags are telling the SEs that each sub domain is the "home" of the content/URL. Thanks! Scott
Technical SEO | | Scott-Thomas0 -
Does Google pass link juice a page receives if the URL parameter specifies content and has the Crawl setting in Webmaster Tools set to NO?
The page in question receives a lot of quality traffic but is only relevant to a small percent of my users. I want to keep the link juice received from this page but I do not want it to appear in the SERPs.
Technical SEO | | surveygizmo0 -
Will Google Continue to Index the Page with NoIndex Tag Upon Google +1 Button Impression or Click?
The FAQs for Google +1 button suggests as follows: "+1 is a public action, so you should add the button only to public, crawlable pages on your site. Once you add the button, Google may crawl or recrawl the page, and store the page title and other content, in response to a +1 button impression or click." If my page has NoIndex tag, while at the same time inserted with Google +1 button on the page, will Google recognise the NoIndex Tag on the page (and will not index the page) despite the +1 button's impression or clicks send signals to Google spiders?
Technical SEO | | globalsources.com0