Weird 404 Errors in Webmaster Tools
-
Hi,
In a regular check with Webmaster Tools, I have noticed a sudden increase in the number of "not found-404" errors. So I have been looking at them and noticed something weird has been going on.
There are well over 100 pages with 404-errors. The funny thing is, none of the ULR's are correct, For example, if the actual url is something like www.domain.com/latest-reviews , the 404-error points to a non-existent URL like www.domain.com/latest-re And when I checked where they were linked from, they are all from these spammy sites.
Anyone know what could be causing these links, why would anyone link on purpose to a non-existent page?
cheers,
-
I have alike problem: dozen of 404 errors in webmastertools like this:
http://domain.ru/ka...tino-akcia-trexkomnatnaja
http://domain.ru/Sa...e-novosti-za-oktyabr-2012
And there's not linkes to these pages from anywhere. Strange situation, cause i've lot's of pages with urls of different length, but not all of theme comes with error.
-
Thanks. I have actually been adding 301 redirects but didn't want to be spending too much time on it. Some of the links were not even linked. They were just text and Google still treated them as links.
-
Thanks. I've got canonical. So I guess I don't have to do anything.
-
Hi,
When compare you give urls seems someone have posted your shortened urls. As an example on some websites they are shortening the actual url and using as Anchor text.
As an example http://www.seomoz.org/q/wei.. but it correctly has linked to the correct page. But some users with less knowledge, they just copy the Anchor text and post those at blog posts or some other places. Because that anchor text looks like an url.
And also it can be happen because of some other site's activity.
Anyway 404 not found errors will not affect your ranking. So you do not have to worry about this problem. Also suggest you to read this help document about 404 errors.
But I can see some another problem can happen because of this kind of activity. Because if you will get any traffic from a url like that with some suffixed which you have not created. As an example a url like this
www.domain.com/latest-reviews/?refferer=some_reffer
can be have a duplicate content issue. So, I strongly recommend to add rel canonical url in to your page.
Regards
Prasad
-
Google is finding text URLs on sites with limited characters. It's a google crawl problem.
SiteX refers to your article: http://yourdomain.com/blog/austin/steve-rides-to-the-alamo but they hit a charater limit of say 40 characters so they print the URL as "http://yourdomain.com/blog/austin/steve" but link it correctly. Even with a correct link, google will read the text and crawl it the way the text is printed, not linked. Or this happens if it's not linked at all and just a shortened text URL.
To sum it up... Google's got a problem and scrapper sites that chop up URLs are feeding the bots crap. If however the linking domain is a good one and you'd like to take advantage of this little error, then you create a redirect rule on your website for the 404 page.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Remove Directory In Webmaster Tools
Hey Moz'erz, I'm removing some URLS from the index and want to confirm the use of the "remove directory" request. If my structure is this: /blogs/customer-success-stories/tagged/ --- all pages that are /tagged/abc, /tagged/dce etc. will be removed correct? First time trying a directory removal as their are 100 plus of these tagged pages. Comments, suggestions and past experiences welcome!
Technical SEO | | paul-bold0 -
To avoid errors in our Moz crawl, we removed subdomains from our host. (First we tried 301 redirects, also listed as errors.) Now we have backlinks all over the web that are broken. How bad is this, from a pagerank standpoint?
Our MOZ crawl kept telling us we had duplicate page content even though our subdomains were redirected to our main site. (Pages from Wineracks.vigilantinc.com were 301 redirected to vigilantinc.com/wineracks.) Now, to solve that problem, we have removed the wineracks.vigilantinc.com subdomain. The error report is better, but now we have broken backlinks - thousands of them. Is this hurting us worse than the duplicate content problem?
Technical SEO | | KristyFord0 -
Sitemap do they get cleared when its a 404
Hi, Sitemap do they get cleared when its a 404. We have a drupal site and a sitemap that has 60K links and i want to know if in these 4 years we deleted 100's of links and do they have them automatically cleared from Sitemap or we need to build the sitemap again? Thanks
Technical SEO | | mtthompsons0 -
Webmaster tools
Hello, My sites are showing odd "links to your site" data in WMT. Its not showing any links to the homepages and reduced links for other pages. Anyone else seeing this? Penguin refresh maybe?
Technical SEO | | jwdl0 -
Why would SEOMoz and GWT report 404 errors for pages that are not 404ing?
Recently, I've noticed that nearly all of the 404 errors (not soft 404) reported in GWT actually resolve to a legitimate page. This was weird, but I thought it might just be old info, so I would go through the process of checking and "mark as fixed" as necessary. However, I noticed that SEOMoz is picking up on these 404 errors in the diagnostics of the site as well, and now I'm concerned with what the problem could be. Anyone have any insight into this? Rich
Technical SEO | | secretstache0 -
Webmaster Tools 404s
We try to keep our 404s in google webmaster tools to a minimum but in recent months, the volume has simply exploded to over 500k errors. 99.95% of this is complete spam linking to pages that never existed. Have tried marking as resolved but they just end up back in the list eventually and don't like the idea of 301ing so many links when the pages never existed in the first place. We can just ignore them all but this makes it hard to identify legitimate 404s that need redirecting as there is only so much data we can export out of WT. Has anyone had experience with returning 410s? Does google eventually drop these from WT?
Technical SEO | | jandunlop0 -
404 appearing in Sitelinks
I am little embarresed to say that the custom 404 page is appearing in a client's Sitelinks for their brand name search term. I have demoted this page from Sitelinks for the brand term but it still appears 😞 What is the best practise for a custom 404 page appearing in a sitemap and/or being blocked by robots.txt? Thanks
Technical SEO | | mccormackmorrison0 -
Magento - Google Webmaster Crawl Errors
Hi guys, Started my free trial - very impressed - just thought I'd ask a question or two while I can. I've set up the website for http://www.worldofbooks.com (large bookseller in the UK), using Magento. I'm getting a huge amount of not found crawl errors (27,808), I think this is due to URL rewrites, all the errors are in this format (non search friendly): http://www.worldofbooks.com/search_inventory.php?search_text=&category=&tag=Ure&gift_code=&dd_sort_by=price_desc&dd_records_per_page=40&dd_page_number=1 As oppose to this format: http://www.worldofbooks.com/arts-books/history-of-art-design-styles/the-art-book-by-phaidon.html (the re-written URL). This doesn't seem to really be affecting our rankings, we targeted 'cheap books' and 'bargain books' heavily - we're up to 2nd for Cheap Books and 3rd for Bargain Books. So my question is - are these large amount of Crawl errors cause for concern or is it something that will work itself out? And secondly - if it is cause for concern will it be affecting our rankings negatively in any way and what could we do to resolve this issue? Any points in the right direction much appreciated. If you need any more clarification regarding any points I've raised just let me know. Benjamin Edwards
Technical SEO | | Benj250