Bogus Crawl Errors in Webmaster Tools?
-
I am suddenly seeing a ton of crawl errors in webmaster tools.
Almost all of them are URL links coming from scraper sites.that I do not own.
Do you see these in your Webmaster Tools account?
Do you mark them as "fixed" if they are on a scraper site? There are waaaay too many of these to make redirects.
Thanks!
-
Thanks, Marcus,
My numbers are rising rapidly right now... but hopefully the trend will reverse.
I'll let you know if I learn anything.
-
Hey, I know, it's kind of bonkers but I certainly think that assuming Google does not know what they are doing is a good place to start.
For us they just cleared up in time, obviously, this is webmaster tools so it was a good old bit of time (months rather than weeks) but it did sort itself out.
Take care!
Marcus -
Hello Marcus,
Thank you for sharing your experience and finding those posts. I appreciate it.
I think I am going to ignore these and assume that Google doesn't know what they are doing.
It surprises me that the URL errors on spammer sites are being presented to me as something that should be fixed.
Thanks again!
-
Hey EGOL
I have seen this in the past on my own site and on a few client sites in the past (which is not to say I have an answer here).
We were seeing completely random looking URLs that at first made me think the site had been somehow hacked or compromised but further investigation revealed that was not the case. We were just getting the strangest of links to pages that did not exist like xhyx.php?id=jamesbrown (that kind of thing).
We did nothing here and over time it seems to have resolved itself and these pages are not listed any longer. I tend to think of the webmaster tools data as diagnostics and it is telling me these pages don't exist so I can check for problems. Well, there is no problem, they don't exist and I am happy about that. Still, whether to mark them as fixed or not, I am unsure and would err towards not doing anything with them as they are not 'errors' as far as I am concerned. Likewise, I don't want to redirect them in most cases as I don't like the linking sites and have better things to do with my working day (I am not getting that time back - it's the digital equivalent or ironing clothes or some such laborious grind).
I had a look around again and whilst I can't find any specific answers regarding whether to mark them as fixed the following posts are of interest:
- http://productforums.google.com/forum/#!topic/webmasters/3GTOLCE-8pk
- https://productforums.google.com/forum/?hl=en#!category-topic/webmasters/webmaster-tools/rKI-38ohfbc
Particularly this quote from John Mueller at Google (webmaster tools guy I believe):
"In general, if a URL is really a 404, that's fine for us, and not something that would cause your site any problems in the long run. At any rate, you don't need to "fix" this problem (eg with a 301 redirect), if you're sure that the URL should really not exist. Having 404s listed in Webmaster Tools will generally not affect your site's crawling, indexing, or ranking; it's normal for websites to return 404 for URLs that don't exist."
So, my take is not to bother but would be interesting to ask the question in webmaster tools section of the Google product forums: https://productforums.google.com/forum/?hl=en#!categories/webmasters/webmaster-tools
Not an answer as such but hope that helps.
Marcus
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
What is a good crawl budget?
Hi Community! I am in the process of updating sitemaps and am trying to obtain a standard for what is considered "strong" crawl budget? Every documentation I've found includes how to make it better or what to watch out for. However, I'm looking for an amount to obtain for (ex: 60% of the sitemap has been crawled, 100%, etc.)
Technical SEO | | yaelslater1 -
Internal Links issue in webmaster
we implemented our website on the basis of WordPress, then we migrate our website to PHP (YII Framework). after a while, we found out an issue around internal links which they were increasing severely. when we check our landing pages in webmaster (for example Price list), in contains 300 internal links but the reality is that there are no href tags on this page. it seems that webmaster calculate most of our links with the links of a single page and show them to us. is it natural or a mis configuration has been happened? Yh1NzPl
Technical SEO | | jacelyn_wiren0 -
Is there a tool to see all redirects?
I'm thinking this is a silly question, but I've never had to deal with it I thought I'd ask. Ok is there a tool out there that will show all the redirects to a domain. I'm working on a project that I keep stumbling on urls that redirect to the site I'm studying. They don't show up in Open Site or ahrefs as linking domains, but they keep popping up on me. Any thoughts?
Technical SEO | | BCutrer0 -
Webmaster tools crawl stats
Hi I have a clients site that was having aprox 30 - 50 pages crawled regularly since site launch up until end of Jan. On the 21st Jan the crawled pages dropped significantly from this average to about 11 - 20 pages per day. This also coincided with a massive rankings drop on the 22nd which i thought was something to do with panda although it later turned out the hosts had changed the DNS and exactly a week after fixing it the rankings returned so i think that was the cause not panda. However i note that the crawl rate still hasn't returned to what it was/previous average and is still following the new average of 10-20 pages per day rather than the 30-50 pages per day. Does anyone have any ideas why this is ? I have since added a site map but hasnt increased crawl rate since A bit of further info if it helps in any way is that In the indexed status section says 48 pages ever crawled with 37 pages indexed. There are 48 pages on the site. The site map section says 37 submitted with 35 indexed. I would have thought that since dynamic site map would submit all urls Any clarity re the above much appreciated ? Cheers Dan
Technical SEO | | Dan-Lawrence0 -
Are 404 Errors a bad thing?
Good Morning... I am trying to clean up my e-commerce site and i created a lot of new categories for my parts... I've made the old category pages (which have had their content removed) "hidden" to anyone who visits the site and starts browsing. The only way you could get to those "hidden" pages is either by knowing the URLS that I used to use or if for some reason one of them is spidering in Google. Since I'm trying to clean up the site and get rid of any duplicate content issues, would i be better served by adding those "hidden" pages that don't have much or any content to the Robots.txt file or should i just De-activate them so now even if you type the old URL you will get a 404 page... In this case, are 404 pages bad? You're typically not going to find those pages in the SERPS so the only way you'd land on these 404 pages is to know the old url i was using that has been disabled. Please let me know if you guys think i should be 404'ing them or adding them to Robots.txt Thanks
Technical SEO | | Prime850 -
4XX (Client Error)
How much will 5 of these errors hurt my search engine ranking for the site itself (ie: the domain) if these 5 pages have this error.
Technical SEO | | bobbabuoy0 -
150 Duplicate page error
I am told that I have 150 duplicate page content. It seems that it is the login link on each of my pages. Is this an error? Is it something I have to change? Thanks Login/Register at http://irishdancingdress.com/wp-login.php?redirect_to=http%3A%2F%2Firishdancingdress.com%2Fdress
Technical SEO | | ukkpower0 -
RSS Feed Errors in Google
We recently (2 months ago) launched RSS feeds for the category pages on our site. Last week we started seeing error pages in Webmaster Tools' Crawl Errors report pop up for feeds of old pages that have been deleted from the site, deleted from the sitemap, and not in Google's index since long before we launched the RSS feeds. Example: www.mysite.com/super-old-page/feed/ I checked and both the URL for the feed and the URL for the actual page are returning 404 statuses. www.mysite.com/super-old-page/ is also showing up in our Crawl Errors. Its been deleted for months but Webmaster Tools is very slow to remove the page from their Crawl Error report. Where is Google finding these feeds that never existed?
Technical SEO | | Hakkasan0