Spammy 404s: Should I Worry?
-
One of my sites is getting a ton of spammy 404s with porno-like URLs. All of these 404s are linked from other sites that I assume also got hacked, and when I click on them, they are also 404s.
So I'm assuming some spam site is tricking the Googlebot into thinking these URLs exist. But is this going to affect my site & SEO directly?
Is it worth disavowing all of the sites linking to me? Is Google even considering these real links? Did these pages ever actually exist anywhere?
Don't have a hacker-brain whatsoever so I need some enlightening.
I've been told I shouldn't worry but it seems like something I should worry about...Any help is greatly appreciated
(I've updated to the newest Wordpress and Sucuri).
-
The pages definitely don't exist anywhere.
Does this mean I have nothing to worry about?
-
There is a link spam technique out there that is used to hide actual links from the site owners. So, if you are logged into your WordPress site, for example, the links and pages won't appear to be there. But, if you are logged out then the pages will be there, visible to the search engines and the public.
Often those injected spam URLs are hidden using javascript. There's a Chrome plugin called Quick Javascript Switcher that will let you toggle JS on and off. Once it's off, if there are injected URLs on your site, you should be able to see them.
-
The first thing I recommend is to make sure that those are actually 404 errors on your site that the search engines (and regular users) can see. There is a link spam technique out there that is used to hide actual links from the site owners. So, if you are logged into your WordPress site, for example, the links and pages won't appear to be there. But, if you are logged out then the pages will be there, visible to the search engines and the public.
I would look in Google to see if those 404 pages on your site are indexed. Try a site:yourdomain.com search to see if they're indexed. Then, use a crawler to crawl your own website to see if the crawler can find those 404 pages.
Typically, when you see those errors, the site has been hacked and now they've been removed. Or, those pages are on your site but when you go to them they appear to be 404s. I recommend you investigate this further to make sure that the pages or the errors do not exist.
-
As to should you worry, we need more info. Of all the links you show in a tool like ahrefs or Majestic, what percentage are these links?
Can you pm me a sample of one or two of them? I will be happy to tell you what I think once I am clear on what they are. We also do a ton with WP so could probably give you some direction there. I am only saying PM so that you can disclose if you don't want to disclose in public. I am not going to in any way try to sell you on our services and if you wanted service I would refer you as I don't like people hawking through Moz Q&A.
Best -
Hi there
Has this been an ongoing issue and you are seeing more and more 404 links coming in? If so, Google has ways of notifying them on potentially spammy / hacked websites, so you could start there.
If it's something where these links are taking up a good portion of your backlink profile, I would do a quick audit and possibly disavow. This may take a bit of work, so if you're not comfortable, Moz has a great recommended companies list of agencies / consultants that will be more than happy to help.
Let me know if this helps or if you have any more questions! Good luck!
Patrick
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
False Soft 404s, Shadow Bans, and Old User Generated Content
What are the best ways to keep old user generated content (UGC) pages from being falsely flagged by Google as soft 404s? I have tried HTML site maps to make sure no page is an orphaned but that has not solved the problem. Could crawled currently not indexed by explained by a shadow ban from Google? I have had problems with Google removing pages from SERPs without telling me about it. It looks like a lot of content is not ranking due to its age. How can one go about refreshing UGC without changing the work of the user?
Technical SEO | | STDCarriers0 -
Sitemaps, 404s and URL structure
Hi All! I recently acquired a client and noticed in Search Console over 1300 404s, all starting around late October this year. What's strange is that I can access the pages that are 404ing by cutting and pasting the URLs and via inbound links from other sites. I suspect the issue might have something to do with Sitemaps. The site has 5 Sitemaps, generated by the Yoast plugin. 2 Sitemaps seem to be working (pages being indexed), 3 Sitemaps seem to be not working (pages have warnings, errors and nothing shows up as indexed). The pages listed in the 3 broken sitemaps seem to be the same pages giving 404 errors. I'm wondering if auto URL structure might be the culprit here. For example, one sitemap that works is called newsletter-sitemap.xml, all the URLs listed follow the structure: http://example.com/newsletter/post-title Whereas, one sitemap that doesn't work is called culture-event-sitemap.xml. Here the URLs underneath follow the structure http://example.com/post-title. Could it be that these URLs are not being crawled / found because they don't follow the structure http://example.com/culture-event/post-title? If not, any other ideas? Thank you for reading this long post and helping out a relatively new SEO!
Technical SEO | | DanielFeldman0 -
Spammy structured data for http://www.heritageprinting.com/ might be dropped from search results
We received the above message, which I'm see may also have. Before I go making hours of edits can someone give me an opinion on what may need fixed? Here's a link to one of our products: http://heritageprinting.com/products/step-and-repeat.phpAll products are uniquely marked upIt may be the $ dollar sign, but I'm not certain.Looking at WMT > Search Appearance > Structured Data, I see no errors for Schema Markup. TY in advance :)KJr
Technical SEO | | KevnJr0 -
404s effecting crawl rate?
We made a change to our site where we all of a sudden we are creating a large number of 404 pages. Is this effecting the crawl/indexing rate? Currently we've submitted 3.4 million pages, have over 834K indexed but have over and 330K pages not found. Since the large increase in 404s we've noticed a decrease in pages crawled per day. I found this Q & A in Webmasters (http://googlewebmastercentral.blogspot.com/2011/05/do-404s-hurt-my-site.html) but it seems like the 404s should not have an effect. Is this article out of date? What do you think fellow Moz-ers? Is this a problem?
Technical SEO | | JoshKimber0 -
Creating unique SEO content for E-Commerce - worried about it being copied
Hi, So, we know we don't have the best content - so we are hiring writers to create unique content for each product. What happens if this is now copied by another website? What does Google see? Do they recognize us as the original content? Has anyone used DMCA.com ? is it worth it? thanks, Ben
Technical SEO | | bjs20100 -
We have duplicate page titles on the footer menu section of our site. Is this considered spammy?
When our new site was in development stages our digital agency convinced me that we should have duplicate menu links in the footer section of the site. The general justification being that the menu links are key word relevant. I have received opposing opinion from SEO advisers indicating that these duplicate menu links could be considered 'spammy'. I would appreciate some views on this please
Technical SEO | | saints0 -
Disallow: /search/ in robots but soft 404s are still showing in GWT and Google search?
Hi guys, I've already added the following syntax in robots.txt to prevent search engines in crawling dynamic pages produce by my website's search feature: Disallow: /search/. But soft 404s are still showing in Google Webmaster Tools. Do I need to wait(it's been almost a week since I've added the following syntax in my robots.txt)? Thanks, JC
Technical SEO | | esiow20130 -
Can spammy links affect indexing?
Meaning, if you have a lot of bad quality links (directories, blog comments) that are giving great rankings for some terms (on a homepage of a site), could the low quality of these links negatively affect the crawling frequency of interior pages or perhaps even give interior pages a ranking penalty?
Technical SEO | | qlkasdjfw0