Spammy 404s: Should I Worry?
-
One of my sites is getting a ton of spammy 404s with porno-like URLs. All of these 404s are linked from other sites that I assume also got hacked, and when I click on them, they are also 404s.
So I'm assuming some spam site is tricking the Googlebot into thinking these URLs exist. But is this going to affect my site & SEO directly?
Is it worth disavowing all of the sites linking to me? Is Google even considering these real links? Did these pages ever actually exist anywhere?
Don't have a hacker-brain whatsoever so I need some enlightening.
I've been told I shouldn't worry, but it seems like something I should worry about... Any help is greatly appreciated.
(I've updated to the newest WordPress and Sucuri.)
-
The pages definitely don't exist anywhere.
Does this mean I have nothing to worry about?
-
There is a link spam technique out there that is used to hide actual links from the site owners. So, if you are logged into your WordPress site, for example, the links and pages won't appear to be there. But, if you are logged out then the pages will be there, visible to the search engines and the public.
Often those injected spam URLs are hidden using JavaScript. There's a Chrome extension called Quick Javascript Switcher that will let you toggle JS on and off. Once it's off, if there are injected URLs on your site, you should be able to see them.
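If you'd rather not install an extension, a rough alternative is to look at the raw HTML, since a plain parser never runs JavaScript or applies CSS. The sketch below is only an illustration of that idea: `LinkCollector`, `external_links`, and the sample markup are made-up names and data, and it treats any link to a different host (including www vs. non-www) as external.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects href values from <a> tags in raw, unrendered HTML."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def external_links(html, page_url):
    """Return sorted absolute links that point to a different host than page_url."""
    collector = LinkCollector()
    collector.feed(html)
    own_host = urlparse(page_url).netloc.lower()
    found = set()
    for href in collector.hrefs:
        absolute = urljoin(page_url, href)
        parts = urlparse(absolute)
        if parts.scheme in ("http", "https") and parts.netloc.lower() != own_host:
            found.add(absolute)
    return sorted(found)


# Hypothetical page source: one normal internal link, one hidden outbound link.
sample = (
    '<a href="/about">About</a>'
    '<div style="display:none"><a href="http://spam.example/pills">x</a></div>'
)
print(external_links(sample, "https://yourdomain.com/"))
```

Feed it the source of a page saved while logged out. Any outbound link you don't recognize is worth a closer look.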
-
The first thing I recommend is to make sure that those are actually 404 errors on your site that the search engines (and regular users) can see. There is a link spam technique out there that is used to hide actual links from the site owners. So, if you are logged into your WordPress site, for example, the links and pages won't appear to be there. But, if you are logged out then the pages will be there, visible to the search engines and the public.
I would look in Google to see if those 404 pages on your site are indexed. Try a site:yourdomain.com search to see if they're indexed. Then, use a crawler to crawl your own website to see if the crawler can find those 404 pages.
Typically, when you see those errors, either the site was hacked and the injected pages have since been removed, or those pages are still on your site but appear to be 404s when you visit them. I recommend you investigate this further to make sure that the pages really do not exist.
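One way to run that check is to request the same URL as a regular browser and as Googlebot and compare the status codes. This is a minimal sketch, assuming the hack cloaks by User-Agent. Some hacks cloak by IP address or login cookie instead, so a clean result here is not proof. `fetch_status`, `summarize`, and `check` are illustrative names, and the browser User-Agent string is a shortened stand-in.

```python
import urllib.error
import urllib.request

# Assumed User-Agent strings for illustration; the browser one is shortened.
USER_AGENTS = {
    "browser": "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 Chrome/120.0 Safari/537.36",
    "googlebot": "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)",
}


def fetch_status(url, user_agent, timeout=10):
    """Return the final HTTP status code for url with the given User-Agent."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status  # redirects are followed before this point
    except urllib.error.HTTPError as error:
        return error.code  # 4xx and 5xx responses arrive as exceptions


def summarize(statuses):
    """Turn a {label: status_code} mapping into a one-line verdict."""
    codes = set(statuses.values())
    if codes == {404} or codes == {410}:
        return "gone for everyone"
    if len(codes) > 1:
        return "differs by user agent, investigate"
    return "same response for everyone, page may really exist"


def check(url):
    """Fetch url once per User-Agent and summarize the result."""
    return summarize({label: fetch_status(url, ua) for label, ua in USER_AGENTS.items()})


# Example (makes real requests, so it is left commented out):
# print(check("https://yourdomain.com/one-of-the-spammy-paths"))
```

A page that answers 200 with "not found" text (a soft 404) would land in the last bucket, so it is worth opening those by hand.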
-
As to whether you should worry, we need more info. Of all the links that show up for your site in a tool like Ahrefs or Majestic, what percentage are these links?
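If you export the referring domains from one of those tools, the percentage is quick to work out. The snippet below is just arithmetic on a list you supply. It does not call any Ahrefs or Majestic API, and the domain names are placeholders.

```python
def spam_share(referring_domains, suspicious):
    """Return the percentage of referring domains that are in the suspicious set."""
    domains = {d.lower() for d in referring_domains}
    if not domains:
        return 0.0
    flagged = domains & {d.lower() for d in suspicious}
    return 100.0 * len(flagged) / len(domains)


# Placeholder data standing in for a backlink export.
domains = ["blog.example", "news.example", "hacked-one.example", "hacked-two.example"]
print(spam_share(domains, {"hacked-one.example", "hacked-two.example"}))
```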
Can you PM me a sample of one or two of them? I will be happy to tell you what I think once I am clear on what they are. We also do a ton with WP, so we could probably give you some direction there. I am only suggesting a PM so that you can share the links if you don't want to disclose them in public. I am not going to try to sell you on our services in any way, and if you wanted service I would refer you elsewhere, as I don't like people hawking through Moz Q&A.
Best
-
Hi there
Has this been an ongoing issue, and are you seeing more and more links to 404 pages coming in? If so, Google has ways for you to notify them about potentially spammy or hacked websites, so you could start there.
If it's something where these links are taking up a good portion of your backlink profile, I would do a quick audit and possibly disavow. This may take a bit of work, so if you're not comfortable, Moz has a great recommended companies list of agencies / consultants that will be more than happy to help.
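If the audit does point toward a disavow, the file Google's disavow tool accepts is plain text with one entry per line: either a full URL or `domain:example.com`, with `#` marking comments. Here is a rough sketch that turns a list of linking URLs into domain-level entries. `build_disavow` and the sample URLs are made up for illustration, and you should review the output by hand before uploading anything.

```python
from urllib.parse import urlparse


def build_disavow(spam_urls, note="Spam links pointing at 404 URLs"):
    """Build disavow file text with one domain: line per unique linking host."""
    hosts = sorted(
        {urlparse(u).netloc.lower() for u in spam_urls if urlparse(u).netloc}
    )
    lines = [f"# {note}"]
    lines += [f"domain:{host}" for host in hosts]
    return "\n".join(lines) + "\n"


# Placeholder URLs standing in for the linking pages found in the audit.
spam = [
    "http://hacked-one.example/a",
    "http://hacked-one.example/b",
    "https://hacked-two.example/x",
]
print(build_disavow(spam), end="")
```

Domain-level entries keep the file short when one hacked site links to you from many pages.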
Let me know if this helps or if you have any more questions! Good luck!
Patrick