Thousands of 404s
-
Hi there,
I'm working on a site that has a ridiculous number of 404s being reported by Webmaster Tools. We believe this was caused by an on-page error that was amending the URLs and adding in folders that shouldn't have been there, in a big spiral, i.e. /salons/uk/teeth became something like /salons/uk/teeth/salons/edinburgh/hair/teeth...
Anyway, we think the issue is now sorted, but it seems these pages were indexed, so Google is still looking for them when it crawls the site. What's my best move? It's the sheer volume (over 13,000) that has me concerned, so I thought it best to seek some expert advice before continuing.
Thanks in advance!
-
As it's all sorted now, I really wouldn't worry about them too much. You can use the Remove URLs functionality in WMT, but this is a manual process, so I wouldn't do this. If I were in your position, I'd probably just let the pages keep 404ing. After a bit, Google will usually stop trying to recrawl the 404 pages. Right now they are probably recrawling in case the 404 was an accident.
If it's causing a bandwidth problem, you can solve it with a robots.txt rule, as suggested earlier.
-
Hi Philip!
If these URLs are already indexed, you should 301 redirect them to the right URL (in case they have picked up some inbound links). You could also try the URL removal tool from Google (see https://support.google.com/webmasters/answer/1663416) if all you want is to get rid of them.
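To illustrate the redirect idea, here is a rough Python sketch for working out where each broken URL should point. It assumes, purely from the example in the question, that a valid URL has exactly three path segments (like /salons/uk/teeth) and that the bug only appended extra segments after them. The function name and the CANONICAL_DEPTH constant are made up for this example, so adjust them to your real URL structure.

```python
from urllib.parse import urlsplit

# Assumption: valid pages look like /salons/<region>/<service>, i.e. three segments.
CANONICAL_DEPTH = 3


def redirect_target(url):
    """Return the path a spiral URL should 301 to, or None if no redirect is needed."""
    # Split the path into its non-empty segments.
    segments = [s for s in urlsplit(url).path.split("/") if s]
    if len(segments) <= CANONICAL_DEPTH:
        return None  # already at (or above) the expected depth
    # Keep only the leading segments that made up the original URL.
    return "/" + "/".join(segments[:CANONICAL_DEPTH])


print(redirect_target("/salons/uk/teeth/salons/edinburgh/hair/teeth"))  # /salons/uk/teeth
print(redirect_target("/salons/uk/teeth"))  # None
```

You could run this over the 404 list exported from Webmaster Tools to build a redirect map, and spot-check a sample before putting any rules live.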
Good luck, hope this helps.
//Anders
-
Hi Philip,
If all the URLs share the same pattern, I would try adding that pattern to robots.txt so you prevent Google from crawling the pages. Even better would be if you could add noindex tags to the pages.
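As a rough sketch of what that could look like: the rule below assumes the bad URLs all start with /salons/uk/teeth/salons/, which is only a guess based on the example in the question, and the helper name is made up. Python's standard-library robots.txt parser lets you check a rule before deploying it. Note that this parser only does plain prefix matching, so the example sticks to a simple path prefix.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rule: block the spiral URLs that hang off one valid page.
ROBOTS_TXT = """\
User-agent: *
Disallow: /salons/uk/teeth/salons/
"""


def is_crawlable(path, robots_txt=ROBOTS_TXT, agent="Googlebot"):
    """Return True if the given robots.txt text lets `agent` fetch `path`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, path)


print(is_crawlable("/salons/uk/teeth"))  # True
print(is_crawlable("/salons/uk/teeth/salons/edinburgh/hair/teeth"))  # False
```

The point of testing first is to make sure the rule blocks the broken URLs without also blocking the real page they grew from.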
Hope this helps!
Related Questions
-
Are there ways to avoid false positive "soft 404s" by Google
Sometimes I get alerts from Google Search Console that it has detected soft 404s on different websites, and since I take great care to never have true soft 404s, they are always false positives. Today I got one on a website that has pages promoting some events. The language on the page for one event that has sold out says that "tickets are no longer available", which seems to have tripped up Google into thinking the page is a soft 404. It's kind of incredible to me that in the current era, with things like ChatGPT, Google doesn't seem to understand natural language. But that has me thinking: are there some strategies or best practices we can use in how we write copy on the page so Google doesn't flag it as a soft 404? It seems like anything that could tell a user that an item isn't available could trip it up into thinking it is a 404. In the case of my page, it's actually important information: we need to tell the public that an event has sold out, but use their interest in that event to promote other events. So I don't want the page deindexed or not to rank well!
Technical SEO | | IrvCo_Interactive0 -
Huge number of crawl anomalies and 404s - non- existent urls
Hi there, Our site was redesigned at the end of January 2020. Since the new site was launched we have seen a big drop in impressions (50-60%) and also a big drop in total and organic traffic (again 50-60%) when compared to the old site. I know in the current climate some businesses will see a drop in traffic, however we are a tech business and some of our core search terms have increased in search volume as a result of remote working. According to Search Console there are 82k URLs excluded from coverage. The majority of these are classed as 'crawl anomaly' and there are 250+ 404s. Almost all of the URLs are non-existent: they have our root domain with a string of random characters on the end. Here are a couple of examples: root.domain.com/96jumblestorebb42a1c2320800306682, root.domain.com/01sportsplazac9a3c52miz-63jth601, root.domain.com/39autoparts-agency26be7ff420582220, root.domain.com/05open-kitchenaf69a7a29510363. Is this a cause for concern? I'm thinking that all of these random fake URLs could be preventing genuine pages from being indexed, or they could be having an impact on our search visibility. Can somebody advise please? Thanks!
Technical SEO | | nicola-10 -
Old forum with 404s, what should I do?
Hello, So I'm helping out some friends with their SEO. I've just run a Screaming Frog crawl of their entire site (which took hours and hours I might add). They used to have a forum connected to the site, which is no longer active. Google is still indexing all of the old URLs, which unsurprisingly return 404 errors. What should they do to prevent Google from indexing these pages? That's assuming they need to do anything at all. They don't have access to these old forum posts and therefore won't be able to fix the URL or resource by adding a 301 redirect pointing to the most relevant alternate page. I'm new to SEO but my instinct is that they need to have the page return a 410 'Gone' response code to give search engines a clear signal that the page no longer exists and won't be returning, and to remove the internal links to that URL or resource. 1. Is this interpretation correct? 2. What is the impact of leaving these 404s? There are over a thousand, so there's a lot. 3. What should I recommend?
Technical SEO | | jordanayresaira0 -
404s in GWT - Not sure how they are being found
We have been getting multiple 404 errors in GWT that look like this: http://www.example.com/UpdateCart. The problem is that this is not a URL that is part of our structure, it is only a piece. The actual URL has a query string on the end, so if you take the query string off, the page does not work. I can't figure out how Google is finding these pages. Could it be removing the query string? Thanks.
Technical SEO | | Colbys0 -
Missing Meta Tags - "thousands" using WooCommerce?
Recently took a site live for a client using WP/WooCommerce to replace their antiquated shopping cart site and have encountered thousands of "Missing Meta Description Tag" errors. Have researched and tried a couple different approaches, but nothing really seems to fix this problem. I'm happy to continue to research, but have never encountered this problem before. Anyone else encountered similar? If so, how did you fix? Which resources to start with?
Technical SEO | | twelvetwo.net0 -
Removing thousands of shady backlinks?
Hey guys, We've been hired to redesign a website that has thousands of backlinks created by a (possibly) shady offshore company, and I'm wondering if anyone out there has experience dealing with a deletion of this size and type. Is it as simple as just disavowing the whole lot? Thanks, Jason
Technical SEO | | JKorolenko0 -
Organic Traffic Dropping. 404s found. Is this it?
My organic traffic has dropped 20% in the past month. I noticed some major 404 errors today on a very commonly clicked link on our website. I had the developers fix the error today, but then I thought I'd look in Google Webmaster Tools. That's when I saw the attached picture. 2454 404 errors were found. We only have 1500 pages in the entire Google index. Could this massive number of 404 errors contribute to this drop in organic traffic?
Technical SEO | | davewjones0 -
Webmaster Tools 404s
We try to keep our 404s in Google Webmaster Tools to a minimum, but in recent months the volume has simply exploded to over 500k errors. 99.95% of this is complete spam linking to pages that never existed. We have tried marking them as resolved, but they just end up back in the list eventually, and we don't like the idea of 301ing so many links when the pages never existed in the first place. We can just ignore them all, but this makes it hard to identify legitimate 404s that need redirecting, as there is only so much data we can export out of WT. Has anyone had experience with returning 410s? Does Google eventually drop these from WT?
Technical SEO | | jandunlop0