What should I do with all these 404 pages?
-
I have a website that I'm currently working on. It has been fairly dormant for a while and has just been given a facelift and brought back to life. I have some questions below about dealing with 404 pages.
In Google WMT/Search Console there are reports of thousands of 404 pages going back some years. It says there are over 5k in total, but it seems I can only download 1k or so from WMT.
I ran a crawl test with Moz and the report it sent back only had a few hundred 404s in it. Why is that?
I'm also not sure what to do with all the 404 pages. I know that both Google and Moz recommend a mixture of leaving some as 404s and redirecting others, and I'd like to know what the community here suggests.
The 404s are a mix of the following:
Blog posts and articles that have disappeared (some of these have good back-links too)
URLs that look like they used to belong to users (the site used to have a forum), which were deleted when the forum was removed. Some of them look like they were removed for spam reasons too, e.g. /user/buy-cheap-meds-online and others like that
Other URLs like /node/4455 (or some other random number)
I'm thinking I should permanently redirect the blog posts to the homepage or the blog, but I'm not sure what to do about all the others. Surely having so many 404s like this is hurting my crawl rate?
-
OK, will try that, thanks.
-
Thanks, I have planned to do that. There are so many of them, though.
-
The posts and articles with good backlinks, does that content still make sense in your renewed site? If so, I'd bring them back. If you don't have the content, you can try the Wayback Machine. The same goes for any old posts you think would be useful to your new readers.
The problem with redirecting a bunch of 404s to the same page (like the homepage) is that you end up with soft 404s and not a very good user experience. Pick the ones that correspond to specific pages that you have on the updated site and redirect those to the equivalent page.
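One way to catch the "everything goes to the homepage" pattern before it ships is to check a planned redirect map for targets that absorb too many old URLs. This is only a rough sketch: the function name `catch_all_targets`, the `max_sources` threshold, and the example paths are all made up for illustration, and the right threshold depends on your site.

```python
from collections import Counter


def catch_all_targets(redirects, max_sources=5):
    """Return redirect targets that receive more than `max_sources` old URLs.

    `redirects` maps old path -> new path. A target that soaks up many
    unrelated old URLs (typically "/") is a likely soft-404 candidate.
    The threshold is an arbitrary assumption, not a Google rule.
    """
    counts = Counter(redirects.values())
    return {target: n for target, n in counts.items() if n > max_sources}


if __name__ == "__main__":
    # Hypothetical plan: ten old posts dumped on the homepage, one mapped properly.
    plan = {f"/blog/post-{i}": "/" for i in range(10)}
    plan["/blog/seo-tips"] = "/articles/seo-tips"
    for target, n in catch_all_targets(plan).items():
        print(f"{n} old URLs redirect to {target}; consider one-to-one targets or a 404")
```

Anything this flags is worth a second look: either find a genuinely equivalent page for each old URL or let it 404.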
Anything else, I'd let 404. A bunch of old posts with no good links, whose content you no longer have a use for on the site, don't represent value to searchers. Those pages will just drop out of Google's index (and crawl attempts) over time.
[This isn't just theoretical. We changed domains back in November and we had lots of old content going back 10+ years, which is ancient history for a financial publisher. I ended up with about 6,000 404s. We are now down to about 4,000 404s as pages drop off. Google crawls us quickly and regularly and our organic traffic is up 86.49%.]
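With thousands of URLs to get through, the triage described in this answer could be roughed out in a short script. This is a minimal sketch under stated assumptions: `triage`, `equivalents`, and `worth_restoring` are hypothetical names, and you would have to build both inputs yourself (for example from a backlink export and a manual mapping of old pages to new ones).

```python
from urllib.parse import urlparse


def triage(url, equivalents, worth_restoring):
    """Decide what to do with one URL that currently returns 404.

    equivalents:     dict of old path -> path of the matching page on the new site
    worth_restoring: set of old paths with good backlinks or still-useful content
    Returns an (action, target) tuple; target is None unless redirecting.
    """
    # Normalise to a path without a trailing slash so lookups are consistent.
    path = urlparse(url).path.rstrip("/") or "/"
    if path in equivalents:
        return ("redirect_301", equivalents[path])
    if path in worth_restoring:
        return ("restore", None)
    # Spam profiles, stray /node/ IDs, and dead posts with no links: let them 404.
    return ("leave_404", None)


if __name__ == "__main__":
    # Hypothetical inputs for illustration only.
    equivalents = {"/blog/old-post": "/blog/new-post"}
    worth_restoring = {"/articles/linked-guide"}
    for url in [
        "https://example.com/blog/old-post/",
        "https://example.com/articles/linked-guide",
        "https://example.com/user/buy-cheap-meds-online",
        "https://example.com/node/4455",
    ]:
        print(url, triage(url, equivalents, worth_restoring))
```

Checking for an equivalent page before checking whether to restore is a design choice in this sketch, not something the answer prescribes; swap the order if you would rather bring content back than redirect it.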
-
Remove all internal links leading to 404 pages. If you're using a redirect, your internal links shouldn't point at the old URL and rely on the redirect (old 404 URL -> 302 -> new page) either; link straight to the new destination.
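If you already have status codes from a crawl export, a small script can list the internal links on a page that point at a 404 or at a redirecting URL. This is a sketch using only the Python standard library; `links_to_fix`, `LinkCollector`, and the shape of the `statuses` dict (absolute URL -> HTTP status) are assumptions made for illustration, not part of any Moz or Google tool.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def links_to_fix(page_url, html, statuses):
    """Return (url, status) pairs for internal links that hit a 404 or a redirect.

    `statuses` maps absolute URL -> HTTP status code, e.g. from a crawl export.
    Links to other hosts and links with unknown status are skipped.
    """
    parser = LinkCollector()
    parser.feed(html)
    site = urlparse(page_url).netloc
    problems = []
    for href in parser.hrefs:
        absolute = urljoin(page_url, href)
        if urlparse(absolute).netloc != site:
            continue  # external link, out of scope here
        status = statuses.get(absolute)
        if status in (301, 302, 404):
            problems.append((absolute, status))
    return problems


if __name__ == "__main__":
    # Hypothetical page and crawl data.
    html = '<a href="/blog/old">old</a> <a href="/about">about</a>'
    statuses = {"https://example.com/blog/old": 404, "https://example.com/about": 200}
    for url, status in links_to_fix("https://example.com/", html, statuses):
        print(status, url)
```

Links reported with 404 should be removed or repointed; links reported with 301 or 302 should be updated to the final destination.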