Should I remove pages to concentrate link juice?
-
So our site is database powered and used to have up to 50K pages in google index 3 years ago. After re-design that number was brought down to about 12K currently. Legacy URLs that are now generating 404 have mostly been redirected to appropriate pages (some 13K 301 redirects currently).
Trafficked content accounts for about 2K URLs in the end so my question is should I in context of concentrating link juice to most valuable pages:
- remove non-important / least trafficked pages from site and just have them show 404
- no-index non-important / least trafficked pages from site but still have them visible
- 1 or 2 above plus remove from index via Webmaster Tools
- none of the above but rather something else?
Thanks for any insights/advice!
-
Hi StratosJets,
In general the more pages you have the more content you have. More content is usually a good thing. I think before anybody can give you a solid recommendation we would want to know why you are wanting to remove 10K+ pages from the index.
You see, Google looks at your website like a splash of paint on white canvas. Your website in a whole is going to have a large splash around your main pages, then some secondary splash clusters around other content types. Just because these secondary splashes aren't performing as well as the main splash doesn't necessarily mean you want to remove them, as they help paint a picture of the complete site.
If the purpose of removing pages that are driving "some" traffic is only to try and boost the other pages I would say don't do it. You maybe able to restructure your navigation so that these smaller traffic pages don't get as much link juice as some of your higher performing pages. I know this can be a bit challenging in some CMS suites, but removing pages that are bringing in traffic only to try and boost other pages is a very advanced SEO metric and should really only be tackled when you have gathered lots of data.
Imagine you remove 10k pages that on average get 2 hits per month each, are you reasonably sure that the remaining 2k pages will generate 20,000 more hits once those pages are gone? Or is it more likely you lose those 20,000 hits along with the traffic patterns those 20,000 hits would normally bring in?
That being said there are cases to be made for removing pages, specifically ones generated by CMS platforms.
When is it okay to remove pages: The general rule here is if the page has no value or negative value for your viewers / customers.
No value pages: These are pages that have been redone, or their content is also available on another page which is more in depth and/or more user friendly. (Thin Content type pages)
Negative value pages: These are often generated by CMS sites, eCommerce CMS's may have auto generated pages like "Manufacturer Info" or "Item Review" pages, which you aren't using. Blog sites may have "Monthly Archives" or "Similar Post" pages which can create duplicate content if not used correctly. These pages can confuse your users and search engines as to what your site is really about.
If you find your site has tons of no value pages and negative value pages I would say okay go for it, but be very careful look at your page metrics, be sure the pages you are removing are in fact useless and provide no value for your viewers / customers.
If you decide to go the route of removal, the best way from my experience is to 301 pages which have correlating content on another page, and only 404 any page that have absolutely no value and page alternatives. If you 404 a page make sure you remove it from your sitemap and try submit it for removal. Lots of 404's isn't a good thing either...
I hope this helps,
Don
-
Hi,
IMO, don't remove/de-index/404 the pages which are not your traffic drivers today, they might bring some traffic for you tomorrow. I assume, these pages aren't hurting (going-to-hurt) you i.e they are not duplicate content candidates or something.
Of course you can promote whatever pages you want, like you want to promote those 2k traffic drivers in your case.
P.S Taking down a page and showing 404 is not a good practice in general. If you're permanently closing a page and don't want to 301/302 redirect it, then handle it using 410 instead of 404 here. You can check more about http status codes here: https://moz.com/learn/seo/http-status-codes
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Preserving link equity from old pages
Hi Moz Community, We have a lot of old pages built with Dreamweaver a long time ago (2003-2010) which sit outside our current content management system. As you'd expect they are causing a lot of trouble with SEO (Non-responsive, duplicate titles and various other issues). However, some of these older pages have very good backlinks. We were wondering what is the best way to get rid of the old pages without losing link equity? In an ideal world we would want to bring over all these old pages to our CMS, but this isn't possible due to the amount of pages (~20,000 pages) and cost involved. One option is obviously to bulk 301 redirect all these old pages to our homepage, but from what we understand that may not lead to the link equity being passed down optimally by Google (or none being passed at all). Another option we can think of would be to bring over the old articles with the highest value links onto the current CMS and 301 redirect the rest to the homepage. Any advice/thoughts will be greatly appreciated. Thumbs up! Thanks,
Intermediate & Advanced SEO | | 3gcouk0 -
How to remove seemingly untouchable link spam
Hey Mozzers, I have been struggling with this issue, and I am hoping someone can help. I have a number of bad/spammy links to my site. We have never engaged in "bad SEO", but an old subdomain received a number of spammy blog comments, and everything seemed to escalate from there. We have removed a subdomain that received all of the bad links from our DNS settings (about a year ago), but these links are still there when using Ahrefs or MajesticSEO. I don't think we have been penalized for these links, but I would just like to clean them up because, well, it's the right thing to do. How does one do this when these sites seem so untouchable. Either they are from China, Russia, Denmark, abandoned in 2009, etc. If I look for someone to contact, I can't seem to find anyone to even email. Suggestions?
Intermediate & Advanced SEO | | evan890 -
How to remove my site's pages in search results?
I have tested hundreds of pages to see if Google will properly crawl, index and cached them. Now, I want these pages to be removed in Google search except for homepage. What should be the rule in robots.txt? I use this rule, but I am not sure if Google will remove the hundreds of pages (for my testing). User-agent: *
Intermediate & Advanced SEO | | esiow2013
Disallow: /
Allow: /$0 -
When removing a product page from an ecommerce site?
What is the best practice for removing a product page from an Ecommerce site? If a 301 is not available and the page is already crawled by the search engine A. block it out in the robot.txt B. let it 404
Intermediate & Advanced SEO | | Bryan_Loconto0 -
Rel Canonical Link on the Canonical Page
Is there a problem with placing a rel=canonical link on the canonical page - in addition to the duplicate pages? For example, would that create create an endless loop where the canonical page keeps referring to itself? Two examples that are troubling me are: My home site is www.1099pro.com which is exactly the same as www.1099pro.com/index.asp (all updates to the home page are made by updating the index.asp page). I want www.1099pro.com/index.asp to have the rel=canonical link to point to my standard homepage www.1099pro.com but any update that I make on the index page is automatically incorporated into www.1099pro.com as well. I don't have access to my hosting web server and any updates I make have to be done to the specific landing pages/templates. I am also creating a new website that could possible have pages with duplicate content in the future. I would like to already include the rel=canonical link on the standard canonical page even though there is not duplicate content yet. Any help really would be appreciated. I've read a ton of articles on the subject but none really define whether or not it is ok to have the rel=canonical link on both the canonical page and the duplicate pages. The closest explanation was in a MOZ article that it was ok but the answer was fuzzy. -Mike
Intermediate & Advanced SEO | | Stew2220 -
Dynamic pages - ecommerce product pages
Hi guys, Before I dive into my question, let me give you some background.. I manage an ecommerce site and we're got thousands of product pages. The pages contain dynamic blocks and information in these blocks are fed by another system. So in a nutshell, our product team enters the data in a software and boom, the information is generated in these page blocks. But that's not all, these pages then redirect to a duplicate version with a custom URL. This is cached and this is what the end user sees. This was done to speed up load, rather than the system generate a dynamic page on the fly, the cache page is loaded and the user sees it super fast. Another benefit happened as well, after going live with the cached pages, they started getting indexed and ranking in Google. The problem is that, the redirect to the duplicate cached page isn't a permanent one, it's a meta refresh, a 302 that happens in a second. So yeah, I've got 302s kicking about. The development team can set up 301 but then there won't be any caching, pages will just load dynamically. Google records pages that are cached but does it cache a dynamic page though? Without a cached page, I'm wondering if I would drop in traffic. The view source might just show a list of dynamic blocks, no content! How would you tackle this? I've already setup canonical tags on the cached pages but removing cache.. Thanks
Intermediate & Advanced SEO | | Bio-RadAbs0 -
Where does "Pages Similar" link text come from?
When I type in a competitor name (in this case "buycostumes") Google shows several related websites in it's "Pages Similar to..." section at the bottom of the page: My question, can anyone tell me where the text comes from that Google uses as the link. Our competitors have nice branded links and our is just a keyword. I can find nothing on-page that Google is using so it must be coming from someplace off-page, but where?
Intermediate & Advanced SEO | | costume0 -
Should I completly remove the meta tags keywords in the html page?
So if the metag is not longer used by the search engines should I keep them in my html ?
Intermediate & Advanced SEO | | lomastravel0