What is the best way to remove old pages (if at all)?
-
Hi,
I have a client who has thousands of pages on his site - 50,000+. It is a news website, so most of these pages are old news articles and blog posts that receive very little traffic. We are moving to a new content management system and are debating whether to keep all that old content. So far our decision is to keep the content that has gotten at least 100 visits from Bing or Google in the last 6 months but dump everything else. That means dumping around 30,000 pages, most of which have several links pointing to them.
My question is: from an SEO standpoint, is that okay to do? We'd not only lose pages but links as well. Part of me thinks that, in light of the Panda update, getting rid of old content that is good but not great could actually help the site (we do great in the SERPs and actually got a bump in traffic to new articles/posts after the Panda update). However, we obviously don't want to cause problems, and that is why I'd appreciate thoughts and ideas on the best way to handle this major downsizing in content.
Thanks!
-
Do all of the pages receive backlinks? It would be a laborious job and perhaps not worthwhile, but you could look at which of the pages receive a substantial number of links, or links from high-quality websites. Then just include the "good" pages in your new site.
Like you say, I think you would benefit from losing some low value content that doesn't rank well and doesn't have many quality links pointing to it.
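For what it's worth, here is a rough sketch of how that kind of filter could be scripted once you have an export of visits and link counts per page. The 100-visit cutoff is the one from the question; the `Page` fields, the `should_keep` helper, and the linking-domain cutoff of 5 are made up for illustration and would need to be tuned to your own data.

```python
from dataclasses import dataclass


@dataclass
class Page:
    url: str
    search_visits_6mo: int  # visits from Google/Bing over the last 6 months
    linking_domains: int    # external sites linking to this page


def should_keep(page, min_visits=100, min_linking_domains=5):
    """Keep a page if it earns search traffic OR has a meaningful link profile.

    min_visits comes from the original question; min_linking_domains is an
    assumed value, not a recommendation.
    """
    return (
        page.search_visits_6mo >= min_visits
        or page.linking_domains >= min_linking_domains
    )


# Hypothetical sample data standing in for an analytics/link export.
pages = [
    Page("/news/popular-story", 240, 1),
    Page("/news/well-linked-story", 12, 9),
    Page("/news/forgotten-story", 4, 0),
]

keep = [p.url for p in pages if should_keep(p)]
print(keep)
```

The point of the `or` is that a page with little traffic but good links still survives the cut, which is the case a visits-only rule would miss.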
Whatever you decide, it would be interesting to know how that affects your site. Perhaps even worthy of a YOUmoz post?!
-
Be very careful about dumping massive amounts of content. It's not all about how many visits came from search. It's also about the sum-total weight of the site. If you've got 50,000 pages, that's 50,000 internal links to the home page. Take away 30,000, that's a massive hit. Even if each of those pages or the majority of those pages send a tiny little twinkling of link equity, just add them all up.
I've seen client sites take major hits removing that old "worthless" content against my advice. And as an SEO, I then get to step in and say "here - try getting all those old pages moved over now." Then watching as they rebound afterward. Slowly. Painfully.
-
That's a tough call. Would you redirect the old pages? If so, where would you redirect them to? Redirecting to the home page is not a good idea. I would be cautious about getting rid of content that has links to it without having somewhere to redirect those pages.
It seems almost impossible to analyze all of the implications. Have you thought about removing older content in smaller batches? Maybe get rid of the least visited 5000 pages first to see how that impacts the site overall.
I am not sure I would be comfortable dumping that much content willy nilly without doing some testing.
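If you do go the batch route, splitting the list up is easy to script. This is only a sketch: `removal_batches` and the `visits_by_url` mapping are hypothetical names, and the sample data is invented. In practice the batch size would be something like the 5,000 mentioned above.

```python
def removal_batches(visits_by_url, batch_size=5000):
    """Order pages from least to most visited and split them into batches.

    Ties are broken by URL so the ordering is stable between runs.
    """
    ordered = sorted(visits_by_url, key=lambda url: (visits_by_url[url], url))
    return [
        ordered[i:i + batch_size]
        for i in range(0, len(ordered), batch_size)
    ]


# Tiny invented example; a batch size of 2 keeps the output readable.
visits = {"/a": 3, "/b": 0, "/c": 12, "/d": 1, "/e": 7}
batches = removal_batches(visits, batch_size=2)
print(batches)  # [['/b', '/d'], ['/a', '/e'], ['/c']]
```

You would remove the first batch, watch rankings and traffic for a while, and only move on to the next batch if nothing bad happens.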
Another option would be to redirect the old content to its category page so you don't lose that link juice altogether. I would highly recommend figuring out a redirect plan for that content. Losing the links is a bad idea!
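As a starting point for a redirect plan, something like the sketch below could build the old-URL-to-category map. It assumes the old URLs carry the category as the first path segment (for example `/politics/2009/some-story`), which may not match your site. The `category_redirect` helper and the category names are hypothetical. Pages with no matching category come back as `None` so they can be reviewed by hand rather than being sent to the home page.

```python
from urllib.parse import urlsplit


def category_redirect(old_url, known_categories):
    """Return the category page for an old URL, or None if there is no match.

    Assumes the first path segment of the old URL is the category slug.
    """
    segments = [s for s in urlsplit(old_url).path.split("/") if s]
    if segments and segments[0] in known_categories:
        return f"/{segments[0]}/"
    return None  # needs a manual decision; do not default to the home page


# Invented examples for illustration only.
categories = {"politics", "sports"}
old_urls = [
    "https://example.com/politics/2009/old-story",
    "https://example.com/sports/2010/match-report",
    "https://example.com/misc/orphan-page",
]

redirect_map = {url: category_redirect(url, categories) for url in old_urls}
for old, new in redirect_map.items():
    print(old, "->", new)
```

The resulting map can then be turned into whatever redirect rules your new CMS or web server expects.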