Migrate Old Archive Content?
-
Hi,
Our team has recently acquired several newsletter titles from a competitor.
We are currently deciding how to handle the archive content on their website which now belongs to us.
We are thinking of leaving the content on their site (so as not to suddenly remove a chunk of their website and harm them) but also replicating it on ours with a canoncial link to say our website is the original source.
The articles on their site go back as far as 2010.
Do you think it would help or hinder our site to have a lot of old archive content added to it? I'm thinking of content freshness issues.Even though the content is old some of it will still be interesting or relevant.
Or do you think the authority and extra traffic this content could bring in makes it worth migrating.
Any help gratefully received on the old content issue or the idea of using canonical links in this way.
Many Thanks
-
Thanks for all your help with this.
-
I agree 100% with Hutch and Patrick. Your best bet is to dive into whatever analytics data you have for the content. I would probably follow a rough procedure like:
- Identify content no one is looking at, is not ranking, is old/poor - start there and you can probably trim out the lowest quality stuff - remove completlely or just noindex to be more conservative
- Then find the other extreme - think 80/20 - find the obvious highest achievers and those are the ones you'd most want to maybe move over or maintain in some way. If any high achievers are getting traffic despite being old/poor - that won't last - so update them.
- The hardest to figure out is the mediocre performing stuff (moderate visits, moderate search visibility). I would probably put all the moderate content in a spreadsheet. Categorize it by topic. Figure out what can stand alone, or be consolidated. Basically you want to arrive at a situation where every piece of content you keep is, if not recent, at least still quality (quality as defined by: unique, well executed, good design, good UX, helpful or entertaining).
The content audit process mentioned by Patrick is a great way to do this analysis with data, but you can also just use some traffic and basic segmenting in analytics as an easier method.
You could also try some tools like URL Profiler, which cake make such an audit process a little easier.
That's just decided if you should keep it - when it comes to migrating I guess it depends on your ultimate vision for the company / branding.
I wouldn't try any tricky things like putting a canonical to say your site is the original source. Google probably knows this is not true, and a canonical is just a "suggestion" so there's no guarantee they will honor it. I would be more in favor of migrating it to your site, removing from the old with a 301 redirect to your site and maybe just a note on your site saying "this article originally appeared in ...." and be really transparent with the user.
-
Great answer, Hutch.
Building on that - Moz offers an extremely comprehensive content audit that goes step by step on how to evaluate your content.
No blanket answer - this will take time and research, but it will make your site so much better overall!
Good luck!
-
I think you are asking a large, loaded question. I do not think there is a "yes you should" or "no you should not" answer for your complex question.
This content is upwards of nearly half a decade old, is it still relevant? Instead of a blanket yes or no, I think you should go through all of it and see what is still valuable, depending on your industry it could be half of it, or it could be none, but you should be looking at each piece individually, not the entire site as one whole. For moving it, if the content is good I think placing it on your site (as I assume you want to consolidate) and redirecting to the new location is fine, plus if you do it as you go, you will not have a massive surge in your content, or drop in the old site but a gradual shift over a period of time.
Hope this helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content issue
Moz crawl diagnostic tool is giving me a heap of duplicate content for each event on my website... http://www.ticketarena.co.uk/events/Mint-Festival-7/ http://www.ticketarena.co.uk/events/Mint-Festival-7/index.html Should i use a 301 redirect on the second link? i was unaware that this was classed as duplicate content. I thought it was just the way the CMS system was set up? Can anyone shed any light on this please. Thanks
Technical SEO | | Alexogilvie0 -
Advice on Duplicate Page Content
We have many pages on our website and they all have the same template (we use a CMS) and at the code level, they are 90% the same. But the page content, title, meta description, and image used are different for all of them. For example - http://www.jumpstart.com/common/find-easter-eggs
Technical SEO | | jsmoz
http://www.jumpstart.com/common/recognize-the-rs We have many such pages. Does Google look at them all as duplicate page content? If yes, how do we deal with this?0 -
Duplicate Page Content for sorted archives?
Experienced backend dev, but SEO newbie here 🙂 When SEOmoz crawls my site, I get notified of DPC errors on some list/archive sorted pages (appending ?sort=X to the url). The pages all have rel=canonical to the archive home. Some of the pages are shorter (have only one or two entries). Is there a way to resolve this error? Perhaps add rel=nofollow to the sorting menu? Or perhaps find a method that utilizes a non-link navigation method to sort / switch sorted pages? No issues with duplicate content are showing up on google webmaster tools. Thanks for your help!
Technical SEO | | jwondrusch0 -
Determining where duplicate content comes from...
I am getting duplicate content warnings on the SEOMOZ crawl. I don't know where the content is duplicated. Is there a site that will find duplicate content?
Technical SEO | | JML11790 -
Does turning website content into PDFs for document sharing sites cause duplicate content?
Website content is 9 tutorials published to unique urls with a contents page linking to each lesson. If I make a PDF version for distribution of document sharing websites, will it create a duplicate content issue? The objective is to get a half decent link, traffic to supplementary opt-in downloads.
Technical SEO | | designquotes0 -
WordPress Duplicate Content Issues
Everyone knows that WordPress has some duplicate content issues with tags, archive pages, category pages etc... My question is, how do you handle these issues? Is the smart strategy to use robots meta and add no follow/ no index category pages, archive pages tag pages etc? By doing this are you missing out on the additional internal links to your important pages from you category pages and tag pages? I hope this makes sense. Regards, Bill
Technical SEO | | wparlaman0 -
Duplicate Content and Canonical use
We have a pagination issue, which the developers seem reluctant (or incapable) to fix whereby we have 3 of the same page (slightly differing URLs) coming up in different pages in the archived article index. The indexing convention was very poorly thought up by the developers and has left us with the same article on, for example, page 1, 2 and 3 of the article index, hence the duplications. Is this a clear cut case of using a canonical tag? Quite concerned this is going to have a negative impact on ranking, of course. Cheers Martin
Technical SEO | | Martin_S0 -
Duplicate content?
I have a question regarding a warning that I got on one of my websites, it says Duplicate content. I'm canonical url:s and is also using blocking Google out from pages that you are warning me about. The pages are not indexed by Google, why do I get the warnings? Thanks for great seotools! 3M5AY.png
Technical SEO | | bnbjbbkb0