Best way to address duplicate news sections within site
-
A client has a news section at www.clientsite.com/news and also at subdomain.clientsite.com/news. The stories within each section are identical:
www.clientsite.com/news/story-11-5-2011
subdomain.clientsite.com/news/story-11-5-2011
What's the best way to avoid a duplicate content issue within the site? A 301 redirect doesn't seem appropriate from the user experience point of view.
Is applying a rel=canonical <www.clientsite.com news="" story-a-b-c="">to each story within the subdomain news section the best option? They have 100's of stories, wondering if there might be an easier way?</www.clientsite.com>
Also, the news pages list the story headline and the first 3 lines of copy. Do these summaries present duplicate content issues with the full story page?
Thank you!
-
Alan, I appreciate your effort here. These are the sources I already shared
A complete summary of everything shared in those articles you quote:
1. It doesn't make a difference to google which method is used. When I examine all the information and analysis, it seems to indicate Google will index the content either way. How well that content will rank in Google is a different topic. There are reasons to keep content separate, such as when discussing topics unrelated to the main site, in which case a subdomain would be best.
2. Matt uses the directory approach, and he recommends for others to do the same.
AT BEST you can get that it is close to even with a slighter preference towards subfolders based on that information.
The Rand offers outstanding analysis as to why subfolders are the superior choice. Rand's analysis is in 2009, 2 years after the original articles quoted from Matt. http://www.seomoz.org/blog/understanding-root-domains-subdomains-vs-subfolders-microsites
The bottom line, it's up to you how much you care about your site and it's performance. Personally, I am a fighter. I also micro-manage website architecture because in many aspects, it is a one-time set it and forget it type of thing. Whether to use subdirectories vs subfolders, whether to use underscores in URLs vs dashes, etc. are things you do one time and then it is automated forever.
A detailed list of reasons supporting the subfolder approach has been offered. The DA, time, costs, etc. all support subfolders. If you wish to ignore all those strong, positive benefits and go with a subdomain then that is your choice.
Good luck.
-
The originals
http://googlewebmastercentral.blogspot.com/2008_01_01_archive.htmlhttp://www.mattcutts.com/blog/subdomains-and-subdirectories/
here is a better example from Matt
Deb December 11, 2007 at 1:01 am
<dd class="comment odd alt thread-odd thread-alt depth-1">
Matt thanks for your reply, just a query (if you don’t mind) if I add content in mattcutts.com/blog – it effect in seo because I add directly content in the domain mattcutts.com but if I add content in blog.mattcutts.com is the effect is same? I don’t think so – because this is a subdomain not directly related with the domain?
If I disturb you please don’t mindThanks
Deb</dd>
<dd class="comment odd alt thread-odd thread-alt depth-1">Matt Cutts December 10, 2007 at 10:55 am</dd>
<dd class="comment byuser comment-author-matt-cutts bypostauthor odd alt thread-odd thread-alt depth-1">
Deb, it really is a pretty personal choice. For something small like a blog, it probably won’t matter terribly much. I used a subdirectory because it’s easier to manage everything in one file storage space for me. However, if you think that someday you might want to use a hosted blog service to power your blog, then you might want to go with blog.example.com just because you could set up a CNAME or DNS alias so that blog.example.com pointed to your hosted blog service.
</dd>
I was trying to find video matt made where he makes a simular claim. but i have to get back to work
-
Alan,
We will have to agree to disagree on this one.
There is a ton of what can only be referred to as "SEO bullshit" published. When I quote a source it will usually be Matt Cutts directly, or Google, or a highly respected SEO who shares an opinion on a topic AND who offers very solid research to back up that opinion. In short, credibility is everything when quoting a source to support a given position.
You are quoting a site I have never heard of, alexander.holbreich.org. Is it just me? Do others know and recognize this site as a reputable source of SEO information?
The author's About page is a total of 4 lines of text. Line 1 = his name, Line 3 & 4 is where he lives. Line 2 = he has a degree in "Business Information" but doesn't even state where or when he received this degree. This web page is a solid example of a page that has absolutely zero trust on SEO.
I think it is great that you read various sources of SEO for ideas, but that is a big difference from depending on those sources as credible information.
If you want to quote, try the main source article. Doing such would add higher credibility to your position. I can agree there is a lot of confusion on this topic, but it is propagated mostly by pages like the one you linked which should probably never be read.
Using the source you quoted and some common ground I would share the following:
-
Matt Cutts stated he uses folders "My personal preference on subdomains vs. subdirectories is that I usually prefer the convenience of subdirectories for most of my content. A subdomain can be useful to separate out content that is completely different."
-
Matt Cutts recommended for others to use folders "If you’re a newer webmaster or SEO, I’d recommend using subdirectories until you start to feel pretty confident with the architecture of your site."
-
Matt shared a specific example of when a subdirectory would be appropriate, and it is an example I had shared as well in response to the original question "A subdomain can be useful to separate out content that is completely different. Google uses subdomains for distinct products such news.google.com or maps.google.com, for example."
The above aside, one site is easier to maintain then two. There are lower costs all around (software, trust badges, SSL, etc). There is less time involved as well. All that time and money can be put into other aspects of SEO such as link building and creating great content.
Further, by combining your content into one site, all your content benefits from the higher DA of your site.
I hope you take the information I am sharing the right way Alan. My professional experience leads me to almost always use a folder unless there is a clear and specific reason to use a subdomain such as trying to separate out content which is not related to the main site. The difference is strong enough to where I would recommend for most clients who have a subdomain to delete it and move to the subfolder structure.
If you find a differing opinion, I would love to hear it. All I ask is for it to be from a highly credible SEO source who preferably shares detailed examples or logic to support the position.
Best Regards,
-
-
"With respect to the general subfolder vs domain discussion, as far as I have seen most of the "debate" ended with subfolders being the winner."
For what reasons is it the winner? I use subdomains a lot, thats why I have looked for evidence, and Matt Cutts has stated it makes no difference.
Rand states, it is his personal belief, but google and Matt Cutts have stated many times it makes no difference to rankings
http://alexander.holbreich.org/2008/01/subdomains-vs-subdirectories/" otherwise irrelevant change during this discussion only serves to confuse an otherwise muddy topic"
I dont think its confusion, it is information clearly stated (not to do with rankings) for one to consider. it is an indication of googles thinking. It is stated correcly and all informmation should be considered. One could say that stating rands personal belief is confusing.
-
I take a different view on this topic then Alan.
As Alan mentioned, the recent Google change sole effect is how links to sub-domains from the root domain visually appear in Google WMT. They have absolutely no ranking weight difference. Bringing up that otherwise irrelevant change during this discussion only serves to confuse an otherwise muddy topic.
With respect to the general subfolder vs domain discussion, as far as I have seen most of the "debate" ended with subfolders being the winner.
There are a couple situations where a subdomain would be preferable to a folder. One example is when a different, unrelated topic or product is being offered. Keith, you brought up the example of Google Maps. A few comments I would share:
-
Google Maps is a different product then Google search. Really the main thing they have is they are being offered by the same company. The idea of providing satellite images and driving directions is really quite different then providing the best search results. These two products happen to be offered by the same company but if you think about it, they are really very distinct products. It would be the same idea if Ford created their own version of Sirius radio. Yes, the radios would be offered in Ford cars but the product is truly distinct of the cars and can stand completely alone.
-
Google's site was set up years ago before this topic was analyzed to this depth. Many changes have been made over the years.
A couple great discussions on this topic:
http://www.seomoz.org/blog/understanding-root-domains-subdomains-vs-subfolders-microsites
A quote Rand shared in a different article "99.9% of the time, if a subfolder will work, it's the best choice for all parties." I agree for the overwhelming majority of cases, a subfolder is preferred. There are some corner cases but normally speaking the subfolder is the preferred approach.
-
-
Subdomains or folder is an old debaiting point, but matt cutts has said it makes no difference.
I have also noticed that google includes subdomain links in its site links, as well as google WMT now shows subdomain links as internal(I know this is seperate to ranking, but it makes but with the other evidence it gives weight to what matt cutts stated). -
Good catch on the subdomains! That is a separate issue, and I am recommending they move everything to a clientsite.com/folder setup. The sub-domains do have unique content (except for the news) and they set it up that way because they've seen other sites, like Google, set up sub-domains for maps and their other products.
What's a good explanation to the client for why other large sites like Google set up different content sections as subdomains vs. the folder approach I am recommending?
-
the news pages list the story headline and the first 3 lines of copy. Do these summaries present duplicate content issues with the full story page?
No
With respect to the subdomain, what is the purpose of having the subdomain? It seems likely the best course of action would be to merge any unique content from the subdomain into the main site, then remove the subdomain. Your articles would benefit from the (presumably) stronger DA on the main site. Also your efforts would be reduced by allowing you to fully focus on one site rather then maintain two sites.
How does this subdomain benefit anyone?
If you insisted on keeping the subdomain, then yes the canonical meta tag would work.
-
canonical would be best here. but you would want to do it with code, or use rewrite outbound rules on the server
I would not worry about the sumery problem
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Redirecting old html site to new wordpress site
Hi I'm currently updating an old (8 years old) html site to wordpress and about a month ago I redirected some url's to the new site (which is in a directory) like this... Redirect 301 /article1.htm http://mysite.net/wordpress/article1/
Technical SEO | | briandee
Redirect 301 /article2.htm http://mysite.net/wordpress/article2/
Redirect 301 /article3.htm http://mysite.net/wordpress/article3/ Google has indexed these new url's and they are showing in search results. I'm almost finished the new version of site and it is currently in a directory /wordpress I intend to move all the files from the directory to the root so new url when this is done will be http://mysite.net/article1/ etc My question is - what to I do about the redirects which are in place - do I delete them and replace with something like this? Redirect 301 /wordpress/article1/ http://mysite.net/article1/
Redirect 301 /wordpress/article2/ http://mysite.net/article2/
Redirect 301 /wordpress/article3/ http://mysite.net/article3/ Appreciate any help with this0 -
Why is my site not being indexed?
Hi, I have performed a site:www.menshealthanswers.co.uk search on Google and none of the pages are being indexed. I do not have a "noindex" value on my robot tag This is what is in place: Any ideas? Jason
Technical SEO | | Jason_Marsh1230 -
Redirects - How Best to do this ?
Hi I am looking to close Website A which has many pages. I would like to keep the home page and add some great content to it with a link pointing to Website B. As for all the other pages excluding the home page , how is it best to approach them on Website A. Should I redirect them all to the home page of Website A which will tell Google thoose Pages are no longer needed and to prevent the visitors from seeing a 404? My Main aim here is to not lose any visitors to Website A by sending them to Website B but also to hopefully pass any Page strength from Website A to Website B Thanks Adam
Technical SEO | | AMG1000 -
Image Height/Width attributes, how important are they and should a best practice site include this as std
Hi How important are the image height/width attributes and would you expect a best practice site to have them included ? I hear not having them can slow down a page load time is that correct ? Any other issues from not having them ? I know some re social sharing (i know bufferapp prefers images with h/w attributes to draw into their selection of image options when you post) Most importantly though would you expect them to be intrinsic to sites that have been designed according to best practice guidelines ? Thanks
Technical SEO | | Dan-Lawrence0 -
Question about duplicate images used within a single site
I understand that using duplicate images across many websites was become an increasingly important duplicate content issue to be aware of. We have a couple dozen geotargeted landing pages on our site that are designed to promote our services to residents from various locations in our area. We've created 400+ word pieces of fresh, original content for each page, some of which talks about the specific region in some detail. However, we have a powerful list of top reasons to choose us that we'd like to use on each page as is, without rewriting them for each page. We'd like to simply present this bulleted list as an image file on each page to get around any duplicate written copy concerns. This image would not appear on any other websites but would appear on about two dozen landing pages for a single site. Is there anything to worry about this strategy from a duplicate content or duplicate image perspective in terms of SEO?
Technical SEO | | LeeAbrahamson0 -
Best way to handle redirection for products that come in and out of inventory.
We have a large volume of products that rotate seasonally. From an SEO perspective we are looking for the best method on how to handle these issues. Currently when crawler or user encounters a URL to a product that is no longer in inventory we are looking at two things. One, the request comes in and send a 200 to a page that says ITEM NOT FOUND. Option 2, is simply send them to a 404. The product may or may not be put back into production. What is the best method to handle this?
Technical SEO | | CC_Dallas0 -
Duplicate Content Issue within the Categories Area
We are in the process of building out a new website, it has been built in Drupal. Within the scan report from SEOMOZ Crawl Diagnostics and it look like I have a duplicate content issue. Example: We sell Vinyl Banners so we have many different templates one can use from within our Online Banner Builder Tool. We have broken them down via categories: Issue: Duplicate Page Content /categories/activities has 9 other URLS associated this issue, I have many others but this one will work for an example. Within this category we have multiple templates attached to this page. Each of the templates do not need their own page however we use this to pull the templates into one page onto the activities landing page. I am wondering if I need to nofollow, noindex each of those individule templates and just get the main top level category name indexed. Or is there a better way to do this to minimize the impact of Panda?
Technical SEO | | Ben-HPB0