Duplicate Content within Website - problem?
-
Hello everyone,
I am currently working on a big site which sells thousands of widgets. However each widget has ten sub widgets (1,2,3... say)
My strategy with this site is to target the long tail search so I'm creating static pages for each possibly variation.
So I'll have a main product page on widgets in general, and also a page on widget1, page on widget2 etc etc.
I'm anticipating that because there's so much competition for searches relating to widgets in general, I'll get most of my traffic from people being more specific and searching for widget1 or widget 7 etc.
Now here's the problem - I am getting a lot of content written for this website - a few hundred words for each widget. However I can't go to the extreme of writing unique content for each sub widget - that would mean 10's of 1,000's of articles.
So... what do I do with the content. Put it on the main widget page was the plan but what do I do about the sub pages. I could put it there and it would make perfect sense to a reader and be relevant to people specifically looking for widget1, say, but could there be a issue with it being viewed as duplicate content.
One idea was to just put a snippet (first 100 words) on each sub page with a link back to the main widget page where the full copy would be.
Not sure whether I've made myself clear at all but hopefully I have - or I can clarify.
Thanks so much in advance
David
-
What's wrong with having ten brass widgets in ten different colors and ten buy buttons all listed on a single page?
I do that I we see lots of people buying a brass widget in every color. I think that this is great for getting more sales. If I was a shopper it would be a real frustration to visit ten pages to get one of each color - or just visit all of those pages to see which color I like best.
Most important, Google might see that and say.... This page has brass widgets in EVERY FREEKING COLOR! and decide to show it to visitors who search for them.
Now, if you are compulsive about having one page per widget and having your writer create yada yada yada content for all of them, keep in mind that you are wasting a lot of money on near duplicate content, boring your writers and spreading your pagerank out over a lot of pages.
-
David, the sub-pages as far as Goggle was concerned fed all the juice to the product page.
No the subpages were not indexed as we told Google they all came from the same page in the canonical.
How do you describe a red widget1 differently to blue widget1? The item is the same but there is only one word different in the content, so we decided to skip a physically different url for the different colours and just use different anchors on the thumbnail images. The title and alt tags would contain specific information about the colour of the widget.
If someone searches for red widget1 and we have keyword strength in widget1 they will get to the widget1 page where they will see the red widget1 and any other colours for that widget1.
The canonical allows you to specify the content origin. So if you have /category/widget1/red and /category/widget1/blue describing the same content you could use /category/widget1 in the canonical ref and both pages would give juice to the main page and get no duplicate content penality.
This only works if you have a small number of variants on each widget as Ryan pointed out, such as size, colour variations etc. Otherwise it is too confusing for humans to follow.
With the amount of content you are looking at, it is probably worthwhile getting a usability study done.
-
SEO = Manipulation doesn't it?
You can call me naive but those days of SEO are either gone or disappearing fast.
I view SEO as working to understand the ever-changing metrics search engines use to rank search results, then applying that knowledge to websites.
We are manipulated into improving our sites to provide a better user experience. The changes we make have lasting value. Other forms of SEO are always one update away from making a post asking "what happened to my site's rankings?"
-
Thanks for the replies, guys.
Oznappies - did that structure mean that all your subproduct pages were pretty much devoid of link juice? Were they even indexed? The big question is if someone searhed for 'red product a' which page showed up? Excuse my ignorance re the canonical stuff.
Ryan, Yes you are right to some degree. I am reverse engineering the website so to speak. But nevertheless I plan to offer huge value to visitors - I have spared little expense with the content writing, usability etc plus we have some fairly radical ideas that should be hugely popular with the visitors.
But I take exception that this is the wrong way to go about it. SEO = Manipulation doesn't it? The old adage 'Just make great content then users will find it and link to it and you'll dominate the serps' is a great theory but we all know in practise it doesn't work like that in 99% of the cases. To get your great product out there you have to give it a push, find an angle to exploit and this targeting of long tail is my angle.
It will be a great site I assure you
-
If the widgets are truly different products, then they should have separate product pages. If you have a weather widget, a currency exchange widget, a local time widget, etc. then you can clearly build unique content for each page.
If you offer a widget in different colors, sizes, etc. but it is really the same widget, you can't effectively generate new content for each page. Your best approach is creating a single, strong page for the widget. The "blue", "yellow" and other widget pages should be canonicalized to the main widget page.
I am getting a lot of content written for this website - a few hundred words for each widget. However I can't go to the extreme of writing unique content for each sub widget - that would mean 10's of 1,000's of articles.
That sums it up pretty well. You are having content "written" which often means it is not quality content. You are not willing to write unique content for each sub widget either. You are not developing your site for the best user experience, but instead to manipulate search engine traffic. Google is focused on preventing you from doing exactly what you are trying to do. Even if you succeed, you will be back here in a couple months asking "why did my site drop so far" after Google makes an update to adjust for this type of manipulation.
You have two options. Condense all your content to one widget page, or develop each widget page as if it was the only page on your website. When you sit down and think "I have 10k pages and I need to have content on all of them" your content will be inferior to other sites, and your SERP will reflect as much.
-
We had a similar issue but not to that scale. We had product A in Red, Blue, Green etc the first approach we used a url /category/product?id=subproduct and set id as a parameter in Google Webmaster Tools site config. This passed all the link juice to /category/product and ensured that all pages had the appropriate for the link juice page.
We then decided that all those page loads just to basically show an image for each subproduct were a pain for the customer and so decided to show small images on the /category/product page an use a jquery call to overlay a larger image when the customer clicked a particular product. This produced faster load time and better customer experience.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Content
I am trying to get a handle on how to fix and control a large amount of duplicate content I keep getting on my Moz Reports. The main area where this comes up is for duplicate page content and duplicate title tags ... thousands of them. I partially understand the source of the problem. My site mixes free content with content that requires a login. I think if I were to change my crawl settings to eliminate the login and index the paid content it would lower the quantity of duplicate pages and help me identify the true duplicate pages because a large number of duplicates occur at the site login. Unfortunately, it's not simple in my case because last year I encountered a problem when migrating my archives into a new CMS. The app in the CMS that migrated the data caused a large amount of data truncation Which means that I am piecing together my archives of approximately 5,000 articles. It also means that much of the piecing together process requires me to keep the former app that manages the articles to find where certain articles were truncated and to copy the text that followed the truncation and complete the articles. So far, I have restored about half of the archives which is time-consuming tedious work. My question is if anyone knows a more efficient way of identifying and editing duplicate pages and title tags?
Technical SEO | | Prop650 -
Many Errors on E-commerce website mainly Duplicate Content - Advice needed please!
Hi Mozzers, I would need some advice on how to tackle one of my client’s websites. We have just started doing SEO for them and after moz crawled the e-commerce it has detected: 36 329 Errors – 37496 warnings and 2589 Notices all going up! Most of the errors are due to duplicate titles and page content but I cannot identify where the duplicate pages come from, these are the links moz detected of the Duplicate pages (unfortunately I cannot add the website for confidentiality reasons) : • www.thewebsite.com/index.php?dispatch=categories.view&category_id=233&products_per_00&products_per_2&products_per_2&products_per_2&page=2 • www.thewebsite.com/index.php?dispatch=categories.view&category_id=233&products_per_00=&products_per_00&products_per_2&products_per_2&products_per_2&page=2 • www.thewebsite.com/index.php?dispatch=categories.view&category_id=233&products_per_00=&products_per_00&products_per_2&page=2 • www.thewebsite.com/index.php?dispatch=categories.view&category_id=233&products_per_2=&products_per_00&page=2 • www.thewebsite.com/index.php?dispatch=categories.view&category_id=233&products_per_00&products_per_00&products_per_00&products_per_00&page=2 With these URLs it is quite hard to identify which pages need to be canonicalize. And this is jsut an example out of thousands on this website. If anyone would have any advice on how to fix this and how to tackle 37496 errors on a website like this that would be great. Thank you for your time, Lyam
Technical SEO | | AlphaDigital0 -
Duplicate content for vehicle inventory.
Hey all, In the automotive industry... When uploading vehicle inventory to a website I'm concerned with duplicate content issues. For example, 1 vehicle is uploaded to the main manufacturers website, then again to the actual dealerships website & then again to Craigslist & even sometimes to a group site. The information is all the same, description, notes, car details & images. What would you all recommend for alleviating duplicate content issues? Should I be using the rel canonical back to the manufacturers website? Once the vehicle is sold all pages disappear. Thanks so much for any advice.
Technical SEO | | DCochrane0 -
Duplicate content or titles
Hello , I am working on a site, I am facing the duplicate title and content errors,
Technical SEO | | KLLC
there are following kind of errors : 1- A link with www and without www having same content. actually its a apartment management site, so it has different bedrooms apartments and booking pages , 2- my second issue is related to booking and details pages of bedrooms, because I am using 1 file for all booking and 1 file for all details page. these are the main errors which i am facing ,
can anyone give me suggestions regarding these issues ? Thnaks,0 -
Duplicate Content for Multiple Instances of the Same Product?
Hi again! We're set to launch a new inventory-based site for a chain of car dealers with various locations across the midwest. Here's our issue: The different branches have overlap in the products that they sell, and each branch is adamant that their inventory comes up uniquely in site search. We don't want the site to get penalized for duplicate content; however, we don't want to implement a link rel=canonical because each product should carry the same weight in search. We've talked about having a basic URL for these product descriptions, and each instance of the inventory would be canonicalized to this main product, but it doesn't really make sense for the site structure to do this. Do you have any tips on how to ensure that these products (same description, new product from manufacturer) won't be penalized as duplicate content?
Technical SEO | | newwhy0 -
Link Structure & Duplicate Content
I am struggling with how I should handle the link structure on my site. Right now most of my pages are like this: Home -> Department -> Service Groups -> Content Page For Example: Home -> IT Solutions -> IT Support & Managed Services -> IT Support Home -> IT Solutions -> IT Support & Managed Services -> Managed Services Home -> IT Solutions -> IT Support & Managed Services -> Help Desk Services Home -> IT Solutions -> Virtualization & Data Center Solutions -> Virtualization Home -> IT Solutions -> Virtualization & Data Center Solutions -> Data Center Solutions This structure lines up with our business and makes logical sense but I am not sure how to handle the department and service group pages. Right now you can click them and it just brings you to a page with a small snippet for the links below. The real content is on the content pages. What I am worried about is that the snippets on those pages are just a paragraph or two of the content that's on the content page. Will this hurt me and get considered duplicate content? What is the best practice for dealing with this? Those department/service group pages have some good content on them but it's just parts of other pages. Am I okay doing this because there are not direct duplicates of other pages just parts of a few pages? Any help on this would be great. Thanks in advance.
Technical SEO | | ZiaTG0 -
Duplicate Content For Trailing Slashes?
I have several website in campaigns and I consistently get flagged for duplicate content and duplicate page titles from the domain and the domain/ versions of the sites even though they are properly redirected. How can I fix this?
Technical SEO | | RyanKelly0 -
Canonical Link for Duplicate Content
A client of ours uses some unique keyword tracking for their landing pages where they append certain metrics in a query string, and pulls that information out dynamically to learn more about their traffic (kind of like Google's UTM tracking). Non-the-less these query strings are now being indexed as separate pages in Google and Yahoo and are being flagged as duplicate content/title tags by the SEOmoz tools. For example: Base Page: www.domain.com/page.html
Technical SEO | | kchandler
Tracking: www.domain.com/page.html?keyword=keyword#source=source Now both of these are being indexed even though it is only one page. So i suggested placing an canonical link tag in the header point back to the base page to start discrediting the tracking URLs: But this means that the base pages will be pointing to themselves as well, would that be an issue? Is their a better way to solve this issue without removing the query tracking all togther? Thanks - Kyle Chandler0