Duplicate Content
-
I'm working on a site that sells appliances. There are currently thousands of "issues" with this site, many of them dealing with duplicate content. The product pages can be viewed in "List" or "Grid" format. In List format, they have very little in the way of content.
My understanding is that the duplicate content arises from different URLs going to the same page. For instance, the site might have a different URL when told to display 9 items than when told to display 15. This could then be solved by inserting rel="canonical".
Is there a way to take a site and get a list of all possible duplicates? This would be much easier than slogging through every iteration of the options and copying down the URLs. Also, is there anything I might be missing in terms of why there is duplicate content? Thank you.
-
Essentially, you need to figure out the primary causes of duplicate content and then pick a way to handle each one. A great spot to find your duplicate content is in Google Webmaster Tools under the HTML Improvements section. Look at the section titled "Duplicate Title Tags": pages that share a title are likely candidates for duplicate content.
The primary ways to take care of it will be:
- NoIndexing
- Canonicalizing
- Parameter Handling in Google Webmaster Tools
Choosing which technique you use will likely be a result of what you are technically able to implement, based on each unique challenge from the different causes of duplicate content. You likely won't be able to kill all of the duplicate content at once. I suggest handling it in chunks. For example, first tackle the Items Shown problem you reference in your question. As you mentioned, you could canonicalize it. Basically, whenever the URL reflects your Item Parameter, you could canonicalize it back to the representative URL.
e.g.: yoursite.com/category-results?items=15 --> would canonicalize to yoursite.com/category-results
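Here is a minimal sketch in Python of that canonicalization, assuming the item count arrives as an ordinary query-string parameter (`?items=15`). The parameter names `items` and `view`, the function names, and the `https://` scheme are assumptions for illustration, not details of the actual site.

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical display-only parameters; replace with the ones the site really uses.
DISPLAY_PARAMS = {"items", "view"}


def canonical_url(url, display_params=DISPLAY_PARAMS):
    """Return the URL with display-only query parameters removed."""
    parts = urlsplit(url)
    kept = [
        (name, value)
        for name, value in parse_qsl(parts.query, keep_blank_values=True)
        if name not in display_params
    ]
    # Rebuild the URL without the dropped parameters and without any fragment.
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


def canonical_link_tag(url):
    """Build the <link> element to place in the <head> of the page."""
    return '<link rel="canonical" href="{}">'.format(escape(canonical_url(url), quote=True))
```

Parameters that change what the page is actually about (a brand filter, say) are kept, so only the display variants collapse onto the representative URL.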
Once you have the Number of Item pages out of the index, focus on the next biggest cause of duplicate content.
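To decide which cause to tackle next, one option is to take a list of crawled URLs (from whatever crawler export you have) and see which query parameters produce the most variants of the same path. The sketch below assumes that input is a plain list of URL strings. The function names are made up for illustration, and the groups it returns are only candidates, since two URLs with the same path can still show genuinely different content.

```python
from collections import Counter, defaultdict
from urllib.parse import parse_qsl, urlsplit


def duplicate_groups(urls):
    """Group URLs that share a host and path, ignoring the query string."""
    groups = defaultdict(list)
    for url in urls:
        parts = urlsplit(url)
        groups[(parts.netloc, parts.path)].append(url)
    # Keep only paths reached through more than one URL.
    return {key: found for key, found in groups.items() if len(found) > 1}


def parameter_counts(groups):
    """Count how many grouped URLs carry each query parameter name."""
    counts = Counter()
    for found in groups.values():
        for url in found:
            query = urlsplit(url).query
            names = {name for name, _ in parse_qsl(query, keep_blank_values=True)}
            counts.update(names)
    return counts
```

Calling `parameter_counts(duplicate_groups(urls)).most_common()` lists the parameter names from most to least frequent, which gives a rough order in which to work through them.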
-
Have you created a Moz campaign for the site? Mozbot crawls your site and tells you about all the duplicate content issues it finds.
To solve them, instead of checking or changing code all over the place, make the changes on the pages that you already know have duplicate content issues (like in the example you gave), then let Mozbot re-crawl the site so you can see which pages still have issues.
The rel canonical should point to the one version of the page that has the most info. As you said, the list view has less content, so the grid view is the better choice for the canonical.
If your site uses several categories and subcategories, you should also have a look at the noindex tag, as that structure sometimes creates duplicate content issues too (for example, subcategory products also listed in the root category). The same applies to any kind of listing, such as search results (which should be noindexed).
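As a rough sketch of the noindex idea, a page template could call something like the Python function below to decide whether to emit a robots meta tag. The `/search` prefix and the function name are assumptions for illustration; use whatever paths your own listing pages live under.

```python
# Hypothetical path prefixes for listing-style pages; adjust to the real site.
NOINDEX_PREFIXES = ("/search",)


def robots_meta(path, noindex_prefixes=NOINDEX_PREFIXES):
    """Return a robots meta tag for listing pages, or an empty string otherwise."""
    # str.startswith accepts a tuple, so several prefixes can be checked at once.
    if path.startswith(noindex_prefixes):
        return '<meta name="robots" content="noindex">'
    return ""
```

The returned tag belongs in the `<head>` of the page; pages outside the listed prefixes get nothing added.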
Hope this helps!