Duplicate Content
-
I'm working on a site that sells appliances. Right now there are thousands of "issues" with this site, many of them dealing with duplicate content. The product pages can be viewed in "List" or "Grid" format. As Lists, they have very little in the way of content.
My understanding is that the duplicate content arises from different URLs leading to the same page. For instance, the site might use a different URL when told to display 9 items than when told to display 15. This could then be solved by adding rel="canonical".
Is there a way to take a site and get a list of all possible duplicates? This would be much easier than slogging through every iteration of the options and copying down the URLs. Also, is there anything I might be missing in terms of why there is duplicate content? Thank you.
-
Essentially, you need to figure out the primary causes of duplicate content and then pick a way to handle each one. A great spot to find your duplicate content is in Google Webmaster Tools under the HTML Improvements section. Look at the section titled "Duplicate Title Tags": pages that share a title are often the same content served under different URLs.
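If you would rather check this against your own crawl export, the same idea can be sketched in a few lines of Python. This is only a rough sketch: the `pages` dictionary and the `group_by_title` helper are made up for illustration, and the URLs and titles are placeholders rather than real crawl data.

```python
from collections import defaultdict


def group_by_title(pages):
    """Return {normalized title: [urls]} for titles shared by more than one URL."""
    groups = defaultdict(list)
    for url, title in pages.items():
        # Collapse whitespace and ignore case so trivial differences
        # do not hide a duplicate.
        key = " ".join(title.split()).lower()
        groups[key].append(url)
    return {title: sorted(urls) for title, urls in groups.items() if len(urls) > 1}


# Hypothetical crawl output: URL -> <title> text
pages = {
    "yoursite.com/category-results": "Refrigerators | Your Site",
    "yoursite.com/category-results?items=15": "Refrigerators | Your Site",
    "yoursite.com/dishwashers": "Dishwashers | Your Site",
}

duplicates = group_by_title(pages)
for title, urls in duplicates.items():
    print(title, urls)
```

A shared title is only a hint, so each group still needs a quick manual look before you decide how to handle it.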
The primary ways to take care of it will be:
- NoIndexing
- Canonicalizing
- Parameter Handling in Google Webmaster Tools
Which technique you choose will likely depend on what you are technically able to implement for each cause of duplicate content. You likely won't be able to kill all of the duplicate content at once, so I suggest handling it in chunks. For example, first tackle the items-shown problem you reference in your question. As you mentioned, you could canonicalize it. Basically, whenever the URL contains your items parameter, you could canonicalize it back to the representative URL.
e.g.: yoursite.com/category-results?items=15 --> would canonicalize to yoursite.com/category-results
Once you have the items-shown pages out of the index, focus on the next biggest cause of duplicate content.
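As a rough illustration of that mapping, here is a minimal Python sketch that strips display-only parameters from a URL and builds the matching canonical tag. The parameter names in `DISPLAY_PARAMS` and both helper functions are assumptions for the example; swap in whatever your site actually puts in its URLs.

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumed names of parameters that only change presentation, not content.
DISPLAY_PARAMS = {"items", "view"}


def canonical_url(url):
    """Drop display-only query parameters and any fragment from a URL."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in DISPLAY_PARAMS
    ]
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


def canonical_link_tag(url):
    """Build the <link> element to place in the page's <head>."""
    return f'<link rel="canonical" href="{escape(canonical_url(url), quote=True)}">'


print(canonical_link_tag("https://yoursite.com/category-results?items=15"))
```

Parameters that do change the content, such as a brand filter, are kept, so only the presentation variants collapse onto one URL.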
-
Have you created a Moz campaign for the site? Mozbot will crawl your site and tell you about any duplicate content issues it finds.
To solve them, instead of checking or changing code all over the place, make the changes on the pages that you already know have duplicate content issues (like in the example you gave), then let Mozbot re-crawl the site so you can see which pages still have issues.
The rel canonical should point to the one page that has the most info (as you said, the list view has less, so the grid view will be the better canonical).
If your site uses several categories and subcategories, you should also look at the noindex tag, as category structures sometimes create duplicate content too (subcategory products also listed in the root category). The same applies to any kind of listing, such as search results (which should be noindexed).
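For the noindex part, a template helper along these lines is one way to do it. Treat it as a sketch only: the page-type labels and the `robots_meta` function are invented for the example, and which page types you noindex is a decision for your own site.

```python
# Assumed page-type labels; a real site would map its own templates to these.
NOINDEX_TYPES = {"search_results"}


def robots_meta(page_type):
    """Return a robots meta tag for listing pages that should stay out of the index."""
    if page_type in NOINDEX_TYPES:
        # "noindex, follow" keeps the page out of the index while still
        # letting crawlers follow its links to the product pages.
        return '<meta name="robots" content="noindex, follow">'
    return ""


print(robots_meta("search_results"))
```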
Hope this helps!