Duplication, pagination and the canonical
-
Hi all, and thank you in advance for your assistance.
We have an issue of paginated pages being seen as duplicates by pro.moz crawlers.
The paginated pages do have duplicated content, but are not duplicates of each other. Rather, they pull through a summary of the product descriptions from other landing pages on the site.
I was planning to use rel=canonical to deal with them, however I am concerned as the paginated pages are not identical to each other, but do feature their own set of duplicate content!
We have a similar issue with pages that are not paginated but feature tabs that alter the URL parameters like so:
?st=BlueWidgets
?st=RedSocks
?st=Offers
These are being seen as duplicates of the main URL, and again all feature duplicate content pulled from elsewhere in the site, but are not duplicates of each other. Would a canonical tag be suitable here?
Many Thanks
-
The rel next prev is not for duplicated content - it just shows Google how the parts relate to the whole.
An alternative to the rel next prev is the "Classic Pagination for SEO" approach that uses noindex, described in another article by Adam Audette:
http://searchengineland.com/the-latest-greatest-on-seo-pagination-114284
If you have a duplicate issue, this would solve it as you would noindex all the duplicate pages.
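As a rough illustration of the noindex approach (my own sketch, not taken from the article): the function below picks a robots meta tag for a listing URL. The `page` query parameter and the choice of `noindex, follow` for every page after the first are assumptions made for the example; your platform may number its pages differently.

```python
from urllib.parse import urlsplit, parse_qs


def robots_meta(url: str) -> str:
    """Return a robots meta tag for a paginated listing URL.

    Assumption: the page number lives in a hypothetical `page` query
    parameter, and a missing parameter means page 1.
    """
    query = parse_qs(urlsplit(url).query)
    page = int(query.get("page", ["1"])[0])
    if page > 1:
        # Pages 2+ stay crawlable (follow) but are kept out of the index.
        return '<meta name="robots" content="noindex, follow">'
    return '<meta name="robots" content="index, follow">'


if __name__ == "__main__":
    print(robots_meta("http://example.com/example/"))
    print(robots_meta("http://example.com/example/?page=3"))
```

The same idea could be extended to the tabbed `?st=` URLs if those should stay out of the index as well.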
What you need to do (and I can't do this for you), is to look at all the crawl paths that you are providing Google. As I mention above, you are not doing any favors to Google or to your site when you show Google an infinite number of paths to get to the same content. It just wastes Google's time and you don't want to do that when Google also has to crawl the rest of the internet. If you solve this issue, you will solve your duplicate issue.
AJ Kohn just posted an article on the concept of crawl budget that talks about this. I think the article is quite good and it explains why we need to look at all the topics of noindex, nofollow, robots, canonical and rel next prev: http://www.blindfiveyearold.com/crawl-optimization
-
Thanks CleverPhD,
That's a very interesting read by Adam Audette too, thanks.
I should say that there's no internal search, each tab has a series of duplicated 'blurbs' taken from the product's unique landing page, while the body copy remains the same across the slight variations in the URL. So with:
example.com/example/
example.com/example/?st=BlueWidgets
example.com/example/?st=RedSocks
all of these will feature the same body copy, while the last two will have a series of small descriptions from other landing pages in the site. Would the canonical tag be appropriate in this case? We only need to index 'example.com/example'.
Also, does the rel next prev take into account duplicate content? We want only the main URL indexed, as all the paginated pages feature duplicate content. There is no view-all page, however.
Many thanks
-
If I am understanding the question - I think pulling in some body copy from each search result (and not just the whole page) would be fine. I think Google will see that this is a search result and that you are pointing to other pages. You are probably going to pull in text from the title too. This is common practice in search results - heck Google does it!
If you are still concerned about the pulled in descriptions, your option is to set up the system to have an alternate description for each page. Use the alternate description when you pull it into your main page. It is more work, but it will eliminate this issue.
Separately, paginated pages no longer need to be canonicalized to the index page. You can use rel next and prev.
http://googlewebmastercentral.blogspot.com/2011/09/pagination-with-relnext-and-relprev.html
https://support.google.com/webmasters/answer/1663744?hl=en
It explains to Google the relationship between page 1 and pages 2, 3, 4, 5 and so on up to page n.
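To make that concrete, here is a minimal sketch of which link tags each page in a series would carry: the first page gets only a "next", the last page only a "prev", and pages in between get both. The `?page=N` URL pattern and the function name are assumptions for the example, not something from the Google articles.

```python
from typing import List


def pagination_links(base_url: str, page: int, total_pages: int) -> List[str]:
    """Build rel="prev"/rel="next" link tags for one page of a series.

    Assumption: page 1 lives at base_url and later pages at a
    hypothetical base_url?page=N.
    """
    if not 1 <= page <= total_pages:
        raise ValueError("page must be between 1 and total_pages")

    def page_url(n: int) -> str:
        return base_url if n == 1 else f"{base_url}?page={n}"

    links = []
    if page > 1:
        links.append(f'<link rel="prev" href="{page_url(page - 1)}">')
    if page < total_pages:
        links.append(f'<link rel="next" href="{page_url(page + 1)}">')
    return links


if __name__ == "__main__":
    for tag in pagination_links("http://example.com/example/", 2, 3):
        print(tag)
```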
Beyond that, you need to watch that you do not end up with too many paginated paths to the exact same product pages. Let's say you had 1,000 widgets that were blue, red and green and also were Free, Expensive or Cheap. You would have several sets of paginated pages (one set for Blue, one each for Red, Green, Free, Cheap and Expensive, one for Red and Expensive, etc.). It gets to be a little crazy as they all lead to the same set of widget product pages. You need to manage how Google crawls all that so your paginated category pages do not look like duplicates. Adam Audette writes great stuff on this. Look here for things to consider:
http://www.rimmkaufman.com/blog/site-search-dynamic-content-and-seo/01032013/
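To give a feel for how quickly the widget example grows, here is a small sketch that counts the separate listing sets. It assumes a listing exists for each single filter and for each color/price pair only; a real site may expose more (or fewer) combinations, and every one of these sets is then paginated on top.

```python
from itertools import product

colors = ["Blue", "Red", "Green"]
prices = ["Free", "Cheap", "Expensive"]


def listing_sets(colors, prices):
    """List every filter combination that gets its own paginated listing.

    Assumption: one listing per single filter, plus one per
    (color, price) pair.
    """
    single = [(c,) for c in colors] + [(p,) for p in prices]
    combined = list(product(colors, prices))
    return single + combined


if __name__ == "__main__":
    sets = listing_sets(colors, prices)
    print(len(sets))  # 6 single filters + 9 pairs = 15
```

That is 15 paginated series all leading to the same widget product pages, from just two filters with three values each.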
-
Thank you Robert, and for the helpful link.
You did read my question correctly, however I failed to ask it entirely correctly. Just to complicate matters, I neglected to mention that there is body copy on each page, which technically will be duplicated.
It sits above the tabs and does not change, while the tabbed pages - under new URL parameters - pull in a sentence or two of product description from elsewhere (a unique landing page).
So,
?st=BlueWidgets
?st=RedSocks
?st=Offers
will all feature the same body copy and different duplicate content. For obvious reasons, we only want the SE to index the main URL.
Any ideas?
Thanks again
-
Hi
It doesn't sound like rel=canonical is the solution, as each one of your pages might feature multiple pieces of content from various other parts of your website (if I've read your question correctly) - so which would be the canonical version of the page?
You could use Parameter Handling in Webmaster Tools to ensure Google knows what to do with your various parameters. Moz doesn't matter here, as long as Search Engines are aware of how to handle your pages correctly.
There's a good overview here.
I hope that's helpful
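For what it's worth, here is a rough sketch of what "ignore the st parameter" amounts to: the tabbed URLs all collapse to the one main URL. This is only an illustration of the idea in Python, not how Webmaster Tools works internally, and the function name is made up for the example.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode


def strip_params(url: str, ignored=("st",)) -> str:
    """Drop the ignored query parameters and return the remaining URL.

    Assumption: `st` only switches tabs, as in the question.
    """
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in ignored
    ]
    return urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(kept), parts.fragment)
    )


if __name__ == "__main__":
    for tab in ("BlueWidgets", "RedSocks", "Offers"):
        print(strip_params(f"http://example.com/example/?st={tab}"))
```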