Is reported duplication on the pages or their canonical pages?
-
There are several sections getting flagged for duplication on one of our sites:
http://mysite.com/section-1/?something=X&confirmed=true
http://mysite.com/section-2/?something=X&confirmed=true
http://mysite.com/section-3/?something=X&confirmed=trueEach of the above are showing as having duplicates of the other sections. Indeed, these pages are exactly the same (it's just an SMS confirmation page you enter your code in), however, they all have canonical links back to the section (without the query string), i.e. section-1, section-2 and section-3 respectively.
These three sections have unique content and aren't flagged up for duplications themselves, so my questions are:
Are the pages with the query strings the duplicates, and if so why are the canonical links being ignored?
or
Are the canonical pages without the query strings the duplicates, and if so why don't they appear as URLs in their own right in the duplicate content report?
I am guessing it's the former, but I can't figure out why it would ignore the canonical links. Any ideas?
Thanks
-
This is good news sugar-coating bad news
Thanks!
-
Hi,
The URLs that are reported by the crawl as being duplicates are the duplicate pages. Unfortunately the way the crawl from SEOMoz works, it does not factor the rel=canonical tag when reporting duplicates. In other words, even with the tag implemented, it will still report these pages as duplicates. Don't worry though, as long as the tag is implemented, the search engines should treat the canonical like a 301 redirect and not penalise you for duplicate content.
So to answer your question:
Are the pages with the query strings the duplicates? - Yes.
Hope that helps,
Adam
-
Hey,
It's kind of tricky to answer this without seeing at least two of the category pages but I am guessing that the duplication is in the category pages themselves and if they are simply very thin pages with little to differentiate category A from category B then there is your problem.
Rather than look at the web tool, if you export the spreadsheet this is a lot easier to understand and for each page there is a duplication column which has a comma separated list of the pages that are being flagged as possible duplicates so this should answer your question.
What to do though?
I may be telling you how to suck eggs but this is always a good read when it comes to thin content problems and solutions:
http://www.seomoz.org/blog/fat-pandas-and-thin-contentIf it was me, and these pages are thin, but that is the way they are supposed to be, and they are not really search landing pages then there is a good argument to noindex them and remove the possibility of them causing you any problems. If you do this, next time the campaign tool crawls your site they will be ignored and will not show up as a possible duplicate.
Obviously, from a Panda perspective, if these pages are listed as thin, they could be damaging other pages on the site so it is certainly an issue worth addressing.
Hope this helps!
Marcus
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
What is the best meta description for Category Pages, Tag Pages and Main Article?
Hi, I want to index all my categories and tags. But I fear about duplicating the meta description. for example: I have a tag name "Learn Stock Market", a category name "Learning", and a main article "What is Stock Market". What is your suggestion for meta description of these three pages that looks great for seo google?
On-Page Optimization | | mbmozmb0 -
Canonicals
I dynamically generated pages using rewrite functions in wordpress (new-york, san-diego, san-francisco). All these pages look the same but with different content. yoast (seo wordpress plugin) was unaware of this and set canonicals up relating to the wordpress page used as the template page for the dynamic pages (City_home_page). so all these pages had the canonical https://dinnerdancecruises.com/City_Home_Page. using search console, i saw google indexed my site, looked at all these dynamically created pages (which is about 30 pages) and took them all in as duplicate pages. this happen sometime in april or may. I fixed this problem and set unique canonicals up for each dynamically created page. but now google is not crawling them for some reason. im not sure why its been months and these pages are not indexed. i thought to myself is it because these links end up on multiple pages? sort of like having "terms of agreement" link at the footer. every single page has that terms of agreement link. does google look at those links as duplicates and not index the page at all. this is where my issue lies. i need google to crawl regularly and see those pages with their new, unique canonicals and re-index them correctly. but it seems to save cpu resources, google feels once a thief always a thief. i could be wrong but this is why i need your suggestion. thank you.
On-Page Optimization | | bobperez7360950 -
Can you 301 redirect to a page that has other pages 301 to it?
Two years ago updated url page to include better keywords and used a 301 redirect from the old page to the new. so www.example.com/keyword-1st-generation.html now points to ... www.example.com/keyword-2nd-generation.html That moved the pages up in ranking, but now have better kw for the url, so is it okay to redirect the /keyword-2nd-geration-html to www.example.com/keyword-3rd-generation.html And what is a good length of time before removing the 1st-generation url? It's been 3 years and there is no chance of using it again. Plus, no sign of it in analytics.
On-Page Optimization | | AllIsWell0 -
E commerce Website canonical and duplicate content isssue
i have a ecomerce site , i am just wondering if any one could help me answer this the more info page can be access will google consider it as duplicate and if it does then how to best use the canonical tag http://domain.com/product-page http://domain.com/product-page/ http://domain.com/product-Page http://domain.com/product-Page/ also in zencart when link product it create duplicate page content how to tackle it? many thanks
On-Page Optimization | | conversiontactics0 -
Canonical Notice
I am curious why I receive this canonical notice even though there is a canonical for this homepage. Nq3fD.jpg
On-Page Optimization | | paumer800 -
Home Page
We are re-design our home page, one are of the current home page has a drop down window called "popular products" . We wrote short articles for our keywords and have them linked to product page. In the past, it has helped us rank. However, with new Google rules, our feeling is that such practice is no good. So, we lean towards to remove it. Still, we'd like to hear some opinions and ask some questions too: www.butterflycraze.com is it clear to you that this is not good in Google's eyes? how do I determine if these links serving any SEO purpose now after Panda? depend on the answer to 2), what should we do about these pages? shall be re-direct or shall we remove them from Google index?
On-Page Optimization | | ypl0 -
Duplicated Page Content
I have encountered this weird problem about duplicate page content. My site got 3 duplicate content similar on the link structure below. If I'm going to use rel canonical does it help to resolve the duplication problem? Thanks http://www.sample.com http://www.sample.com/ http://www.sample.com/index.php
On-Page Optimization | | mattvectorbpo0 -
Duplicate page content errors
Site just crawled and report shows many duplicate pages but doesn't tell me which ones are dups of each other. For you experienced duplicate page experts, do you have a subscription with copyscape and pay $.05 per test? What is the best way to clear these? Thanks in advance
On-Page Optimization | | joemas990