Is reported duplication on the pages or their canonical pages?
-
There are several sections getting flagged for duplication on one of our sites:
http://mysite.com/section-1/?something=X&confirmed=true
http://mysite.com/section-2/?something=X&confirmed=true
http://mysite.com/section-3/?something=X&confirmed=trueEach of the above are showing as having duplicates of the other sections. Indeed, these pages are exactly the same (it's just an SMS confirmation page you enter your code in), however, they all have canonical links back to the section (without the query string), i.e. section-1, section-2 and section-3 respectively.
These three sections have unique content and aren't flagged up for duplications themselves, so my questions are:
Are the pages with the query strings the duplicates, and if so why are the canonical links being ignored?
or
Are the canonical pages without the query strings the duplicates, and if so why don't they appear as URLs in their own right in the duplicate content report?
I am guessing it's the former, but I can't figure out why it would ignore the canonical links. Any ideas?
Thanks
-
This is good news sugar-coating bad news Thanks!
-
Hi,
The URLs that are reported by the crawl as being duplicates are the duplicate pages. Unfortunately the way the crawl from SEOMoz works, it does not factor the rel=canonical tag when reporting duplicates. In other words, even with the tag implemented, it will still report these pages as duplicates. Don't worry though, as long as the tag is implemented, the search engines should treat the canonical like a 301 redirect and not penalise you for duplicate content.
So to answer your question:
Are the pages with the query strings the duplicates? - Yes.
Hope that helps,
Adam
-
Hey,
It's kind of tricky to answer this without seeing at least two of the category pages but I am guessing that the duplication is in the category pages themselves and if they are simply very thin pages with little to differentiate category A from category B then there is your problem.
Rather than look at the web tool, if you export the spreadsheet this is a lot easier to understand and for each page there is a duplication column which has a comma separated list of the pages that are being flagged as possible duplicates so this should answer your question.
What to do though?
I may be telling you how to suck eggs but this is always a good read when it comes to thin content problems and solutions:
http://www.seomoz.org/blog/fat-pandas-and-thin-contentIf it was me, and these pages are thin, but that is the way they are supposed to be, and they are not really search landing pages then there is a good argument to noindex them and remove the possibility of them causing you any problems. If you do this, next time the campaign tool crawls your site they will be ignored and will not show up as a possible duplicate.
Obviously, from a Panda perspective, if these pages are listed as thin, they could be damaging other pages on the site so it is certainly an issue worth addressing.
Hope this helps!
Marcus
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why are http and https pages showing different domain/page authorities?
My website www.aquatell.com was recently moved to the Shopify platform. We chose to use the http domain, because we didn't want to change too much, too quickly by moving to https. Only our shopping cart is using https protocol. We noticed however, that https versions of our non-cart pages were being indexed, so we created canonical tags to point the https version of a page to the http version. What's got me puzzled though, is when I use open site explorer to look at domain/page authority values, I get different scores for the http vs. https version. And the https version is always better. Example: http://www.aquatell.com DA = 21 and https://www.aquatell.com DA = 27. Can somebody please help me make sense of this? Thanks,
On-Page Optimization | | Aquatell1 -
Duplicate home page URL on crawl test
Hi i just recently made a crawl test but before doing that i made sure that i have no more duplicates on my site i am using joomla and as of now i only have 11 links on my site but when my crawl test is done i saw duplicate url of my homepage the duplicate url has a trailing backslash so basically i have all the 11 links + 1 duplicate URL http://mangthomas.com http://mangthomas.com/ can you guys give advise how i can remove the duplicate i dont even know which one to retain. THANKS A LOT, cris
On-Page Optimization | | crisbasma0 -
Using Canonical Tags on Every Page
I'm doing competitive research and noticed that one of our competitors (who outranks us) uses canonical tags on every page on their site. The canonical tags reference the page they are on. For example. www.competitor.com/product has a canonical tag of www.competitor.com/product. Does anyone use this practice? It seems strange to me. Thank you, Kristen
On-Page Optimization | | Ksink0 -
Duplicate Content
I'm currently working on a site that sells appliances. Currently, there are thousands of "issues" with this site, many of them dealing with duplicate content. Now, the product pages can be viewed in "List" or "Grid" format. As Lists, they have very little in the way of content. My understanding is that the duplicate content arises from different URLs going to the same site. For instance, the site might have a different URL when told to display 9 items than when told to display 15. This could then be solved by inserting rel = canonical. Is there a way to take a site and get a list of all possible duplicates? This would be much easier than slogging through every iteration of the options and copying down the URLs. Also, is there anything I might be missing in terms of why there is duplicate content? Thank you.
On-Page Optimization | | David_Moceri0 -
Optimizing web page
Hi I have implemented most of the suggestions, which are offered threw SEOMOZ on our web site www.zaposlitev.net. But rankings are not improving. Could anyone help please? Thanks Tomaz
On-Page Optimization | | tomaz770 -
On Page Report Card F Grade Critical Factors
The website and page in question is http://www.upstrap-pro.com/ I sell non-slip camera straps and FYI for key word(s) camera strap(s) we were for a number of years on page 1 or 2. Google sold our registered trade-name _UP_strap® all over the web including Amazon. And of course we were hijacked for the keyword. Be that as it may According to SEOMOZ there are many errors on our homepage. I am having the host look at a number of SEOMOZ's report findings. Two critical findings that are making me nuts because I do not have the tech chops to understand why are: 1) Accessible to Engines <dl> <dt>Explanation</dt> <dd>Pages that can't be crawled or indexed have no opportunity to rank in the results. Before tweaking keyword targeting or leveraging other optimization techniques, it's essential to make sure this page is accessible.</dd> </dl> 2) Appropriate Use of Rel Canonical <dl> <dt>Explanation</dt> <dd>If the canonical tag is pointing to a different URL, engines will not count this page as the reference resource and thus, it won't have an opportunity to rank. Make sure you're targeting the right page (if this isn't it, you can reset the target above) and then change the canonical tag to reference that URL</dd> <dd>So here is the code:</dd> <dd>```
On-Page Optimization | | Asteg
<html xmlns="<a class="attribute-value">http://www.w3.org/1999/xhtml</a>"> <head> <title>DSLR-Camera-Straps Award Winning Non~Slip Shoulder Strapstitle> <meta name="<a class="attribute-value">description</a>" content="<a class="attribute-value">An Amazing Camera Strap that will NOT slip off your shoulder! Neck straps are bad for your neck & camera slings are bulky. Easy 60 day money back return policy.</a>" /> <meta http-equiv="<a class="attribute-value">Content-type</a>" content="<a class="attribute-value">text/html;charset=UTF-8</a>" /> <base href="http://www.upstrap-pro.com/Merchant2/" /> <link type="<a class="attribute-value">text/css</a>" rel="<a class="attribute-value">stylesheet</a>" href="css/00000002/cssui.css" media="" /> <link rel="<a class="attribute-value">canonical</a>" href="http://www.upstrap-pro.com/" /> </dl>0 -
Why does my on-page report card say my page title is 403 forbidden when its not?
I'm trying to get on top of my on page stuff and I'm going through the SEO Moz on-page report cards and it says I'm scoring a fail on certain elements within the 'critical' and 'high importance' factors as my page title is '403 forbidden' but when I go on to my site, my sites CMS it's not '403 forbidden' it's the text I entered?
On-Page Optimization | | jamesj35mm0 -
Why home page ranks higher than keyword-optimized page
We have a page that is optimized for the keyword "job scheduling". A search on the keyword "job scheduling" results in this page not ranking at all, while our home page (uc4.com) ranks third. Could you provide some ideas/suggestions as to why this would be the case and how to make our job scheduling page rank higher? Thanks, claudia
On-Page Optimization | | claudmar0