Is reported duplication on the pages or their canonical pages?
-
There are several sections getting flagged for duplication on one of our sites:
http://mysite.com/section-1/?something=X&confirmed=true
http://mysite.com/section-2/?something=X&confirmed=true
http://mysite.com/section-3/?something=X&confirmed=trueEach of the above are showing as having duplicates of the other sections. Indeed, these pages are exactly the same (it's just an SMS confirmation page you enter your code in), however, they all have canonical links back to the section (without the query string), i.e. section-1, section-2 and section-3 respectively.
These three sections have unique content and aren't flagged up for duplications themselves, so my questions are:
Are the pages with the query strings the duplicates, and if so why are the canonical links being ignored?
or
Are the canonical pages without the query strings the duplicates, and if so why don't they appear as URLs in their own right in the duplicate content report?
I am guessing it's the former, but I can't figure out why it would ignore the canonical links. Any ideas?
Thanks
-
This is good news sugar-coating bad news Thanks!
-
Hi,
The URLs that are reported by the crawl as being duplicates are the duplicate pages. Unfortunately the way the crawl from SEOMoz works, it does not factor the rel=canonical tag when reporting duplicates. In other words, even with the tag implemented, it will still report these pages as duplicates. Don't worry though, as long as the tag is implemented, the search engines should treat the canonical like a 301 redirect and not penalise you for duplicate content.
So to answer your question:
Are the pages with the query strings the duplicates? - Yes.
Hope that helps,
Adam
-
Hey,
It's kind of tricky to answer this without seeing at least two of the category pages but I am guessing that the duplication is in the category pages themselves and if they are simply very thin pages with little to differentiate category A from category B then there is your problem.
Rather than look at the web tool, if you export the spreadsheet this is a lot easier to understand and for each page there is a duplication column which has a comma separated list of the pages that are being flagged as possible duplicates so this should answer your question.
What to do though?
I may be telling you how to suck eggs but this is always a good read when it comes to thin content problems and solutions:
http://www.seomoz.org/blog/fat-pandas-and-thin-contentIf it was me, and these pages are thin, but that is the way they are supposed to be, and they are not really search landing pages then there is a good argument to noindex them and remove the possibility of them causing you any problems. If you do this, next time the campaign tool crawls your site they will be ignored and will not show up as a possible duplicate.
Obviously, from a Panda perspective, if these pages are listed as thin, they could be damaging other pages on the site so it is certainly an issue worth addressing.
Hope this helps!
Marcus
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Should i make all of my pages with canonical tag
Hi, Im using thesis Wordpress theme, and their default option is "Add canonical <acronym title="Uniform Resource Locator">URL</acronym>s to your site" im just wandering if i should keep that box checked and apply canonical <acronym title="Uniform Resource Locator">URL</acronym>s to all of my pages? Thank You
On-Page Optimization | | Vmezoz0 -
Best practice to solve this Unique duplicate page content issue?
I just got Seomoz Pro (it's awesome!), and when I did a campaign for my website I discovered that I have a big issue with duplicate page content (as well as titles). The Crawl Diagnostics Summary told me I have 196 Crawl Errors Found (I had a total of 362 pages crawled on my site), and as much as 160 of these was duplicate page content. Which to me sounds like a big problem, correct me if I'm wrong (I'm very new to SEO). So our website is an ecommerce that sells greeting cards. The unique part about our platform is that we offer the customer to make a customization of the cards.
On-Page Optimization | | danielpett
Let me walk you through each step a customer takes so you fully understand: They find a card they like and visit the product page of that card (just like on any ecommerce store.) They then decide they want to buy it. There is no "Add to cart" button, they will instead click on a "customize the card" button. 3) This takes them to a step by step process of customizing the card. They change the name on the front of the greeting card so it says for example: "Happy Birthday Katy!". And then adds a personal text on the inside of the card. They then add an delivery address and when it should be delivered. After that they proceed to checkout and it's all done. This is my website (it's in Swedish): loveday.se - it will take you to a product page so that you can click the green button and see what I mean with the customization pages. Hopefully it helps even though it's in Swedish. My issue starts at the customization part of the site (the bolded step above), as I can see the permalinks in the diagnostics I got.
This step-by-step process looks exactly the same with every card in the store. Same call-to-action headline, same descriptive text etc. The only difference is a JPEG-file with the unique greeting card design. So, what is your take on this? Let me know if I was unclear about something. Any help or advice is greatly appreciated.0 -
How much SEO value does a fashion site get from bolting text onto the bottom of home page? Does the value compensate for cluttering up a page focused on an iconic image?
Getting ready to launch a completely redesigned site for a fashion designer. Since it is a fashion site, visitors do not need text to describe what the site is about., We are weighing three options: 1) clean design with no text (just images and navigational links), 2) bolting on a couple of sentences of text at the bottom of the page to signal keyword terms to the search engines, 3) following the lead of the top ranking site in the category and adding lots of text to the bottom of the page. Do the SEO benefits justify cluttering up the design by bolting text onto the bottom of the home page, and if so, how many characters of text seem to be the minimum to be effective?
On-Page Optimization | | RandyP0 -
The "100 links/page recommendation" - Do Duplicate Links Count?
We have way too many links on our homepage. The PageRank Link Juice Calculator (www.ecreativeim.com/pagerank-link-juice-calculator.php) counts them to 300. But all of them are not unique, that is some links point to the same URL. So my question: does the "100 links/page recommendation" refer to all anchors on the page or only to unique link target URLs? I know "100" is just a standard recommendation.
On-Page Optimization | | TalkInThePark0 -
View all Page for Product Overview Pages
Hi everybody! We have an ecommerce site with product overview pages, where sometimes there are hundreds of products listed. Usually, we just display 30 and have a button where users can click to see 30 more - or all products listed at once. This is the overview page (as indexed in google): http://www.geschenkidee.ch/aussergewoehnliches.html
On-Page Optimization | | zeepartner
And this is the view-all page: http://www.geschenkidee.ch/aussergewoehnliches.html#all What should I do here? The product overview page will hardly generate more traffic by listing all products (because the overview page will rank for generic keywords, while the product keyword searches will be referred to the specific product pages themselves). I was originally thinking of using rel=canonical pointing to the view-all page. But this would just lead to longer load time. Should we just leave those overview pages or is there a best practice for how to deal with such pages? Thanks for your thoughts on this!0 -
On page report card 410 error
I have been trying to test my site through the on page report card using our primary keyword phrase, however, I keep getting the following error message: We were unable to grade that page. The page did not load. Got a 410 response code from server If I try the same search and keyword phrase on other sites, it does work. Am I doing something wrong?
On-Page Optimization | | duesoon0 -
Add Rel Canonical to all pages on my site (Magento)
Can anyone guide me as to how to add the REL CANONICAL feature to every page on my website (Magento shopping cart) Thanks
On-Page Optimization | | lacx.com0 -
Page Authority
I have recently optimised a set of images for a client of ours: I'm looking through all the PA of these newly optimised images, and have varying PA {from SEOmoz toolbar} I understand that internal linking will pass link juice, and obviously external links will add to the overall PA. I have several pages with a PA of 36: { Fairly deep pages} Yet they have no external or internal links going to them. My question is "How can a page gain any authority when it has no visible links pointing at it?" Obviously there must be a link pointing at it {internally} as Google wouldn't have crawled the page right? Also lets say all the keywords are of equal competitiveness would the keywords with highest PA rank higher than those on O PA pages. Many Thanks
On-Page Optimization | | Yozzer0