Duplicate Content Mystery
-
Hi Moz community!
I have an ongoing duplicate mystery going on here and I'm hoping someone here can answer my question.
We have an Ecommerce site that has a variety of product pages and category pages. There are Rel canonicals in place, along with parameters in GWT, and there are also URL rewrites.
Here are some scenarios, maybe you can give insight as to what’s exactly going on and how to fix it.
All the duplicates look to be coming from category pages specifically.
For example:
This link re-writes:To:
http://www.incipio.com/cases/tablet-cases/amazon-kindle-cases-sleeves.html
The rel canonical tag looks like this:
http://www.incipio.com/cases/tablet-cases/amazon-kindle-cases-sleeves.html" />
The CONTENT is different, but the URLs are the same. It thinks that the product category view is the same as the all products view, even though there is a canonical in there telling it which one is the original. Some of them don’t have anything to do with each other.
Take a look:
Link identified as duplicate:
Link this is a duplicate of:
http://www.incipio.com/cases/macbook-cases/macbook-pro-13in-cases.html
Any idea as to what could be happening here?
-
Hi Ishwar,
If you have done so yet it would be best to create your own post. Many people pop in here to help others and when they see this topic as answered they may not look at it. Creating your own post will get the most attention.
-
Hi Nicole,
Okay so the reason I stated that it appears something is improperly installed is due to the fact a page should in general have 1 head tag, 1 title tag, 1 body tag and 1 document type declaration. Your page has the normal ones you'd expect to see plus another set.
In the code I posted above you have an Iframe, which is basically a tag that says display information from a different source. In this case it is Google, which is fine but it should not contain another set of head, title, and body tags along with a document declaration. Google would never do that. This along with my years of experience looking at and installing ad-ons leads me to believe that something was installed incorrectly or at the very least not coded correctly.
As to the misconfiguration issue, I would look first at how my url rewrites are being done as there is no viable reason the first link you posted should rewrite to a url and serve different content than what is suppose to be there. That tells me that the re-writes are being incorrectly handled.
I hope that helps a little,
Don
-
Hello Moz Communtiy!
i am also having error of Duplicate Tag Content Mystery like:
http://www.earnmoneywithgoogleadsense.com/tag/blog-post/
http://www.earnmoneywithgoogleadsense.com/tag/effective-blog-post/
Pages are same. I have 100+ Error on website so how can i remove this error? DO you have any tutorial based on this?
Can i change canonical url at once or i need to set it one by one
-
Hi Donford,
Thanks so much for getting back to me. Great answer! I'd like some clarification here. I did not configure this and if I'm going to talk to the developer, I'd like to have more knowledge to speak to it.
Could you please clarify what you mean when you say:
- It looks like something is installed and configured improperly.
- You have 2 head tags on the page that shows up from the redirect.
- This is actually inside the first head tag complete with a body tag and another doc declaration.
I looked at the example you sent, but I'm not sure what I'm looking at. If you could explain those bullet points in more detail, it would greatly help.
You're the best!
Thanks,
Nicole
-
It looks like something is installed and configured improperly.
You have 2 head tags on the page that shows up from the redirect.
This is actually inside the first head tag complete with a body tag and another doc declaration.
<iframe id="oauth2relay579972146" name="oauth2relay579972146" src="https://accounts.google.com/o/oauth2/postmessageRelay?parent=http%3A%2F%2Fwww.incipio.com#rpctoken=728288212&forcesecure=1" style="width: 1px; height: 1px; position: absolute; top: -100px;" tabindex="-1">
<html><head><title>title><meta content="text/html; charset=utf-8" http-equiv="content-type"><meta content="IE=edge" http-equiv="X-UA-Compatible"><meta content="width=device-width, initial-scale=1, minimum-scale=1, maximum-scale=1, user-scalable=0" name="viewport"><script src="https://apis.google.com/js/api.js" type="text/javascript" gapi_processed="true"><script src="https://oauth.googleusercontent.com/gadgets/js/core:rpc:shindig.random:shindig.sha1.js?c=2" type="text/javascript"><script src="https://ssl.gstatic.com/accounts/o/3417060037-postmessagerelay.js">head><body>html>iframe>
That looks like an installation issue.
-
Now the misconfiguration issue would have to be why the URL re-writes to page but serves up different content.
-
And lastly I think even if you fix those issues you're still going to get duplicate content warnings because you have very thin content on pages.
-
Example: Page 1 http://www.incipio.com/cases/tablet-cases/amazon-kindle-cases-sleeves/amazon-kindle-fire-hd-6-cases.html
-
Example: Page 2 http://www.incipio.com/cases/tablet-cases/amazon-kindle-cases-sleeves/amazon-kindle-fire-hd-7-cases.html
-
On those 2 pages there is a 1 character difference 6 instead of 7. All the other content (header & footer) and 1 letter difference. Than if you go to the actual product page you have the exact same issue same description to the letter except the one number. Yep, you're going to have a duplicate content problem.
-
This is something that all e-commerce stores face. You honestly need to write unique content for each and every product you sell. Don't copy & paste stuff from another site like Amazon or the manufacturers site, write your own content.
-
In summation, I would recheck any modules/ad-ons/plug-ins you installed as one appears to be incorrect. if that doesn't' fix the re-write issue have a developer that is familiar with your ecommerce platform look at this issue. Lastly, you got to have unique content.
-
Maybe not the best news but I hope it helps
-
Don
Edit in bullet points to try and make the post a look a little better. These forums don't take kindly to adding code blocks
-
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content question
Hey Mozzers! I received a duplicate content notice from my Cycle7 Communications campaign today. I understand the concept of duplicate content, but none of the suggested fixes quite seems to fit. I have four pages with HubSpot forms embedded in them. (Only two of these pages have showed up so far in my campaign.) Each page contains a title (Content Marketing Consultation, Copywriting Consultation, etc), plus an embedded HubSpot form. The forms are all outwardly identical, but I use a separate form for each service that I offer. I’m not sure how to respond to this crawl issue: Using a 301 redirect doesn’t seem right, because each page/form combo is independent and serves a separate purpose. Using a rel=canonical link doesn’t seem right for the same reason that a 301 redirect doesn’t seem right. Using the Google Search Console URL Parameters tool is clearly contraindicated by Google’s documentation (I don’t have enough pages on my site). Is a meta robots noindex the best way to deal with duplicate content in this case? Thanks in advance for your help. AK
Technical SEO | | AndyKubrin0 -
Internal duplicated content on articles, when is too much?
I have an automotive rental blog with articles that explain the pros of renting a specific model. So in this articles the advantages of rental versus the buying of a new model. This advantages are a list with bullets like this:
Technical SEO | | markovald
Rental | Buy new car
Rental:
Free car insurance
Free assistance
etc.
Buy new car
You have to pay insurance
You have to pay assistance
etc. etc. I want to do this because i want to make all articles like landing pages...
This "advantages box" have 100 characters. The general length of articles on my blog is 500/600 characters. So i have an average of 15/20% internal duplicated content on all my articles. Is this bad for seo? Any alternatives?0 -
Hreflang and possible duplicate content SEO issue
| 0 <a class="vote-down-off" title="This question does not show any research effort; it is unclear or not useful">down vote</a> favorite | Hey community, my first question here 🙂 Imagine there is a page with video, it has hreflang tags setup, to lead let's say German visitors to /de/ folder... So, on that German version of page, everything like menus, navigation and such are in German, but the video is the same, the title of the video (H1 tag) is the same, <title></code></strong> and <strong><code>meta description</code></strong> is the same as on the original English page. It means that general (English) page and German version of it has the same key content in English.</p> <p>To me it seems to be a SEO duplicate content issue. As I know, Google doesn't think that content is duplicate, if it is properly translated to other language.</p> <p>Does my explained case mean that the content will be detected by Google as duplicate?</p> </div> </div> </td> </tr> </tbody> </table></title> |
Technical SEO | | poiseo0 -
Removed .html - Now Get Duplicate Content
Hi there, I run a wordpress website and have removed the .html from my links. Moz has done a crawl and now a bunch of duplicated are coming up. Is there anything I need to do in perhaps my htaccess to help it along? Google appears to still be indexing the .html versions of my links
Technical SEO | | MrPenguin0 -
Sites for English speaking countries: Duplicate Content - What to do?
HI, We are planning to launch sites specific to target market (geographic location) but the products and services are similar in all those markets as we sell software.So here's the scenario: Our target markets are all English speaking countries i.e. Britain, USA and India We don't have the option of using ccTLD like .co.uk, co.in etc. How should we handle the content? Because product, its features, industries it caters to and our services are common irrespective of market. Whether we go with sub-directory or sub-domain, the content will be in English. So how should we craft the content? Is writing the unique content for the same product thrice the only option? Regards
Technical SEO | | IM_Learner0 -
Duplicate Content
SEOmoz is reporting duplicate content for 2000 of my pages. For example, these are reported as duplicate content: http://curatorseye.com/Name=“Holster-Atlas”---Used-by-British-Officers-in-the-Revolution&Item=4158
Technical SEO | | jplill
http://curatorseye.com/Name=âHolster-Atlasâ---Used-by-British-Officers-in-the-Revolution&Item=4158 The actual link on the site is http://www.curatorseye.com/Name=“Holster-Atlas”---Used-by-British-Officers-in-the-Revolution&Item=4158 Any insight on how to fix this? I'm not sure where the second version of the URL is coming from. Thanks,
Janet0 -
Duplicate Content from Google URL Builder
Hello to the SEOmoz community! I am new to SEOmoz, SEO implementation, and the community and recently set up a campaign on one of the sites I managed. I was surprised at the amount of duplicate content that showed up as errors and when I took a look in deeper, the majority of errors were caused by pages on the root domain I put through Google Analytics URL Builder. After this, I went into webmaster tools and changed the parameter handling to ignore all of the tags the URL Builder adds to the end of the domain. SEOmoz recently recrawled my site and the errors being caused by the URL Builder are still being shown as duplicates. Any suggestions on what to do?
Technical SEO | | joshuaopinion0 -
Mapping Internal Links (Which are causing duplicate content)
I'm working on a site that is throwing off a -lot- of duplicate content for its size. A lot of it appears to be coming from bad links within the site itself, which were caused when it was ported over from static HTML to Expression Engine (by someone else). I'm finding EE an incredibly frustrating platform to work with, as it appears to be directing 404's on sub-pages to the page directly above that subpage, without actually providing a 404 response. It's very weird. Does anyone have any recommendations on software to clearly map out a site's internal link structure so that I can find what bad links are pointing to the wrong pages?
Technical SEO | | BedeFahey0