Duplicate content penalty
-
when moz crawls my site they say I have 2x the pages that I really have & they say I am being penalized for duplicate content. I know years ago I had my old domain resolve over to my new domain. Its the only thing that makes sense as to the duplicate content but would search engines really penalize me for that? It is technically only on 1 site. My business took a significant sales hit starting early July 2013, I know google did and algorithm update that did have SEO aspects. I need to resolve the problem so I can stay in business
-
Thx Jane- No I wasn't aware of that. I don't get it because I put canonical tags right under the Head and I used the code below to do it. I will check again but am unsure how to fix it
I don't even know how to fix coding on the Http://cheaptubes.com site. It seems like when I add content to the canonical site it updates all of them. Thx for pointing out errors, you are giving me something to fix and improve.
-
Hi again,
Are you aware that you have a canonical tag on http://cheaptubes.com that points to a non-existent URL? i.e. http://i.imgur.com/yEd2377.png
http://www.cheaptubes.com/default.html
If http://cheaptubes.com/ 301 redirected to http://www.cheaptubes.com/, this would resolve the issue.
Are you aware that the www version of your site shows for a brand search (https://www.google.co.uk/search?q=cheaptubes.com&oq=cheaptubes.com&aqs=chrome..69i58j69i60l2j69i57j69i60j0.3367j0j4&sourceid=chrome&es_sm=91&ie=UTF-8) but that the canonical tags on each page point to the non-www version, e.g. http://i.imgur.com/P7Tizsv.png and http://i.imgur.com/lhTA95w.png?
The canonical tag on the www.cheaptubes.com/ page also points to http://www.cheaptubes.com/default.html. Sorry to show so many errors, but it doesn't look like canonicalisation has been implemented properly here.
-
Thx Jane - I may not have put a canonical tag on that page yet but its the same for every page. I can't access the http://cheaptubes.com but I can access the canonical version to publish. I did put canonical tags on most of my other pages such as the SWNTs page but it still shows a non canonical version when moz crawls it. Perhaps a 301 from http://cheaptubes.com to the canonical page? I'm just not sure how to handle it.
-
Hi Mike,
Are you saying that there is a canonical tag on http://cheaptubes.com/cntmaterialsafetydatasheet.htm, pointing to http://www.cheaptubes.com/cntmaterialsafetydatasheet.htm? This would solve the duplicate content problem, but I do not see a canonical tag on either of those pages...
-
Hi Everyone - I'm hoping you can help me out again. I have a functional 301 on cheaptubesinc.com. that cleared about 1/2 my dup content penalty on the moz crawl this week. As you can see in the results below, I still have 57 pages with dup content according to Moz.
57 Duplicate Page Content
13 4XX (Client Error)
57 Duplicate Page TitleI checked and I think it is mostly a canonical problem. I do have Rel Canonical tags on all my pages. When I clicked on the 1st one it appears that is the case, see below
cheaptubes.com carbon nanotubes msds
http://www.cheaptubes.com/cntmaterialsafetydatasheet.htm29414872001 duplicate
cheaptubes.com carbon nanotubes msds
http://cheaptubes.com/cntmaterialsafetydatasheet.htm25My question is, do I need another 301 from http://cheaptubes.com to the canonical version? I'ld rather not since I had to fight with network solutions for a week for them to add the / after .com so my other pages would work. Is this a penalty I should still be concernedabout given that I have the rel canonical tags? Please let me know your thoughts on thisMike
-
Thanks Everyone. I got the old url cheaptubesinc.com 301'd to cheaptubes.com this week. Of course network solutions left off the / after .com and before the page name so that only the home page would 301 and they could try to sell me more 301s, it cost $60 for 1 and I have 48 pages on my site. I called and emailed them all week and they kept saying they had done it right and and they couldn't force google to change the links. I then realized if I typed www.cheaptubesinc/graphene.htm that it didn't work because it 301'd to www.cheaptubes.comgraphene.htm. They were argumentative with me even though I was polite with them even though I didn't want to be. I finally got a tech on the phone who said he would add the slash and ask his boss for forgiveness. However given the history of having the domain parked and pointed before and that not working over time & now this, I think my best bet is to transfer my domains to someone else. I heard bluehost is good. My concern is if they were that unethical in our dealings and the boss was argumentative in emails than they could go in an remove the slash at any time.
I also found a ton of code errors right at the top of my pages. I now know it was from putting up temporary messages but not checking to make sure the code was clean. The woman I bought my them from (6.5 years ago I paid her $60 and she still helps me for free, what difference between her & NS) notice open H1's & P elements at the top of the pages. I was still ranking well for acronyms but missing out on the long keywords since last july which caused my sales to drop off. I figure I lost at least 150K in sales because I neglected my website and didn't clean up the code on my pages a painful lesson I won't soon forget. On tuesday, when I searched single walled carbon nanotubes I had to go 8 pages back in google to find my page. By week's end I was #8 on page 1 and ahead of sigma aldrich a major materials supplier.
Thank you so much for your help everyone, it is sincerely appreciated
Mike
-
Thank you Oleg - I did put the tag
into the head right below the robots & google bot code on every page. I mistakenly deleted some very old non updated pages. Thx to moz, i have a list of the pages and will contact hosting co to 301 it. I think I ultimately have to 301 each page. I had moz recrawl my site last night but it said it dropped from 100 duplicate content penalties to 89, an improvement but not the one I hoped for. I did have a client tell my the site was down today, contacted network solutions and they said it was up now but they had an outage last night. Perhaps it affected the moz recrawl,but I can't know that. I also want to change the names of the pages as an interim measure before I update the site to newer format. Should I create new optimized by name pages first and then get on the phone with tech support and 301 them all to the knew pages? seems logical but so did deleting old pages until moz couldn't find them, then i realized the bots will count it against me rather than the housekeeping that it was.
Mike
Mike
-
Read these two posts... they cover everything.
-
ok, got it, thank you so much Jane
-
Hi,
You don't need to redirect at all (with a 301 or otherwise) if the canonical tag is in place. So don't worry about that at all - both URLs can load together if the canonical tag points Google from the "duplicate" to the "correct / canonical" one. Sorry if that wasn't clear.
I am not sure the frequency of Moz's crawling or if you can force a refresh, I'm sorry.
-
Thx Jane
The problem is I can't simply 301 it because I'm not on apache. I can do the canonical tag. Of course I've already gone in and changed it over to the tag + refresh but server is down so it won't publish right now. I was trying to get it done ahead of moz crawling my site today. Is there a way to get moz to recrawl it after the changes are updated or do I need to wait another week?
-
Hi,
Hard to say, but it definitely won't have helped. As Bryan says, you've split authority between over twice the number of pages the site should have, and Google can take action against sites that produce a large amount of duplicate content. I'd get the canonical tags in place (and thoroughly check they're set up right, as it can be a mess if they're implemented incorrectly) and check on progress over two or three weeks. If you see nothing happen, I'd say your reason for dropping could be something else.
-
Hi again,
The canonical tag sounds like the right way to go for you.
Regarding the meta refresh method of redirection - this works perfectly for users... it was always the case that search engines did not honour this as a redirect though. This may have changed in the recent past (and realistically, it should have - a lot of people used this tactic for redirection and Google should understand that it shows a moved page). However, it is generally thought that the meta refresh does not pass all authority (as noted here), and this thread shows a Googler advising against it (this is a post from 2010 though).
Honestly, with the canonical tag, you don't need to do the refresh / redirection - this will take care of the issue
Cheers,
Jane
-
Hi there,
I'll answer these one at a time as there are a few responses to go through.
default.htm is the home page as created by the CMS, but you want to either use that URL or www.cheaptubes.com as the home page, not both.
The solution is a 301 or the canonical tag so that home page content does not appear on both URLs.
-
Hi Jane, Oleg, & Bryan
I checked with the woman who designed my theme (she is awesome). She offered the following suggestions which seem like the way to go for me. Are there any negatives that I'm not aware of with the options below?
Since you are still using FrontPage, just open your site, locate the appropriate pages, and type the following into the head area:
If you are on a Windows server, your web host can do the 301 redirect for you. You will tell them the name of the old pages and the name of the new pages and they will do the rest.
An easy alternate is for you to do the redirect yourself with an easy tag that goes into the head area of the old pages. This tag is called a redirect and redirects from the old page to the new one.
URL="http://www.newsite.com/newurl.html">
Google, Bing, and Yahoo all recognize the meta tag for the redirect and will adjust their indexing accordingly. I will usually leave an old page on the server for about 3 months to give the search engines time to catch up. Then I can delete the page.
You can, of course, get more "bang for your buck" by using both the canonical link and the meta refresh at the same time.
URL="http://www.newsite.com/newurl.html">
I like the last one, am going to try that unless you think its a flawed strategy.
Thanks for your help
Mike
-
Hi Jane
How do I change to canonical url's if I can't do a 301?
Mike
-
so how do I use the canonical tag since i can't 301 it?
-
It certainly could. Google sees the www. version as a 2nd website, so essentially you're splitting your 'ranking authority' between 2 webpages.
-
Thanks Oleg
I can't 301 because I'm not using apache, still on frontpage. I know its old, getting out my abacus now : )
-
To sum up...
- 301 redirect all non-www urls to www versions (since it has a higher page authority) and add canonicals to all pages with the www version of the url
- For all lower case / upper case page duplicates... pick one, set a canonical tag and 301 to the chosen case, make sure all your links point to the correct url case.
- 301 redirect default.htm to your root domain - http://www.cheaptubes.com
-
does the 2 versions problem help to explain why my sales started dropping significantly after the google july 4th update? I know there were some SEO penalties in that update. I also know a friendly competitor who saw a similar drop starting in early July.
-
Hi Jane
Thank you so much. I am reviewing the link you provided. I don't think I can 301 redirect because it is done in front page, not apache. I have tried for years to find another platform but failed. I spent years trying to figure out drupal, even ordered several books but no luck. I tried concrete 5 and just using HTML 5 editor like coffee cup. I keep struggling with getting them to work. I've bought themes to use but can't get them operational.
I thought default.htm was supposed to be the home page, is that incorrect?
Mike
-
Hi again,
Yep - your non-www and www pages are both resolving... e.g. http://cheaptubes.com/ and http://www.cheaptubes.com/ bring up the same content. Also, http://cheaptubes.com/default.htm and http://www.cheaptubes.com/default.htm is also a duplicate of the home page.
Internally, I am seeing the same thing, e.g. http://www.cheaptubes.com/carbon-nanotubes-prices.htm and http://cheaptubes.com/carbon-nanotubes-prices.htm - same page, one on the www subdomain ("www." is a subdomain like any other, just with an extremely common name) and one just sitting on the root.
The solution here is either to 301 redirect the non-www version of the site to the www version for every page, or to use the canonical tag to point from the non-preferred versions to the "canonical" versions. More information on this is available here.
You also have a situation where upper-case URLs will resolve as well as lower case ones, e.g. http://www.cheaptubes.com/SWNTs.htm and http://www.cheaptubes.com/swnts.htm (as well as http://cheaptubes.com/swnts.htm!).
URLs should only be allowed to resolve with one case, preferably lower. The upper / mixed case should 301 redirect to the proper version.
Essentially, the "two versions of the site" issue is the biggest problem, with all pages being available on at least two URLs - one with www and one without. There are other tidiness issues like /default.htm bringing up the home page as well.
Does this make sense? Let me know if this is not clear.
Best,
Jane
-
also on a page that moz ranks as an "F", I still rank high in organic results, see the results from when I searched for MWNTs below, I was 1st organic result. If long form, multi walled carbon nanotubes I fell to 6th or 7th but still on the first page.
-
Thank you Oleg, Bryan, & Jane. I am a rookie when it comes to web development but my pages always ranked well because enough Moz tips sunk in. See the alert from last weeks crawl below.
Pages with High Priority Issues
98Duplicate Page Content24XX (Client Error)If we look at the home page, it has 3 URLs, see belowURLPage AuthorityLinking Root DomainsExternal Link CountInternal Link CountStatus CodeDuplicate URLsDownload Duplicates
cheaptubes.com the source for carbon nanotubes home page
http://www.cheaptubes.com3322611882003 duplicates
cheaptubes.com the source for carbon nanotubes home page
http://www.cheaptubes.com/default.htm2422622001 of 3 duplicates
cheaptubes.com the source for carbon nanotubes home page
http://cheaptubes.com2931502002 of 3 duplicates
cheaptubes.com the source for carbon nanotubes home page
http://cheaptubes.com/default.htm2410462003 of 3 duplicates Does this help?-Mike
-
Hi there,
This could definitely be a case of both non-www and www URLs resolving, but I'd like to echo the guys above me and ask for more information - if you could share the actual examples, either on here or in a private message, it would be easier to find why Moz has found twice the number of URLs your site should have.
Thanks,
Jane
-
+Really need more information
If you have URLs constructed dynamically depending on where the user navigates from this could also be an issue, but I would expect more than 2x the pages.
-
Could you share more details?
What do the duplicate content examples look like? http vs https? www. vs non-www?
If the content is replicated on 2 domains, yes that is duplicate content and you should consolidate to one site via 301 redirects.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content question
Hi I have a site that is run off one CMS system but has 3 different web addresses. One is a comic shop, one is a toy shop and one is a game shop. Now due to the nature of what we are selling some of the products we are selling on both or all 3 of the sites. I was wondering as to whether this would affect my ability to rank in google and if i would be penalised for any duplicate content? Thanks in advance
On-Page Optimization | | TheZenAgency0 -
I have a lot of internal duplicate content as intros to a series of articles, is this bad?
On a site that I'm working on there is a series of posts with the same beginning to their titles. All of the titles start with Christ's Church ("Mormons"): And then about the first four paragraphs of all these posts is exactly the same, it is just explaining this series of posts. I'll link to a couple of examples so you know what I'm talking about. I know there are several other problems with these posts/site 🙂 but I am specifically curious about the partial duplicate title and the first few paragraphs being duplicate. http://www.mormonchurch.com/3259/christs-church-mormons-helping-out-a-friend http://www.mormonchurch.com/2969/christs-church-mormon-happiness-is-found-only-through-christ There are about 30 posts similar to these. Thank you, I look forward to your responses.
On-Page Optimization | | ThridHour1 -
Tags creating duplicated content issue?
Hello i believe a lot of us use tags in our blogs as a way to categorize content and make it easy searchable but this usually (at lease in my case) cause duplicate content creation. For example, if one article has 2 tags like "SEO" & "Marketing", then this article will be visible and listed in 2 urls inside the blog like this domain.com/blog/seo and domain.com/blog/marketing In case of a blog with 300+ posts and dozens of different tags this is creating a huge issue. My question is 1. Is this really bad? 2. If yes how to fix it without removing tags?
On-Page Optimization | | Lakiscy0 -
Duplicate content and the Moz bot
Hi Does our little friend at SEOmoz follow the same rules as the search engine bots when he crawls my site? He has sent thousands of errors back to me with duplicate content issues, but I thought I had removed these with nofollow etc. Can you advise please.
On-Page Optimization | | JamieHibbert0 -
Guest vs Logged In Content
Hi Mozzers I have a client that recently launched a q&a and he has the answers hidden by registering for free you can see the answers. It's a free community. Now the question comes: Google will not get the entire page only the question content which I think is bad. What option would solve the issue. Have thought about making the answers hidden through css... so if you're a guest the answers are display:none . But it has to be a better option than dirty things like this 🙂
On-Page Optimization | | mosaicpro0 -
Duplicate Page Content and Duplicate Page Title
Hi All, I'm new in SEOMoz and have some questions after I have already spend 2-3 days trying to resolve the problems identified from Crawling one of my clients websites. I get quite a lot of Duplicate Page Conntent and Page Titles warnings and trying to find a workaround through the forums and posts. I continuously get this error on most of my pages: URL: http://domain.com/benefits with the same Page but with a WWW in front URL: http://www.domain.com/benefits Any advice will be highly appreciated. Thanks, Athos
On-Page Optimization | | athosk0 -
Can duplicate content issues be solved with a noindex robot metatag?
Hi all I have a number of duplicate content issues arising from a recent crawl diagnostics report. Would using a robots meta tag (like below) on the pages I don't necessarily mind not being indexed be an effective way to solve the problem? Thanks for any / all replies
On-Page Optimization | | joeprice0 -
How woud you deal with Blog TAGS & CATEGORY listings that are marked a 'duplicate content' in SEOmoz campaign reports?
We're seeing "Duplicate Content" warnings / errors in some of our clients' sites for blog / event calendar tags and category listings. For example the link to http://www.aavawhistlerhotel.com/news/?category=1098 provides all event listings tagged to the category "Whistler Events". The Meta Title and Meta Description for the "Whistler Events" category is the same as another other category listing. We use Umbraco, a .NET CMS, and we're working on adding some custom programming within Umbraco to develop a unique Meta Title and Meta Description for each page using the tag and/or category and post date in each Meta field to make it more "unique". But my question is .... in the REAL WORLD will taking the time to create this programming really positively impact our overall site performance? I understand that while Google, BING, etc are constantly tweaking their algorithms as of now having duplicate content primarily means that this content won't get indexed and there won't be any really 'fatal' penalties for having this content on our site. If we don't find a way to generate unique Meta Titles and Meta Descriptions we could 'no-follow' these links (for tag and category pages) or just not use these within our blogs. I am confused about this. Any insight others have about this and recommendations on what action you would take is greatly appreciated.
On-Page Optimization | | RoyMcClean0