Duplicate content penalty
-
when moz crawls my site they say I have 2x the pages that I really have & they say I am being penalized for duplicate content. I know years ago I had my old domain resolve over to my new domain. Its the only thing that makes sense as to the duplicate content but would search engines really penalize me for that? It is technically only on 1 site. My business took a significant sales hit starting early July 2013, I know google did and algorithm update that did have SEO aspects. I need to resolve the problem so I can stay in business
-
Thx Jane- No I wasn't aware of that. I don't get it because I put canonical tags right under the Head and I used the code below to do it. I will check again but am unsure how to fix it
I don't even know how to fix coding on the Http://cheaptubes.com site. It seems like when I add content to the canonical site it updates all of them. Thx for pointing out errors, you are giving me something to fix and improve.
-
Hi again,
Are you aware that you have a canonical tag on http://cheaptubes.com that points to a non-existent URL? i.e. http://i.imgur.com/yEd2377.png
http://www.cheaptubes.com/default.html
If http://cheaptubes.com/ 301 redirected to http://www.cheaptubes.com/, this would resolve the issue.
Are you aware that the www version of your site shows for a brand search (https://www.google.co.uk/search?q=cheaptubes.com&oq=cheaptubes.com&aqs=chrome..69i58j69i60l2j69i57j69i60j0.3367j0j4&sourceid=chrome&es_sm=91&ie=UTF-8) but that the canonical tags on each page point to the non-www version, e.g. http://i.imgur.com/P7Tizsv.png and http://i.imgur.com/lhTA95w.png?
The canonical tag on the www.cheaptubes.com/ page also points to http://www.cheaptubes.com/default.html. Sorry to show so many errors, but it doesn't look like canonicalisation has been implemented properly here.
-
Thx Jane - I may not have put a canonical tag on that page yet but its the same for every page. I can't access the http://cheaptubes.com but I can access the canonical version to publish. I did put canonical tags on most of my other pages such as the SWNTs page but it still shows a non canonical version when moz crawls it. Perhaps a 301 from http://cheaptubes.com to the canonical page? I'm just not sure how to handle it.
-
Hi Mike,
Are you saying that there is a canonical tag on http://cheaptubes.com/cntmaterialsafetydatasheet.htm, pointing to http://www.cheaptubes.com/cntmaterialsafetydatasheet.htm? This would solve the duplicate content problem, but I do not see a canonical tag on either of those pages...
-
Hi Everyone - I'm hoping you can help me out again. I have a functional 301 on cheaptubesinc.com. that cleared about 1/2 my dup content penalty on the moz crawl this week. As you can see in the results below, I still have 57 pages with dup content according to Moz.
57 Duplicate Page Content
13 4XX (Client Error)
57 Duplicate Page TitleI checked and I think it is mostly a canonical problem. I do have Rel Canonical tags on all my pages. When I clicked on the 1st one it appears that is the case, see below
cheaptubes.com carbon nanotubes msds
http://www.cheaptubes.com/cntmaterialsafetydatasheet.htm29414872001 duplicate
cheaptubes.com carbon nanotubes msds
http://cheaptubes.com/cntmaterialsafetydatasheet.htm25My question is, do I need another 301 from http://cheaptubes.com to the canonical version? I'ld rather not since I had to fight with network solutions for a week for them to add the / after .com so my other pages would work. Is this a penalty I should still be concernedabout given that I have the rel canonical tags? Please let me know your thoughts on thisMike
-
Thanks Everyone. I got the old url cheaptubesinc.com 301'd to cheaptubes.com this week. Of course network solutions left off the / after .com and before the page name so that only the home page would 301 and they could try to sell me more 301s, it cost $60 for 1 and I have 48 pages on my site. I called and emailed them all week and they kept saying they had done it right and and they couldn't force google to change the links. I then realized if I typed www.cheaptubesinc/graphene.htm that it didn't work because it 301'd to www.cheaptubes.comgraphene.htm. They were argumentative with me even though I was polite with them even though I didn't want to be. I finally got a tech on the phone who said he would add the slash and ask his boss for forgiveness. However given the history of having the domain parked and pointed before and that not working over time & now this, I think my best bet is to transfer my domains to someone else. I heard bluehost is good. My concern is if they were that unethical in our dealings and the boss was argumentative in emails than they could go in an remove the slash at any time.
I also found a ton of code errors right at the top of my pages. I now know it was from putting up temporary messages but not checking to make sure the code was clean. The woman I bought my them from (6.5 years ago I paid her $60 and she still helps me for free, what difference between her & NS) notice open H1's & P elements at the top of the pages. I was still ranking well for acronyms but missing out on the long keywords since last july which caused my sales to drop off. I figure I lost at least 150K in sales because I neglected my website and didn't clean up the code on my pages a painful lesson I won't soon forget. On tuesday, when I searched single walled carbon nanotubes I had to go 8 pages back in google to find my page. By week's end I was #8 on page 1 and ahead of sigma aldrich a major materials supplier.
Thank you so much for your help everyone, it is sincerely appreciated
Mike
-
Thank you Oleg - I did put the tag
into the head right below the robots & google bot code on every page. I mistakenly deleted some very old non updated pages. Thx to moz, i have a list of the pages and will contact hosting co to 301 it. I think I ultimately have to 301 each page. I had moz recrawl my site last night but it said it dropped from 100 duplicate content penalties to 89, an improvement but not the one I hoped for. I did have a client tell my the site was down today, contacted network solutions and they said it was up now but they had an outage last night. Perhaps it affected the moz recrawl,but I can't know that. I also want to change the names of the pages as an interim measure before I update the site to newer format. Should I create new optimized by name pages first and then get on the phone with tech support and 301 them all to the knew pages? seems logical but so did deleting old pages until moz couldn't find them, then i realized the bots will count it against me rather than the housekeeping that it was.
Mike
Mike
-
Read these two posts... they cover everything.
-
ok, got it, thank you so much Jane
-
Hi,
You don't need to redirect at all (with a 301 or otherwise) if the canonical tag is in place. So don't worry about that at all - both URLs can load together if the canonical tag points Google from the "duplicate" to the "correct / canonical" one. Sorry if that wasn't clear.
I am not sure the frequency of Moz's crawling or if you can force a refresh, I'm sorry.
-
Thx Jane
The problem is I can't simply 301 it because I'm not on apache. I can do the canonical tag. Of course I've already gone in and changed it over to the tag + refresh but server is down so it won't publish right now. I was trying to get it done ahead of moz crawling my site today. Is there a way to get moz to recrawl it after the changes are updated or do I need to wait another week?
-
Hi,
Hard to say, but it definitely won't have helped. As Bryan says, you've split authority between over twice the number of pages the site should have, and Google can take action against sites that produce a large amount of duplicate content. I'd get the canonical tags in place (and thoroughly check they're set up right, as it can be a mess if they're implemented incorrectly) and check on progress over two or three weeks. If you see nothing happen, I'd say your reason for dropping could be something else.
-
Hi again,
The canonical tag sounds like the right way to go for you.
Regarding the meta refresh method of redirection - this works perfectly for users... it was always the case that search engines did not honour this as a redirect though. This may have changed in the recent past (and realistically, it should have - a lot of people used this tactic for redirection and Google should understand that it shows a moved page). However, it is generally thought that the meta refresh does not pass all authority (as noted here), and this thread shows a Googler advising against it (this is a post from 2010 though).
Honestly, with the canonical tag, you don't need to do the refresh / redirection - this will take care of the issue
Cheers,
Jane
-
Hi there,
I'll answer these one at a time as there are a few responses to go through.
default.htm is the home page as created by the CMS, but you want to either use that URL or www.cheaptubes.com as the home page, not both.
The solution is a 301 or the canonical tag so that home page content does not appear on both URLs.
-
Hi Jane, Oleg, & Bryan
I checked with the woman who designed my theme (she is awesome). She offered the following suggestions which seem like the way to go for me. Are there any negatives that I'm not aware of with the options below?
Since you are still using FrontPage, just open your site, locate the appropriate pages, and type the following into the head area:
If you are on a Windows server, your web host can do the 301 redirect for you. You will tell them the name of the old pages and the name of the new pages and they will do the rest.
An easy alternate is for you to do the redirect yourself with an easy tag that goes into the head area of the old pages. This tag is called a redirect and redirects from the old page to the new one.
URL="http://www.newsite.com/newurl.html">
Google, Bing, and Yahoo all recognize the meta tag for the redirect and will adjust their indexing accordingly. I will usually leave an old page on the server for about 3 months to give the search engines time to catch up. Then I can delete the page.
You can, of course, get more "bang for your buck" by using both the canonical link and the meta refresh at the same time.
URL="http://www.newsite.com/newurl.html">
I like the last one, am going to try that unless you think its a flawed strategy.
Thanks for your help
Mike
-
Hi Jane
How do I change to canonical url's if I can't do a 301?
Mike
-
so how do I use the canonical tag since i can't 301 it?
-
It certainly could. Google sees the www. version as a 2nd website, so essentially you're splitting your 'ranking authority' between 2 webpages.
-
Thanks Oleg
I can't 301 because I'm not using apache, still on frontpage. I know its old, getting out my abacus now : )
-
To sum up...
- 301 redirect all non-www urls to www versions (since it has a higher page authority) and add canonicals to all pages with the www version of the url
- For all lower case / upper case page duplicates... pick one, set a canonical tag and 301 to the chosen case, make sure all your links point to the correct url case.
- 301 redirect default.htm to your root domain - http://www.cheaptubes.com
-
does the 2 versions problem help to explain why my sales started dropping significantly after the google july 4th update? I know there were some SEO penalties in that update. I also know a friendly competitor who saw a similar drop starting in early July.
-
Hi Jane
Thank you so much. I am reviewing the link you provided. I don't think I can 301 redirect because it is done in front page, not apache. I have tried for years to find another platform but failed. I spent years trying to figure out drupal, even ordered several books but no luck. I tried concrete 5 and just using HTML 5 editor like coffee cup. I keep struggling with getting them to work. I've bought themes to use but can't get them operational.
I thought default.htm was supposed to be the home page, is that incorrect?
Mike
-
Hi again,
Yep - your non-www and www pages are both resolving... e.g. http://cheaptubes.com/ and http://www.cheaptubes.com/ bring up the same content. Also, http://cheaptubes.com/default.htm and http://www.cheaptubes.com/default.htm is also a duplicate of the home page.
Internally, I am seeing the same thing, e.g. http://www.cheaptubes.com/carbon-nanotubes-prices.htm and http://cheaptubes.com/carbon-nanotubes-prices.htm - same page, one on the www subdomain ("www." is a subdomain like any other, just with an extremely common name) and one just sitting on the root.
The solution here is either to 301 redirect the non-www version of the site to the www version for every page, or to use the canonical tag to point from the non-preferred versions to the "canonical" versions. More information on this is available here.
You also have a situation where upper-case URLs will resolve as well as lower case ones, e.g. http://www.cheaptubes.com/SWNTs.htm and http://www.cheaptubes.com/swnts.htm (as well as http://cheaptubes.com/swnts.htm!).
URLs should only be allowed to resolve with one case, preferably lower. The upper / mixed case should 301 redirect to the proper version.
Essentially, the "two versions of the site" issue is the biggest problem, with all pages being available on at least two URLs - one with www and one without. There are other tidiness issues like /default.htm bringing up the home page as well.
Does this make sense? Let me know if this is not clear.
Best,
Jane
-
also on a page that moz ranks as an "F", I still rank high in organic results, see the results from when I searched for MWNTs below, I was 1st organic result. If long form, multi walled carbon nanotubes I fell to 6th or 7th but still on the first page.
-
Thank you Oleg, Bryan, & Jane. I am a rookie when it comes to web development but my pages always ranked well because enough Moz tips sunk in. See the alert from last weeks crawl below.
Pages with High Priority Issues
98Duplicate Page Content24XX (Client Error)If we look at the home page, it has 3 URLs, see belowURLPage AuthorityLinking Root DomainsExternal Link CountInternal Link CountStatus CodeDuplicate URLsDownload Duplicates
cheaptubes.com the source for carbon nanotubes home page
http://www.cheaptubes.com3322611882003 duplicates
cheaptubes.com the source for carbon nanotubes home page
http://www.cheaptubes.com/default.htm2422622001 of 3 duplicates
cheaptubes.com the source for carbon nanotubes home page
http://cheaptubes.com2931502002 of 3 duplicates
cheaptubes.com the source for carbon nanotubes home page
http://cheaptubes.com/default.htm2410462003 of 3 duplicates Does this help?-Mike
-
Hi there,
This could definitely be a case of both non-www and www URLs resolving, but I'd like to echo the guys above me and ask for more information - if you could share the actual examples, either on here or in a private message, it would be easier to find why Moz has found twice the number of URLs your site should have.
Thanks,
Jane
-
+Really need more information
If you have URLs constructed dynamically depending on where the user navigates from this could also be an issue, but I would expect more than 2x the pages.
-
Could you share more details?
What do the duplicate content examples look like? http vs https? www. vs non-www?
If the content is replicated on 2 domains, yes that is duplicate content and you should consolidate to one site via 301 redirects.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Page content | What to do?
Hello Guys, I have some duplicate pages detected by MOZ. Most of the URL´s are from a registracion process for users, so the URL´s are all like this: www.exemple.com/user/login?destination=node/125%23comment-form What should I do? Add this to robot txt? If so how? Whats the command to add in Google Webmaster? Thanks in advance! Pedro Pereira
On-Page Optimization | | Kalitenko20140 -
Duplicate Issue
Hello Mozzers! We have a client going through a website revamp. The client is The Michelangelo Hotel, and they are part of Star Hotels. Star Hotels plans to create a section on their site for The Michelangelo, as opposed to maintaining a stand alone site. They will then take the michelangelohotel.com domain, and point it to the corresponding pages on the Star site. The guest will key in www.michelangelohotel.com, and will see the same content that can be found on www.starhotel.com/en/michelangelo-hotel-new-york. The problem we have is this: Essentially the same content will be indexed twice, once on starhotels.com and once on michelangelohotel.com. This would seem to cause a duplicate content issue. What are your thoughts? Edit: I apologize, because I was not nearly clear enough here. The Star Hotels site will have 5 pages dedicated to The Michelangelo Hotel. The content will sit solely on that server as those 5 pages. Those 5 pages will each be indexed as 2 URLs. www.michelangelohotel.com <-> www.starhotels.com/en/michelangelo/ www.michelangelohotel.com/accommodations <-> www.starhotels.com/en/michelangelo/accommodations And so on. Thanks!
On-Page Optimization | | FrankSweeney0 -
A Lot of Duplicate Meta Descriptions
Hi Everyone Its polesandblinds.com newbie here again, I've Just been in Webmaster tools and see my site has: 278 Duplicate Meta Descriptions 13 Long Meta Descriptions 4 Short Meta Descriptions 304 Duplicate Title Tags We are Using Magento version. 1.6.0.0, It seems that our description content which is unique per page (200 to 300 words) is somehow getting put into the meta description, which I confirmed when I viewed the source code, however its not showing in the meta description boxes in magento. I would say its been like this since the re-launch of the site in December 2012. example page http://www.polesandblinds.com/curtain-tracks/ view-source:http://www.polesandblinds.com/curtain-tracks/ My BIG worry is how Google views this and what effects it may have or had on a site. My site has taken quite a big hit on rankings since the Google updates. I'm really looking forward to your responses good or bad. Many Thanks Jonathan
On-Page Optimization | | JonnytheB0 -
E-commerce site product descriptions and duplicate content
Hi everyone. I'm developing an e-commerce site using Prestashop and concerned about the issue of duplicate content among product descriptions. My main concerns are: If there are 500 or more products and those product descriptions are obtained from a manufacturer or supplier's website hence running into external duplicate content issues. Internal duplicate content is also an issue, if there are multiple similar products and each product has the same description across several pages. What would be the best approach to eliminate the possibility of incurring a duplicate content penalty due to similar product descriptions? I've already considered the suggestion of noindex-ing the complete range of products to help protect from duplicate content penalties and having unique articles written in the site blog discussing products instead linking to certain products on the site. Another consideration I had was noindex-ing all product pages except pages for featured products in the store and rewriting descriptions for a set amount of those featured products regularly (this will still have the problem of internal duplicate content across pages if similar product descriptions are rewritten). The product range is intended to be very large so I'm really seeking an alternative solution from the insane task of rewriting many product descriptions. Any suggestions to make SEO work efficient are very much welcome and appreciated. Thank you!
On-Page Optimization | | valuepets0 -
How do you avoid duplicate content when you sell products that are produced by other manufacturers?
I have a packaging product site, and they sell products from various manufacturers. What can we do with the product detail pages? As of now, the client has copy pasted content straight from the "About" sections on the manufacturers' sites. Obviously, those manufacturers want my client to sell their products, and the products need to be described. How much of a no-no is this copy pasting, and how can I fix it?
On-Page Optimization | | lhc670 -
Duplicate Content
We offer Wellness programs for dogs and cats. A lot of the information is the same except for specifics that relate to young vs. senior pets. I have these different pages: Senior Wellness Kitten Wellness Puppy Wellness Adult Wellness Can each page have approx. 75% of the same text? Or should I rewrite each page so the information (though the same) appears unique.
On-Page Optimization | | PMC-3120870 -
Archetecture to avoid content duplicate
Hi, I have lots of duplicate stuff and I need a better site architecture. http://www.furnacefilterscanada.com/ We are selling furnace filters. All furnace filters are sold in 50 different sizes, each sizes comes in 3 different qualities, Bronze, Silver and Gold. Total: 150 products. Right now I have created many categories and subcategories for furnace filters sizes. When the client pickup is sizes, he will end-up to the products page with 3 different options, Bronze, Silver and Gold. They can then compare the filter a select the one he wants to purchase. The problem is, it is not possible to provide different content for each filters, Gold has a description, Silver has another one and also Bronze. The only text that will change in the descriptions, is the filter size. This makes Duplicates text description. Not good when you what to index your site. The positive things to 150 different products, is the page title. example 16x25x4 furnace filters. Those exacte tem get search in Google. A new site architecture with 3 categories, Gold, Silver and Bronze & 50 variables by products (filters sizes) might not be the best options, because no filter size will be index. Can you please help me to find the best architecture in a SEO point of view? Also what about the top navigation bar menu, what is the best options in using it? Right now it is use for Legal, Contact, Policy and I fill it is a wast, those page only get less then 1% clicks. It might be more convenient to use those for categories for example, what is your recommendations in a SEO point of view? Can I create a information page in the left navigation menu and includ all the standard page, like: Policy, Legal ... If I do, will I get penalize by Google? Thank you for your help. We have puts lots of money in AdWords before, but now the next step is to come home organics. I'm using SEOmoz tools, read there new book, and I want increase traffic. I just need your help. Thank you, BigBlaze
On-Page Optimization | | BigBlaze2050 -
Duplicate content issue with dynamically generated url
Hi, For those who have followed my previous question, I have a similar one regarding dynamically generated urls. From this page http://www.selectcaribbean.com/listing.html the user can make a selection according to various criteria. 6 results are presented and then the user can go to the next page. I know I should probably rewrite url's such as these: http://www.selectcaribbean.com/listing.html?pageNo=1&selType=&selCity=&selPrice=&selBeds=&selTrad=&selMod=&selOcean= but since all the results presented are basically generated on the fly for the convenience of the user, I am afraid google my consider this as an attempt to generate more pages as there are pages for each individual listing. What is my solution for this? Nofollow these pages? Block them thru robots txt?
On-Page Optimization | | multilang0