Duplicate content penalty
-
when moz crawls my site they say I have 2x the pages that I really have & they say I am being penalized for duplicate content. I know years ago I had my old domain resolve over to my new domain. Its the only thing that makes sense as to the duplicate content but would search engines really penalize me for that? It is technically only on 1 site. My business took a significant sales hit starting early July 2013, I know google did and algorithm update that did have SEO aspects. I need to resolve the problem so I can stay in business
-
Thx Jane- No I wasn't aware of that. I don't get it because I put canonical tags right under the Head and I used the code below to do it. I will check again but am unsure how to fix it
I don't even know how to fix coding on the Http://cheaptubes.com site. It seems like when I add content to the canonical site it updates all of them. Thx for pointing out errors, you are giving me something to fix and improve.
-
Hi again,
Are you aware that you have a canonical tag on http://cheaptubes.com that points to a non-existent URL? i.e. http://i.imgur.com/yEd2377.png
http://www.cheaptubes.com/default.html
If http://cheaptubes.com/ 301 redirected to http://www.cheaptubes.com/, this would resolve the issue.
Are you aware that the www version of your site shows for a brand search (https://www.google.co.uk/search?q=cheaptubes.com&oq=cheaptubes.com&aqs=chrome..69i58j69i60l2j69i57j69i60j0.3367j0j4&sourceid=chrome&es_sm=91&ie=UTF-8) but that the canonical tags on each page point to the non-www version, e.g. http://i.imgur.com/P7Tizsv.png and http://i.imgur.com/lhTA95w.png?
The canonical tag on the www.cheaptubes.com/ page also points to http://www.cheaptubes.com/default.html. Sorry to show so many errors, but it doesn't look like canonicalisation has been implemented properly here.
-
Thx Jane - I may not have put a canonical tag on that page yet but its the same for every page. I can't access the http://cheaptubes.com but I can access the canonical version to publish. I did put canonical tags on most of my other pages such as the SWNTs page but it still shows a non canonical version when moz crawls it. Perhaps a 301 from http://cheaptubes.com to the canonical page? I'm just not sure how to handle it.
-
Hi Mike,
Are you saying that there is a canonical tag on http://cheaptubes.com/cntmaterialsafetydatasheet.htm, pointing to http://www.cheaptubes.com/cntmaterialsafetydatasheet.htm? This would solve the duplicate content problem, but I do not see a canonical tag on either of those pages...
-
Hi Everyone - I'm hoping you can help me out again. I have a functional 301 on cheaptubesinc.com. that cleared about 1/2 my dup content penalty on the moz crawl this week. As you can see in the results below, I still have 57 pages with dup content according to Moz.
57 Duplicate Page Content
13 4XX (Client Error)
57 Duplicate Page TitleI checked and I think it is mostly a canonical problem. I do have Rel Canonical tags on all my pages. When I clicked on the 1st one it appears that is the case, see below
cheaptubes.com carbon nanotubes msds
http://www.cheaptubes.com/cntmaterialsafetydatasheet.htm29414872001 duplicate
cheaptubes.com carbon nanotubes msds
http://cheaptubes.com/cntmaterialsafetydatasheet.htm25My question is, do I need another 301 from http://cheaptubes.com to the canonical version? I'ld rather not since I had to fight with network solutions for a week for them to add the / after .com so my other pages would work. Is this a penalty I should still be concernedabout given that I have the rel canonical tags? Please let me know your thoughts on thisMike
-
Thanks Everyone. I got the old url cheaptubesinc.com 301'd to cheaptubes.com this week. Of course network solutions left off the / after .com and before the page name so that only the home page would 301 and they could try to sell me more 301s, it cost $60 for 1 and I have 48 pages on my site. I called and emailed them all week and they kept saying they had done it right and and they couldn't force google to change the links. I then realized if I typed www.cheaptubesinc/graphene.htm that it didn't work because it 301'd to www.cheaptubes.comgraphene.htm. They were argumentative with me even though I was polite with them even though I didn't want to be. I finally got a tech on the phone who said he would add the slash and ask his boss for forgiveness. However given the history of having the domain parked and pointed before and that not working over time & now this, I think my best bet is to transfer my domains to someone else. I heard bluehost is good. My concern is if they were that unethical in our dealings and the boss was argumentative in emails than they could go in an remove the slash at any time.
I also found a ton of code errors right at the top of my pages. I now know it was from putting up temporary messages but not checking to make sure the code was clean. The woman I bought my them from (6.5 years ago I paid her $60 and she still helps me for free, what difference between her & NS) notice open H1's & P elements at the top of the pages. I was still ranking well for acronyms but missing out on the long keywords since last july which caused my sales to drop off. I figure I lost at least 150K in sales because I neglected my website and didn't clean up the code on my pages a painful lesson I won't soon forget. On tuesday, when I searched single walled carbon nanotubes I had to go 8 pages back in google to find my page. By week's end I was #8 on page 1 and ahead of sigma aldrich a major materials supplier.
Thank you so much for your help everyone, it is sincerely appreciated
Mike
-
Thank you Oleg - I did put the tag
into the head right below the robots & google bot code on every page. I mistakenly deleted some very old non updated pages. Thx to moz, i have a list of the pages and will contact hosting co to 301 it. I think I ultimately have to 301 each page. I had moz recrawl my site last night but it said it dropped from 100 duplicate content penalties to 89, an improvement but not the one I hoped for. I did have a client tell my the site was down today, contacted network solutions and they said it was up now but they had an outage last night. Perhaps it affected the moz recrawl,but I can't know that. I also want to change the names of the pages as an interim measure before I update the site to newer format. Should I create new optimized by name pages first and then get on the phone with tech support and 301 them all to the knew pages? seems logical but so did deleting old pages until moz couldn't find them, then i realized the bots will count it against me rather than the housekeeping that it was.
Mike
Mike
-
Read these two posts... they cover everything.
-
ok, got it, thank you so much Jane
-
Hi,
You don't need to redirect at all (with a 301 or otherwise) if the canonical tag is in place. So don't worry about that at all - both URLs can load together if the canonical tag points Google from the "duplicate" to the "correct / canonical" one. Sorry if that wasn't clear.
I am not sure the frequency of Moz's crawling or if you can force a refresh, I'm sorry.
-
Thx Jane
The problem is I can't simply 301 it because I'm not on apache. I can do the canonical tag. Of course I've already gone in and changed it over to the tag + refresh but server is down so it won't publish right now. I was trying to get it done ahead of moz crawling my site today. Is there a way to get moz to recrawl it after the changes are updated or do I need to wait another week?
-
Hi,
Hard to say, but it definitely won't have helped. As Bryan says, you've split authority between over twice the number of pages the site should have, and Google can take action against sites that produce a large amount of duplicate content. I'd get the canonical tags in place (and thoroughly check they're set up right, as it can be a mess if they're implemented incorrectly) and check on progress over two or three weeks. If you see nothing happen, I'd say your reason for dropping could be something else.
-
Hi again,
The canonical tag sounds like the right way to go for you.
Regarding the meta refresh method of redirection - this works perfectly for users... it was always the case that search engines did not honour this as a redirect though. This may have changed in the recent past (and realistically, it should have - a lot of people used this tactic for redirection and Google should understand that it shows a moved page). However, it is generally thought that the meta refresh does not pass all authority (as noted here), and this thread shows a Googler advising against it (this is a post from 2010 though).
Honestly, with the canonical tag, you don't need to do the refresh / redirection - this will take care of the issue
Cheers,
Jane
-
Hi there,
I'll answer these one at a time as there are a few responses to go through.
default.htm is the home page as created by the CMS, but you want to either use that URL or www.cheaptubes.com as the home page, not both.
The solution is a 301 or the canonical tag so that home page content does not appear on both URLs.
-
Hi Jane, Oleg, & Bryan
I checked with the woman who designed my theme (she is awesome). She offered the following suggestions which seem like the way to go for me. Are there any negatives that I'm not aware of with the options below?
Since you are still using FrontPage, just open your site, locate the appropriate pages, and type the following into the head area:
If you are on a Windows server, your web host can do the 301 redirect for you. You will tell them the name of the old pages and the name of the new pages and they will do the rest.
An easy alternate is for you to do the redirect yourself with an easy tag that goes into the head area of the old pages. This tag is called a redirect and redirects from the old page to the new one.
URL="http://www.newsite.com/newurl.html">
Google, Bing, and Yahoo all recognize the meta tag for the redirect and will adjust their indexing accordingly. I will usually leave an old page on the server for about 3 months to give the search engines time to catch up. Then I can delete the page.
You can, of course, get more "bang for your buck" by using both the canonical link and the meta refresh at the same time.
URL="http://www.newsite.com/newurl.html">
I like the last one, am going to try that unless you think its a flawed strategy.
Thanks for your help
Mike
-
Hi Jane
How do I change to canonical url's if I can't do a 301?
Mike
-
so how do I use the canonical tag since i can't 301 it?
-
It certainly could. Google sees the www. version as a 2nd website, so essentially you're splitting your 'ranking authority' between 2 webpages.
-
Thanks Oleg
I can't 301 because I'm not using apache, still on frontpage. I know its old, getting out my abacus now : )
-
To sum up...
- 301 redirect all non-www urls to www versions (since it has a higher page authority) and add canonicals to all pages with the www version of the url
- For all lower case / upper case page duplicates... pick one, set a canonical tag and 301 to the chosen case, make sure all your links point to the correct url case.
- 301 redirect default.htm to your root domain - http://www.cheaptubes.com
-
does the 2 versions problem help to explain why my sales started dropping significantly after the google july 4th update? I know there were some SEO penalties in that update. I also know a friendly competitor who saw a similar drop starting in early July.
-
Hi Jane
Thank you so much. I am reviewing the link you provided. I don't think I can 301 redirect because it is done in front page, not apache. I have tried for years to find another platform but failed. I spent years trying to figure out drupal, even ordered several books but no luck. I tried concrete 5 and just using HTML 5 editor like coffee cup. I keep struggling with getting them to work. I've bought themes to use but can't get them operational.
I thought default.htm was supposed to be the home page, is that incorrect?
Mike
-
Hi again,
Yep - your non-www and www pages are both resolving... e.g. http://cheaptubes.com/ and http://www.cheaptubes.com/ bring up the same content. Also, http://cheaptubes.com/default.htm and http://www.cheaptubes.com/default.htm is also a duplicate of the home page.
Internally, I am seeing the same thing, e.g. http://www.cheaptubes.com/carbon-nanotubes-prices.htm and http://cheaptubes.com/carbon-nanotubes-prices.htm - same page, one on the www subdomain ("www." is a subdomain like any other, just with an extremely common name) and one just sitting on the root.
The solution here is either to 301 redirect the non-www version of the site to the www version for every page, or to use the canonical tag to point from the non-preferred versions to the "canonical" versions. More information on this is available here.
You also have a situation where upper-case URLs will resolve as well as lower case ones, e.g. http://www.cheaptubes.com/SWNTs.htm and http://www.cheaptubes.com/swnts.htm (as well as http://cheaptubes.com/swnts.htm!).
URLs should only be allowed to resolve with one case, preferably lower. The upper / mixed case should 301 redirect to the proper version.
Essentially, the "two versions of the site" issue is the biggest problem, with all pages being available on at least two URLs - one with www and one without. There are other tidiness issues like /default.htm bringing up the home page as well.
Does this make sense? Let me know if this is not clear.
Best,
Jane
-
also on a page that moz ranks as an "F", I still rank high in organic results, see the results from when I searched for MWNTs below, I was 1st organic result. If long form, multi walled carbon nanotubes I fell to 6th or 7th but still on the first page.
-
Thank you Oleg, Bryan, & Jane. I am a rookie when it comes to web development but my pages always ranked well because enough Moz tips sunk in. See the alert from last weeks crawl below.
Pages with High Priority Issues
98Duplicate Page Content24XX (Client Error)If we look at the home page, it has 3 URLs, see belowURLPage AuthorityLinking Root DomainsExternal Link CountInternal Link CountStatus CodeDuplicate URLsDownload Duplicates
cheaptubes.com the source for carbon nanotubes home page
http://www.cheaptubes.com3322611882003 duplicates
cheaptubes.com the source for carbon nanotubes home page
http://www.cheaptubes.com/default.htm2422622001 of 3 duplicates
cheaptubes.com the source for carbon nanotubes home page
http://cheaptubes.com2931502002 of 3 duplicates
cheaptubes.com the source for carbon nanotubes home page
http://cheaptubes.com/default.htm2410462003 of 3 duplicates Does this help?-Mike
-
Hi there,
This could definitely be a case of both non-www and www URLs resolving, but I'd like to echo the guys above me and ask for more information - if you could share the actual examples, either on here or in a private message, it would be easier to find why Moz has found twice the number of URLs your site should have.
Thanks,
Jane
-
+Really need more information
If you have URLs constructed dynamically depending on where the user navigates from this could also be an issue, but I would expect more than 2x the pages.
-
Could you share more details?
What do the duplicate content examples look like? http vs https? www. vs non-www?
If the content is replicated on 2 domains, yes that is duplicate content and you should consolidate to one site via 301 redirects.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How do I fix my portfolio causing duplicate content issues?
Hi, Im new to this whole duplicate content issue. I have a website, fatcatpaperie.com that I use the portofolio feature in Wordpress as my gallery for all my wedding invitations. I have a ton of duplicate content issues from this. I don't understand at all how to fix this. I'd appreciate any help! Below is an example of one duplicate content issue. They have slightly different names, different urls, different images and all have no text. But are coming up as duplicates. Would it be as easy as putting a different metadescription for each?? Thanks for the help! Rena | "Treasure" by Designers Fine Press - Fat Cat Paperie http://fatcatpaperie.com/portfolio-item/treasure-designers-fine-press 1 0 0 0 200 3 duplicates "Perennial" by Designers Fine Press - Fat Cat Paperie http://fatcatpaperie.com/portfolio-item/perennial-by-designers-fine-press 1 0 0 0 200 1 of 3 duplicates "Primrose" by Designers Fine Press - Fat Cat Paperie http://fatcatpaperie.com/portfolio-item/8675 1 0 0 0 200 2 of 3 duplicates "Catalina" by Designers Fine Press - Fat Cat Paperie http://fatcatpaperie.com/portfolio-item/catalina-designers-fine-press |
On-Page Optimization | | HonestSEOStudio0 -
Duplicate content issues - page content and store URLs
Hi, I'm experiencing some heavy duplicate content Crawl errors on Moz with www.redrockdecals.com and therefore I really need some help. It brings up different connections between products and I'm having a hard time figuring out what it means. It is listing the same products as duplicate content but they have different URL endings. For example:http://www.redrockdecals.com/car-graphics/chevrolet-silverado?___store=nl&___from_store=us
On-Page Optimization | | speedbird1229
&
http://www.redrockdecals.com/car-graphics/chevrolet-silverado?___store=d&___from_store=us It seems like Moz considers the copy-pasted parts in the Full Description (scrolled a bit down on product pages) as Duplicate Content. For example the general text found on this page: http://www.redrockdecals.com/caution-tow-limited-turning-radius-decal Or this page: http://www.redrockdecals.com/if-you-don-t-succeed-first-time-then-skydiving-isn-t-for-you-bumper-sticker I am planning to write new and unique descriptions for all products but what do you suggest - should I either remove the long same descriptions or just shorten them perhaps so they don't outweigh the short but unique descriptions above? I've heard search engines understand that some parts of the page can be same on other pages but I wonder if in my case this has gone too deep... Thanks so much!0 -
Duplicate Content: Snippets from Blog on Website
Our company has a website and a blog, each on a separate domain. On our website's home page, we include snippets from some of our blog posts and links to such posts. The snippets include the same photo, title, and first line or so of text. We have been told and am concerned that search engines will categorize this as duplicate content. Any recommendation as to how we can get around this issue aside from not including the blog posts on our site? I imagine we are not the first company to want to do this. www.angelicolaw.com Thanks,
On-Page Optimization | | BrazilLaw
Greg0 -
Content Update
Hello, If I update the existing content i.e.I added some content to the already existing indexed content in a post,how will it effect SEO wise? Venkee
On-Page Optimization | | Venkee0 -
Duplicate Content - Potential Issue.
Hello, here we go again, If I write an article somewhere, lets say Squidoo for instance, then post it to my blog on my website will google see this as duplicate content and probably credit Squidoo for it or is there soemthing I can do to prevent this, maybe a linkk back to Squidoo from my website or a dontfollow on my website? Im not sure so any help here would be great, Also If I use other peoples material in my blog and link back to them, obviously I dont want the credit for the original material I am simply collating some of this on my blog for others to have a specific library if you like. Is this going to damage my websites reputation? Thanks again peeps. Craig Fenton IT
On-Page Optimization | | craigyboy0 -
How to avoid content duplication of my websites
Hello, We are having 4 domains abc.com, abc.in, abc.co.uk, abc.com.au with same content and same inner pages (abc.com/page1, abc.in/page1 etc.) targeting on different geographical areas. How can we avoid duplicate content issue in the home page as well as in the inner pages. Abc.com is the major site. Thank you.
On-Page Optimization | | semvibe0 -
Home Page Content - In a Div?
Is putting content in a div so it doesn't muck up the look of the home page create a problem in doing well organically? Example - http://www.callawaygardens.com. We have lots of clients that want no text on the home page and we are trying to figure out how to do this while still ranking well organically. What are your thoughts? Can we get in trouble? Are there negative impacts with SEO doing it like this? Thank you!
On-Page Optimization | | RezStream80 -
Duplicate Title question
Thanks Mozzers in advance for any insight into what I'm sure is a basic SEO question. I'm working with a resort in the great state of Maine. Their home page title reads Maine Resorts, Resorts in Maine, (company name). The site has about 400 URL's and over half of the URL's utilize the first keyword phrase of the home page title, "Maine Resorts." Predominately, I find them used on the Accommodations pages (pages that describe each room with a picture) which I would label as deeper pages and non-conversion type pages. The page titles themselves are not exact duplicates of the Home Page Title but might read something like "Maine Resorts, Company Name, Accommodation Listing." My concern is that the heavy use of "Maine Resorts" as the first phrase in over 200 plus pages might be competing against the home page and pulling the home page ranking down. Thanks for any help given!
On-Page Optimization | | hawkvt10