Duplicate page content and Duplicate page title errors
-
Hi,
I'm new to SeoMoz and to this forum. I've started a new campaign on my site and got back loads of error.
Most of them are Duplicate page content and Duplicate page title errors. I know I have some duplicate titles but I don't have any duplicate content.
I'm not a web developer and not so expert but I have the impression that the crawler is following all my internal links (Infact I have also plenty of warnings saying "Too many on-page links".
Do you think this is the cause of my errors? Should I implement the nofollow on all internal links? I'm working with Joomla.
Thanks a lot for your help
Marco
-
Hi Marco,
I took a look at your page at http://www.beautifulpuglia.com/it/linea-costiera/isole-tremiti.html
Looks like you've got the canonical in place okay here. The next step is to add the canonical on every page that is a duplicate of this page. And you want to make sure to point to the right page. Let me be clear: Every page that is a duplicate of this page should have the same canonical. In this case:
<link rel=”canonical” href=[”http://www.beautifulpuglia.com/it/gargano/isole-tremiti.html”/](view-source:http://www.beautifulpuglia.com/linea-costiera/%E2%80%9Dhttp://www.beautifulpuglia.com/it/gargano/isole-tremiti.html%E2%80%9D/)>
You can find the other pages you need to add this tag to in your SEOmoz report. In each duplicated content report, it will list the number of other pages that are duplicates. Simply click on the number to see the URLs.
I'm not a Joomla expert, but webmasters I've talked to have expressed that other platforms such as Wordpress and Drupal are much more accommodating of these types of fixes. There are some various plugin modules you can use, but you'll have to select one appropriate to your configuration.
Here's a good resource from Dr. Pete: http://www.seomoz.org/blog/duplicate-content-in-a-post-panda-world
Hope this helps. Best of luck.
-
Elias,
I too have 'thousands' of duplicated errors in SEOmoz. Most of which are because it is returning
/abc.com as a different page to /ABC.com
Surely Google doesn't do that? Just because one URL is in capital and the other small case? I also have no idea where SEOmoz is picking that up from......possibly links internal to the page with the hyperlink using different case?
It seems to me this is too sensitive and for me to fix that would take WEEKS!!!! I fail to see if there would be any uplift if Google sees beyond that issue as its cosmetic and not functional.
Regards
Andy
-
It looks fine to me. You will need to do the same on all of your pages.
If you've just added the code you will need to wait up to a week for SEOmoz to re-crawl your website depending on when you're site crawl is scheduled.
Let me know how you get on.
Elias
-
Hi Elias, Hi Marisa,
thanks you both
you are right, in the meantime I had done this but I have the impression it is not working and I don't know what I'm doing wrong.
I'm attaching a link to a page of my site (I hope I can do this). Please have a look at the code, you will see the tag rel=”canonical” href=”http://www.beautifulpuglia.com/it/gargano/isole-tremiti.html”/> which is indicating the URL I want to use. However SeoMoz is still giving me the error. And this is happening for both the Italian and English version.
So far I've only added the tag to this page, I want to find the solution before modifying all pages currently affected.
http://www.beautifulpuglia.com/it/linea-costiera/isole-tremiti.html
Thanks a lot again
-
Hi Marco, as Marissa says - by putting the canonical tag on one page you are putting it on all of them as they are in fact the same page - they are just reached by different URLs.
-
www.site.com/ and www.site.com/index.html, site.com/index.html/, ect, are already the same page. So, there's only one page TO put the tag on. You're just telling the crawlers that you only want one of them to get the credit, and which version of the page you prefer to be displayed.
-
Hi Elias,
thanks a lot for your reply. I've read few posts about the canonical tag and Yes I'm going to try it.
Just couple of things:
-
Let's say I have 4 duplicate for one page, I presume I have to add the tag in the head of only one page right? Does it make any difference which one I pick?
-
Any idea on how this can be implemented in Joomla?It doesn't seem to be very straightforward.
Thanks a lot
Marco
-
-
Hi Marco,
It seems to me like you need to implement the canonical tag.
Site crawlers/bots will consider the following pages as different pages because of their URL and thus tell indicate to them that the content is duplicated on each page...
By implementing the following tag on each of your sites pages (changing the URL for each page) you will tell the crawler which page they should be indexing and to ignore the other.
Here's an example of a canonical tag (to be placed within the head tag of the page)
I think this will sort out your duplication issues.
You can find more information about canonical URLs here http://www.seomoz.org/blog/canonical-url-tag-the-most-important-advancement-in-seo-practices-since-sitemaps
I hope this helps!
Elias
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Content through 'Gclid'
Hello, We've had the known problem of duplicate content through the gclid parameter caused by Google Adwords. As per Google's recommendation - we added the canonical tag to every page on our site so when the bot came to each page they would go 'Ah-ha, this is the original page'. We also added the paramter to the URL parameters in Google Wemaster Tools. However, now it seems as though a canonical is automatically been given to these newly created gclid pages; below https://www.google.com.au/search?espv=2&q=site%3Awww.mypetwarehouse.com.au+inurl%3Agclid&oq=site%3A&gs_l=serp.3.0.35i39l2j0i67l4j0i10j0i67j0j0i131.58677.61871.0.63823.11.8.3.0.0.0.208.930.0j3j2.5.0....0...1c.1.64.serp..8.3.419.nUJod6dYZmI Therefore these new pages are now being indexed, causing duplicate content. Does anyone have any idea about what to do in this situation? Thanks, Stephen.
Intermediate & Advanced SEO | | MyPetWarehouse0 -
What is considered duplicate content?
Hi, We are working on a product page for bespoke camper vans: http://www.broadlane.co.uk/campervans/vw-campers/bespoke-campers . At the moment there is only one page but we are planning add similar pages for other brands of camper vans. Each page will receive its specifically targeted content however the 'Model choice' cart at the bottom (giving you the choice to select the internal structure of the van) will remain the same across all pages. Will this be considered as duplicate content? And if this is a case, what would be the ideal solution to limit penalty risk: A rel canonical tag seems wrong for this, as there is no original item as such. Would an iFrame around the 'model choice' enable us to isolate the content from being indexed at the same time than the page? Thanks, Celine
Intermediate & Advanced SEO | | A_Q0 -
Duplicated Content with Index.php
Good Afternoon, My website uses Joomla CMS and has the htaccess rewrite code enabled to ensure the use of search engine friendly URLs (SEF's). While browsing the crawl diagnostics I have found that Moz considers the /index.php URL a duplicate to our root. I will always under the impression that the htaccess rewrite took care of that issue and obviously I would like to address it. I attempted to create a 301 redirect from the index.php URL to the root but ran into an issue when attempting to login to the admin portion of the website as the redirect sent me back to the homepage. I was curious if anyone had advice for handling the index.php duplication issue, specifically with Joomla. Additionally, I have confirmed that in Google Webmasters, under URL parameters, the index.php parameter is set as 'Representative URL'.
Intermediate & Advanced SEO | | BrandonEML0 -
Noindexing Duplicate (non-unique) Content
When "noindex" is added to a page, does this ensure Google does not count page as part of their analysis of unique vs duplicate content ratio on a website? Example: I have a real estate business and I have noindex on MLS pages. However, is there a chance that even though Google does not index these pages, Google will still see those pages and think "ah, these are duplicate MLS pages, we are going to let those pages drag down value of entire site and lower ranking of even the unique pages". I like to just use "noindex, follow" on those MLS pages, but would it be safer to add pages to robots.txt as well and that should - in theory - increase likelihood Google will not see such MLS pages as duplicate content on my website? On another note: I had these MLS pages indexed and 3-4 weeks ago added "noindex, follow". However, still all indexed and no signs Google is noindexing yet.....
Intermediate & Advanced SEO | | khi50 -
Duplicate pages with http and https
Hi all, We changed the payment part of our site to https from http a while ago. However once on the https pages, all the footer and header links are relative URLs, so once users have reached the payment pages and then re-navigate back to other pages in our website they stay on https. The build up of this happening has led to Google indexing all our pages in https (something we did not want to happen), and now we are in the situation where our homepage listing on Google is https rather than http. We would prefer the organic listings to be http (rather than https) and having read lots on this (included the great posts on the moz (still feels odd not refering to it as seomoz!) blog around this subject), possible solutions include redirects or a canoncial tags. My additional questions around these options are: 1. We already have 2 redirects on some pages (long story), will another one negatively impact our rankings? 2. Is a canonical a strong enough hint to Google to stop Google indexing the https versions of these page to the extent that out http pages will appear in natural listings again? If anyone has any other suggestions or other ideas of how to address this issue, that would be great! Thanks 🙂 Diana
Intermediate & Advanced SEO | | Diana.varbanescu0 -
Duplicate content
I run about 10 sites and most of them seemed to fall foul of the penguin update and even though I have never sought inorganic links I have been frantically searching for a link based answer since April. However since asking a question here I have been pointed in another direction by one of your contributors. It seems At least 6 of my sites have duplicate content issues. If you search Google for "We have selected nearly 200 pictures of short haircuts and hair styles in 16 galleries" which is the first bit of text from the site short-hairstyles.com about 30000 results appear. I don't know where they're from nor why anyone would want to do this. I presume its automated since there is so much of it. I have decided to redo the content. So I guess (hope) at some point in the future the duplicate nature will be flushed from Google's index? But how do I prevent it happening again? It's impractical to redo the content every month or so. For example if you search for "This facility is written in Flash® to use it you need to have Flash® installed." from another of my sites that I coincidently uploaded a new page to a couple of days ago, only the duplicate content shows up not my original site. So whoever is doing this is finding new stuff on my site and getting it indexed on google before even google sees it on my site! Thanks, Ian
Intermediate & Advanced SEO | | jwdl0 -
Duplicate Content - Panda Question
Question: Will duplicate informational content at the bottom of indexed pages violate the panda update? **Total Page Ratio: ** 1/50 of total pages will have duplicate content at the bottom off the page. For example...on 20 pages in 50 different instances there would be common information on the bottom of a page. (On a total of 1000 pages). Basically I just wanted to add informational data to help clients get a broader perspective on making a decision regarding "specific and unique" information that will be at the top of the page. Content ratio per page? : What percentage of duplicate content is allowed per page before you are dinged or penalized. Thank you, Utah Tiger
Intermediate & Advanced SEO | | Boodreaux0 -
Duplicate content for swatches
My site is showing a lot of duplicate content on SEOmoz. I have discovered it is because the site has a lot of swatches (colors for laminate) within iframes. Those iframes have all the same content except for the actual swatch image and the title of the swatch. For example, these are two of the links that are showing up with duplicate content: http://www.formica.com/en/home/dna.aspx?color=3691&std=1&prl=PRL_LAMINATE&mc=0&sp=0&ots=&fns=&grs= http://www.formica.com/en/home/dna.aspx?color=204&std=1&prl=PRL_LAMINATE&mc=0&sp=0&ots=&fns=&grs= I do want each individual swatch to show up in search results and they currently are if you search for the exact swatch name. Is the fact that they all have duplicate content affecting my individual rankings and my domain authority? What can I do about it? I can't really afford to put unique content on each swatch page so is there another way to get around it? Thanks!
Intermediate & Advanced SEO | | AlightAnalytics0