Duplicate Homepage In Google
-
Hi
I just found through my SEO dashboard that Google has two versions of the same homepage, the root page and the index.html page, which is causing duplicate content across the two.
What is the best option to ensure Google only has one version of the homepage listed?
-
I'm not a big fan of doubling-up on canonicalization tactics, because there's no good way to tell what's working. The 301 is probably a tiny bit stronger (not much, in my experience), but the advantage of the canonical tag on the home-page is that one tag will sweep up any variants. If you 301 "index.html" to the root, and then someone comes along and uses the non-www version of your home-page or adds a tracking parameter ("index.html?track=1234"), etc., the 301 won't do anything - you'll have to create 301s for each situation. The canonical will prevent those problems, which are very common on home pages.
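As a rough illustration of the point about needing one 301 per variant, here is a minimal .htaccess sketch. It assumes Apache with mod_rewrite enabled, uses www.example.com as a placeholder hostname, and treats "track" as the tracking parameter from the example above. None of it comes from a real site's configuration.

```apache
RewriteEngine On

# Variant: non-www host. Send every request to the www host (placeholder domain).
RewriteCond %{HTTP_HOST} ^example\.com$ [NC]
RewriteRule ^(.*)$ http://www.example.com/$1 [R=301,L]

# Variant: tracking parameter on the home page, e.g. /?track=1234
# or /index.html?track=1234.
# The trailing "?" in the target drops the query string from the redirect.
RewriteCond %{QUERY_STRING} (^|&)track= [NC]
RewriteRule ^(index\.html)?$ http://www.example.com/? [R=301,L]
```

Each rule only covers the variant it names, so another parameter or hostname means another rule, whereas a single canonical tag on the page covers them without extra configuration.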
-
Correct - usually, it's all the same physical page (on the server), so you only need to put it in one place.
-
Hi
I just found I have the same issue on my own website, but I'm a bit confused about how I would change this. When the website was developed the pages were PHP pages; we did a rewrite so these pages went to .html pages instead (see the file below). However, now www.rfksolutions.co.uk and www.rfksolutions.co.uk/index.html seem to be duplicates.
See my current htaccess file:
Any ideas?
DirectoryIndex index.html index.php
RewriteEngine on
RewriteRule ^index.html$ index.php [L]
RewriteRule ^about-us.html$ about-us.php [L]
RewriteRule ^portfolio.html$ portfolio.php [L]
RewriteRule ^partners.html$ partners.php [L]
RewriteRule ^logo-design-portfolio.html$ logo-design-portfolio.php [L]
RewriteRule ^graphic-design-portfolio.html$ graphic-design-portfolio.php [L]
RewriteRule ^graphic-design-portfolio2.html$ graphic-design-portfolio2.php [L]
RewriteRule ^testimonials.html$ testimonials.php [L]
RewriteRule ^portfolio/([0-9a-zA-Z_-]+).html$ portfolio_details.php?&title=$1 [L]
RewriteCond %{HTTP_HOST} ^rfksolutions.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.rfksolutions.co.uk$
RewriteRule ^index.php$ "http://www.rfksolutions.co.uk" [R=301,L]
RewriteCond %{HTTP_HOST} ^falkirkwebdesign.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.falkirkwebdesign.co.uk$
RewriteRule ^(.*)$ "http://www.rfksolutions.co.uk/$1" [R=301,L]
RewriteCond %{HTTP_HOST} ^falkirkwebdesigners.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.falkirkwebdesigners.co.uk$
RewriteRule ^(.*)$ "http://www.rfksolutions.co.uk/$1" [R=301,L]
RewriteCond %{HTTP_HOST} ^webdesignfalkirk.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.webdesignfalkirk.co.uk$
RewriteRule ^(.*)$ "http://www.rfksolutions.co.uk/$1" [R=301,L]
-
I am confused on when to use both 301 and rel=canonical at the same time and when to use each separately. I have received advice through other Q&As saying to use both for my homepage in this exact same scenario, but sometimes SEOs advise to use rel=canonical on every page to point to its same location.
If 301s pass along "most" of the link juice, then why use both? What's the disadvantage of just using a 301?
D
-
Good job. 301s are usually the reliable and trusted solution. Canonical tags are really useful for other purposes, such as similar-content issues. All the best
-
Hi
Thanks for that. It was a 301 I was thinking about; that is what I would normally have done. If, for instance, an old page had some link juice, I would 301 it to another page, but I hadn't done one for this reason.
I have just set up a 301 now for the domain name.
-
Hi There
There are a couple of options:
- 301 redirect the index.html page to your root domain, e.g. http://example.com/index.html 301 redirected to http://example.com
- Use a canonical tag, pointing the index.html page to your root domain.
You'll find a really helpful Redirection Guide here on SEOmoz.
You can also find here on SEOmoz a useful guide to Canonical tags.
For this particular scenario I'd almost always recommend a 301 redirect from the index.html page to your root domain. Chances are that, for these two pages, most of your inbound links point to your root domain rather than to the index.html page.
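A minimal .htaccess sketch of that 301, assuming Apache with mod_rewrite enabled and using www.example.com as a placeholder hostname (swap in your own):

```apache
RewriteEngine On

# Only act when the browser itself asked for /index.html. Without this check,
# the internal lookup Apache does to serve "/" via DirectoryIndex could match
# the rule as well and cause a redirect loop.
RewriteCond %{THE_REQUEST} ^[A-Z]+\s/index\.html[?\s]
RewriteRule ^index\.html$ http://www.example.com/ [R=301,L]
```

After deploying, a request for /index.html should come back as a 301 with a Location header pointing at the root; a header checker or curl -I can confirm this.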
I hope that helps,
Regards
Simon
-
-
Thanks for your help. Sorry, just to confirm:
if I wanted Google to keep www.mydomain.com instead of www.mydomain.com/index.html as the main homepage, I would put the link rel="canonical" tag on the index.html page and put www.mydomain.com in there?
-
Adding a rel="canonical" link tag within your <head>, with its href set to the homepage URL you prefer, tells search engines which homepage to use - this seemed to work for us.
If you're having duplicate content throughout your site, I've read that having your site admin handle it in the htaccess file would do the trick.