Duplicate Homepage In Google
-
Hi
Just found through my SEO dashboard, Google has two versions of the same homepage, the root page, plus the index.html page, causing duplicate content from both the pages.
what is the best option to ensure google only have 1 version of the homepage listed?
-
I'm not a big fan of doubling-up on canonicalization tactics, because there's no good way to tell what's working. The 301 is probably a tiny bit stronger (not much, in my experience), but the advantage of the canonical tag on the home-page is that one tag will sweep up any variants. If you 301 "index.html" to the root, and then someone comes along and uses the non-www version of your home-page or adds a tracking parameter ("index.html?track=1234"), etc., the 301 won't do anything - you'll have to create 301s for each situation. The canonical will prevent those problems, which are very common on home pages.
-
Correct - usually, it's all the same physical page (on the server), so you only need to put it in one place.
-
Hi
just found I have same issues on my own website. However bit confused how I would change this. When the website was developed they where php pages, we did a rewrite so these pages went to .html pages instead, see file below. However now with my website www.rfksolutions.co.uk and www.rfksolutions.co.uk/index.html seem to be duplicate?
see my current htaccess file
any ideas?
DirectoryIndex index.html index.php
RewriteEngine on
RewriteRule ^index.html$ index.php [L]
RewriteRule ^about-us.html$ about-us.php [L]
RewriteRule ^portfolio.html$ portfolio.php [L]
RewriteRule ^partners.html$ partners.php [L]
RewriteRule ^logo-design-portfolio.html$ logo-design-portfolio.php [L]
RewriteRule ^graphic-design-portfolio.html$ graphic-design-portfolio.php [L]
RewriteRule ^graphic-design-portfolio2.html$ graphic-design-portfolio2.php [L]
RewriteRule ^testimonials.html$ testimonials.php [L]
RewriteRule ^portfolio/([0-9a-zA-Z_-]+).html$ portfolio_details.php?&title=$1 [L]RewriteCond %{HTTP_HOST} ^rfksolutions.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.rfksolutions.co.uk$
RewriteRule ^index.php$ "http://www.rfksolutions.co.uk" [R=301,L]RewriteCond %{HTTP_HOST} ^falkirkwebdesign.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.falkirkwebdesign.co.uk$
RewriteRule ^(.*)$ "http://www.rfksolutions.co.uk/$1" [R=301,L]RewriteCond %{HTTP_HOST} ^falkirkwebdesigners.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.falkirkwebdesigners.co.uk$
RewriteRule ^(.*)$ "http://www.rfksolutions.co.uk/$1" [R=301,L]RewriteCond %{HTTP_HOST} ^webdesignfalkirk.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.webdesignfalkirk.co.uk$
RewriteRule ^(.*)$ "http://www.rfksolutions.co.uk/$1" [R=301,L] -
I am confused on when to use both 301 and rel=canonical at the same time and when to use each separately. I have received advice through other Q&As saying to use both for my homepage in this exact same scenario, but sometimes SEOs advise to use rel=canonical on every page to point to its same location.
If 301s pass along "most" of the link juice, then why use both? What's the disadvantage of just using a 301?
D
-
Good job, 301's are usually the reliable & trusted solution. Canonical tags are really useful for other redirecting purposes such as similar content issues. All the best
-
Hi
thanks for that, it was a 301 i was thinking about, that is what I would normally have done, if for instance an old page had some link juice I would 301 it to another page, but I hadnt done one for this reason
I have just setup a 301 now for the domain name
-
Hi There
There are a couple of options;
-
301 Redirect the index.html page to your root domain, e.g. http://example.com/index.html 301 redirected to http://example.com
-
Use a Canonical tag, pointing the index.html page to your root domain.
You'll find a really helpful Redirection Guide here on SEOmoz.
You can also find here on SEOmoz a useful guide to Canonical tags.
I'd almost always recommend for this particular scenario, opting for a 301 Redirect on the page index.html to your root domain. Chances are (for these two pages) your inbound links will tend to mainly point to your root domain rather than the index.html page.
I hope that helps,
Regards
Simon
-
-
thanks for your help, sorry just to confirm
if I wanted google to keep the www.mydomain.com instead of www.mydomain.com/index.html as the main homepage I would put the link rel= onto the index/html and put the www.mydomain.com in there?
-
Adding the canonical tag within your telling search engines which homepage to use - this seemed to work for us:
If you're having duplicate content throughout your site, I've read having your site admin do something with the htaccess file would do the trick.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Home Page
Hi everyone! So, I;m using the crawl diagnostics in Moz and it's telling that I've got duplicate content for these two pages: http://www.bridgelanguages.com/
Technical SEO | | Bridge_Education_Group
http://www.bridgelanguages.com/index.php?p=3233&source=3 Would a redirect from the 2nd page to the 1st one be a solution? I'm not even sure where that 2nd link is on the site? Any suggestions or has anyone experienced the same? Thanks! Kelly0 -
Duplicate Page Errors
Hey guys, I'm wondering if anyone can help... Here is my issue... Our website:
Technical SEO | | TCPReliable
http://www.cryopak.com
It's built on Concrete 5 CMS I'm noticing a ton of duplicate page errors (9530 to be exact). I'm looking at the issues and it looks like it is being caused by the CMS. For instance the home page seems to be duplicating.. http://www.cryopak.com/en/
http://www.cryopak.com/en/?DepartmentId=67
http://www.cryopak.com/en/?DepartmentId=25
http://www.cryopak.com/en/?DepartmentId=4
http://www.cryopak.com/en/?DepartmentId=66 Do you think this is an issue? Is their anyway to fix this issue? It seems to be happening on every page. Thanks Jim0 -
Duplicate Content Issue
My issue with duplicate content is this. There are two versions of my website showing up http://www.example.com/ http://example.com/ What are the best practices for fixing this? Thanks!
Technical SEO | | OOMDODigital0 -
Duplicate Content
SEOmoz is reporting duplicate content for 2000 of my pages. For example, these are reported as duplicate content: http://curatorseye.com/Name=“Holster-Atlas”---Used-by-British-Officers-in-the-Revolution&Item=4158
Technical SEO | | jplill
http://curatorseye.com/Name=âHolster-Atlasâ---Used-by-British-Officers-in-the-Revolution&Item=4158 The actual link on the site is http://www.curatorseye.com/Name=“Holster-Atlas”---Used-by-British-Officers-in-the-Revolution&Item=4158 Any insight on how to fix this? I'm not sure where the second version of the URL is coming from. Thanks,
Janet0 -
Optimizing the homepage : should I have a h1 or not?
Hi everyone, I am launching a debate that may seem a bit basic for the most experienced of you 😉 In fact, after completing internships in two different agencies, I had quite opposite views of the optimization process for the homepage: View 1: The home page should be optimized on a keyword and should definitely have a h1. View 2: The home page is a branded page and should not necessarily have an h1 (unlike other pages). Bonus question: assuming that the view 1 is the right one, is it a good idea to put a h1 on the logo (only for the homepage) ? Thank you very much for your help
Technical SEO | | Axel_Janvier0 -
Thin/Duplicate Content
Hi Guys, So here's the deal, my team and I just acquired a new site using some questionable tactics. Only about 5% of the entire site is actually written by humans the rest of the 40k + (and is increasing by 1-2k auto gen pages a day)pages are all autogen + thin content. I'm trying to convince the powers that be that we cannot continue to do this. Now i'm aware of the issue but my question is what is the best way to deal with this. Should I noindex these pages at the directory level? Should I 301 them to the most relevant section where actual valuable content exists. So far it doesn't seem like Google has caught on to this yet and I want to fix the issue while not raising any more red flags in the process. Thanks!
Technical SEO | | DPASeo0 -
About Google Spider
Hello, people! I have some questions regarding on Google spider. Many people are saying that "Google spiders only have US IP address." Is this really true? But I also saw video from Google's offical blog and it said "Google spider come from all around the world." At this point I am really confused. Q1) I researched and it seems like Google spiders have only US IP address. THen what does exactly mean by "Google spider come from all around the world."? Q2) If Google spider have only US IP address, what happen to site which use IP delivery? Is this means that Google spider always redirect to us site since they only have US IP? Can anyone help me to understand?? One more questions! When Google analyzing for cloaking issue, do you think Google analyze when spider crawls the site or after they crawled the site?
Technical SEO | | Artience0 -
How to disallow google and roger?
Hey Guys and girls, i have a question, i want to disallow all robots from accessing a certain root link: Get rid of bots User-agent: * Disallow: /index.php?_a=login&redir=/index.php?_a=tellafriend%26productId=* Will this make the bots not to access any web link that has the prefix you see before the asterisk? And at least google and roger will get away by reading "user-agent: *"? I know this isn't the standard proceedure but if it works for google and seomoz bot we are good.
Technical SEO | | iFix0