Duplicate Homepage In Google
-
Hi
Just found through my SEO dashboard, Google has two versions of the same homepage, the root page, plus the index.html page, causing duplicate content from both the pages.
what is the best option to ensure google only have 1 version of the homepage listed?
-
I'm not a big fan of doubling-up on canonicalization tactics, because there's no good way to tell what's working. The 301 is probably a tiny bit stronger (not much, in my experience), but the advantage of the canonical tag on the home-page is that one tag will sweep up any variants. If you 301 "index.html" to the root, and then someone comes along and uses the non-www version of your home-page or adds a tracking parameter ("index.html?track=1234"), etc., the 301 won't do anything - you'll have to create 301s for each situation. The canonical will prevent those problems, which are very common on home pages.
-
Correct - usually, it's all the same physical page (on the server), so you only need to put it in one place.
-
Hi
just found I have same issues on my own website. However bit confused how I would change this. When the website was developed they where php pages, we did a rewrite so these pages went to .html pages instead, see file below. However now with my website www.rfksolutions.co.uk and www.rfksolutions.co.uk/index.html seem to be duplicate?
see my current htaccess file
any ideas?
DirectoryIndex index.html index.php
RewriteEngine on
RewriteRule ^index.html$ index.php [L]
RewriteRule ^about-us.html$ about-us.php [L]
RewriteRule ^portfolio.html$ portfolio.php [L]
RewriteRule ^partners.html$ partners.php [L]
RewriteRule ^logo-design-portfolio.html$ logo-design-portfolio.php [L]
RewriteRule ^graphic-design-portfolio.html$ graphic-design-portfolio.php [L]
RewriteRule ^graphic-design-portfolio2.html$ graphic-design-portfolio2.php [L]
RewriteRule ^testimonials.html$ testimonials.php [L]
RewriteRule ^portfolio/([0-9a-zA-Z_-]+).html$ portfolio_details.php?&title=$1 [L]RewriteCond %{HTTP_HOST} ^rfksolutions.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.rfksolutions.co.uk$
RewriteRule ^index.php$ "http://www.rfksolutions.co.uk" [R=301,L]RewriteCond %{HTTP_HOST} ^falkirkwebdesign.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.falkirkwebdesign.co.uk$
RewriteRule ^(.*)$ "http://www.rfksolutions.co.uk/$1" [R=301,L]RewriteCond %{HTTP_HOST} ^falkirkwebdesigners.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.falkirkwebdesigners.co.uk$
RewriteRule ^(.*)$ "http://www.rfksolutions.co.uk/$1" [R=301,L]RewriteCond %{HTTP_HOST} ^webdesignfalkirk.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.webdesignfalkirk.co.uk$
RewriteRule ^(.*)$ "http://www.rfksolutions.co.uk/$1" [R=301,L] -
I am confused on when to use both 301 and rel=canonical at the same time and when to use each separately. I have received advice through other Q&As saying to use both for my homepage in this exact same scenario, but sometimes SEOs advise to use rel=canonical on every page to point to its same location.
If 301s pass along "most" of the link juice, then why use both? What's the disadvantage of just using a 301?
D
-
Good job, 301's are usually the reliable & trusted solution. Canonical tags are really useful for other redirecting purposes such as similar content issues. All the best
-
Hi
thanks for that, it was a 301 i was thinking about, that is what I would normally have done, if for instance an old page had some link juice I would 301 it to another page, but I hadnt done one for this reason
I have just setup a 301 now for the domain name
-
Hi There
There are a couple of options;
-
301 Redirect the index.html page to your root domain, e.g. http://example.com/index.html 301 redirected to http://example.com
-
Use a Canonical tag, pointing the index.html page to your root domain.
You'll find a really helpful Redirection Guide here on SEOmoz.
You can also find here on SEOmoz a useful guide to Canonical tags.
I'd almost always recommend for this particular scenario, opting for a 301 Redirect on the page index.html to your root domain. Chances are (for these two pages) your inbound links will tend to mainly point to your root domain rather than the index.html page.
I hope that helps,
Regards
Simon
-
-
thanks for your help, sorry just to confirm
if I wanted google to keep the www.mydomain.com instead of www.mydomain.com/index.html as the main homepage I would put the link rel= onto the index/html and put the www.mydomain.com in there?
-
Adding the canonical tag within your telling search engines which homepage to use - this seemed to work for us:
If you're having duplicate content throughout your site, I've read having your site admin do something with the htaccess file would do the trick.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Fetch as Google issues
HI all, Recently, well a couple of months back, I finally got around to switching our sites over to HTTPS://. In terms of rankings etc all looks fine and we have not move about much, only the usual fluctuations of a place or two on a daily basis in a competitive niche. All links have been updated, redirects in place, the usual https domain migration stuff. I am however, troubled by one thing! I cannot for love nor money get Google to fetch my site in GSC. No matter what I have tried it continues to display "Temporarily unreachable". I have checked the robots.txt and it is on a new https:// profile in GSC. Has anyone got a clue as I am stumped! Have I simply become blinded by looking too much??? Site in Q. caravanguard co uk. Cheers and looking forward to your comments.... Tim
Technical SEO | | TimHolmes0 -
Purchasing duplicate content
Morning all, I have a client who is planning to expand their product range (online dictionary sites) to new markets and are considering the acquisition of data sets from low ranked competitors to supplement their own original data. They are quite large content sets and would mean a very high percentage of the site (hosted on a new sub domain) would be made up of duplicate content. Just to clarify, the competitor's content would stay online as well. I need to lay out the pros and cons of taking this approach so that they can move forward knowing the full facts. As I see it, this approach would mean forgoing ranking for most of the site and would need a heavy dose of original content as well as supplementing the data on page to build around the data. My main concern would be that launching with this level of duplicate data would end up damaging the authority of the site and subsequently the overall domain. I'd love to hear your thoughts!
Technical SEO | | BackPack851 -
Page missing from Google index
Hi all, One of our most important pages seems to be missing from the Google index. A number of our collections pages (e.g., http://perfectlinens.com/collections/size-king) are thin, so we've included a canonical reference in all of them to the main collection page (http://perfectlinens.com/collections/all). However, I don't see the main collection page in any Google search result. When I search using "info:http://perfectlinens.com/collections/all", the page displayed is our homepage. Why is this happening? The main collection page has a rel=canonical reference to itself (auto-generated by Shopify so I can't control that). Thanks! WUKeBVB
Technical SEO | | leo920 -
Duplication, pagination and the canonical
Hi all, and thank you in advance for your assistance. We have an issue of paginated pages being seen as duplicates by pro.moz crawlers. The paginated pages do have duplicated by content, but are not duplicates of each other. Rather they pull through a summary of the product descriptions from other landing pages on the site. I was planing to use rel=canonical to deal with them, however I am concerned as the paginated pages are not identical to each other, but do feature their own set of duplicate content! We have a similar issue with pages that are not paginated but feature tabs that alter the URL parameters like so: ?st=BlueWidgets ?st=RedSocks ?st=Offers These are being seen as duplicates of the main URL, and again all feature duplicate content pulled from elsewhere in the site, but are not duplicates of each other. Would a canonical tag be suitable here? Many Thanks
Technical SEO | | .egg0 -
Home page deindexed by google
when I search my website on google by site:www.mydomain.com I have found my domain with www has been de-indexed by google, but when I search site:mydomain.com, my home page--**mydomain.com **show up on the search results without www, put it simple, google only index my domain without www, I wonder how to make my domain with www being indexed, and how to prevent this problem occure again.
Technical SEO | | semer0 -
Do Collections in Shopify create Duplicate Pages according to Google/Bing/Yahoo?
I'm using the e-commerce platform Shopify to host an e-store. We've put our products into different collections. Shopify automatically creates different URL paths to a product in multiple collections. I'm worried that the same product listed in different collections is soon as different pages, and therefore duplicate content by Google/Bing/Yahoo. Would love to get your opinion on this concern! Thanks! Matthew
Technical SEO | | HappinessDigital0 -
Having trouble removing homepage from google
For various reasons my client wants their homepage removed from google, no just the content of the page off but the page not to be indexed (yep strange request but we are mere service providers) today I requested in webmaster tool that default.asp was removed. Wht says done but the sites homepage is still listed. The page also has a no index tag on but 24 hours and 18k Google bot hits later it still remains. Anyone got any other suggestions to deindex just the homepage asap please
Technical SEO | | Grumpy_Carl0 -
Duplication Penalty through Specs?
I am trying to figure our how to correct a recently incurred duplication penalty on a partner site. I didn't see any posts on this yet specific to my problem. The site used to be ranked on page 1 of Google for all important keywords but now we ran into the situation that many pages were bumped to pos 100 or lower due to duplication issues. This is an aviation site, discussing airplanes and each page discusses a different model but each page also has the specs of the plane and while the data parts are different for each plane the specification terms are the same ,see here: Primary Function:
Technical SEO | | WizardHQ
Crew:
Engine:
Thrust:
Weight Empty:
Max. Weight:
Length:
Wingspan:
Cruise Speed:
Max.Speed:
Climb:
Ceiling:
Range:
First Flight:
Year Deployed: Is there an easy way to get Google to stop including these terms (not the data in the 2nd column) from the page anaysis to prevent this causing the duplication issues we are are seeing due to this? Thanks in advance!0