Canonicalization of index.html - please help
-
I've read up on the subject but am new at this, so I thought I would just put forth a simple question. We want our home page to be referred to as www.domain.com, and we want the search engines to find and return this URL in search results. But the page has to have a file name, and its actual address is www.domain.com/index.html, not www.domain.com. This, I believe, is what can cause duplicate content issues (not really duplicate, but perceived by the search engines as duplicate content). Is it best to insert <link rel="canonical" href="http://www.domain.com/" /> in the HEAD section of the index.html page, or am I totally misunderstanding this concept?
-
When you do your 301 redirects as outlined by John, don't forget to 301 redirect your non-www URL version to your www URL version (or vice versa).
Here is an example of all the URLs that could lead to the same home page on your website:
http://www.domain.com
http://www.domain.com/index.html
http://domain.com
http://domain.com/index.html
-
Hi Tag,
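As John is suggesting, you could do a straight 301 from /index.html to /, but the problem is that this can lead to an infinite redirect loop and a page error, because the server typically answers a request for / by internally serving index.html, which triggers the redirect again. Your best bet is to use the technique here: http://www.askapache.com/htaccess/redirect-index-blog-root.html to avoid that. Happy hunting.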
Hope this helps.
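To make the loop problem concrete, here is a minimal sketch in Python. It is not server configuration, only a model of the decision, and it assumes the www version is the preferred one and that the server serves / by internally mapping it to /index.html. The names `CANONICAL_HOST`, `INDEX_NAMES`, `redirect_target` and the `internal` flag are made up for illustration. The idea is that the redirect looks only at what the visitor originally requested, which, as I understand it, is also the idea behind testing the original request line in mod_rewrite.

```python
from typing import Optional

CANONICAL_HOST = "www.domain.com"  # assumption: the www version is preferred
INDEX_NAMES = ("index.html",)      # assumption: only index.html needs folding


def redirect_target(host: str, path: str, internal: bool = False) -> Optional[str]:
    """Return the URL to 301 to, or None if the request should be served as-is.

    `internal` marks a request the server generated itself (for example
    when it maps / to /index.html). Those are never redirected, and that
    is what prevents the redirect loop.
    """
    if internal:
        return None

    path = path or "/"  # treat a missing path as the site root
    canonical_path = path
    for name in INDEX_NAMES:
        if path.endswith("/" + name):
            # Strip the file name but keep the trailing slash.
            canonical_path = path[: -len(name)]
            break

    if host == CANONICAL_HOST and canonical_path == path:
        return None  # already canonical, nothing to do
    return f"http://{CANONICAL_HOST}{canonical_path}"


if __name__ == "__main__":
    # The four variants listed earlier in this thread.
    for host, path in [
        ("www.domain.com", "/"),
        ("www.domain.com", "/index.html"),
        ("domain.com", "/"),
        ("domain.com", "/index.html"),
    ]:
        print(host + path, "->", redirect_target(host, path))
```

Three of the four variants get a single 301 to http://www.domain.com/, and the canonical URL itself is served without a redirect. Without the `internal` check, the server's own lookup of /index.html would be redirected back to /, and the cycle would repeat.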
-
Yes, this does create a duplicate content issue. The best solution is to have /index.html 301 redirect to /. However, the canonical tag as you outlined above should also fix the issue if you don't have access to your server configuration for redirects.
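If you go the canonical tag route, it is worth checking that the tag really ends up in the page. The following is a rough sketch using only Python's standard library. `CanonicalFinder`, `find_canonicals` and `SAMPLE_PAGE` are hypothetical names, and the sample markup is an assumed home page, not the poster's actual file.

```python
from html.parser import HTMLParser
from typing import List


class CanonicalFinder(HTMLParser):
    """Collects the href of every <link rel="canonical"> in a page."""

    def __init__(self) -> None:
        super().__init__()
        self.hrefs: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = dict(attrs)
        # rel may hold several space-separated values.
        rel = (attr.get("rel") or "").lower().split()
        if "canonical" in rel and attr.get("href"):
            self.hrefs.append(attr["href"])


def find_canonicals(html: str) -> List[str]:
    """Return all canonical URLs declared in the given HTML text."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.hrefs


# Assumed example of what index.html might contain.
SAMPLE_PAGE = """<html><head>
<title>Home</title>
<link rel="canonical" href="http://www.domain.com/" />
</head><body>Welcome</body></html>"""

if __name__ == "__main__":
    print(find_canonicals(SAMPLE_PAGE))
```

A page should normally declare exactly one canonical URL, so an empty list or more than one entry is a sign that something needs fixing.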