Html code for none .index
-
In the diagnostic I have several errors in Duplicate Page Content and Title. The thing is that the errors is on the same page but with "different" names. One is called http://siteX.com/ another is called http://www.siteX.com/ and the same third one is called
http://www.siteX.com/index.htmlHow do I go about changing all three sites, I have changed the /index.html one but dont know how to catch the other once. Is it possible, if it is I would like to know how?
-
An other thing.
I missed changing the example.com to my site and no it goes to example.com, even if I change it or even delete the file...still there.
I there a way to go around it?
-
I have done the following, is that correct? If it is why cant I see any directions, for ex. I put www.siteX.com/index.html or siteX.com and get the same ?
-FrontPage-
RewriteEngine On
RewriteCond %{HTTP_HOST} ^example.com
RewriteRule (.*) http://www.example.com/$1 [R=301,L]RewriteEngine on
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /index.html\ HTTP/
RewriteRule ^index.html$ http://example.com/ [R=301,L]IndexIgnore .htaccess /.?? *~ *# /HEADER /README /_vti
<limit get="" post="">order deny,allow
deny from all
allow from all</limit>
<limit put="" delete="">order deny,allow
deny from all</limit> -
Okay, the fact you already have an .htaccess file means you should be able to try adding the rules I provided. Put them at the top of the .htaccess file and test.
-
Please take what I say below with a grain of salt, as I am very good with .htaccess, but not so great when Frontpage is in the loop. Also back up ALL files before making ANY changes for quick replace if creates a "Internal Server Error"
_vti_bin/
_vti_adm
_vti_authShould be in your structure, each with an .htaccess
add the line
Options +FollowSymlinks
to each one
Now the just add everything streamline metrics has suggested, to the current .htaccess and test
You can try adding what streamline metrics suggests, without the above steps, as .htaccess is not dependent on Frontpage, and the Frontpage extensions have nothing to do with .htaccess (from research i found on the web)
-
I am not sure what kind of server I have but this is whats in my .htaccess
-FrontPage-
IndexIgnore .htaccess /.?? *~ *# /HEADER /README /_vti
<limit get="" post="">order deny,allow
deny from all
allow from all</limit>
<limit put="" delete="">order deny,allow
deny from all</limit>How do I do it?
-
The first step would be to redirect the http://siteX.com to http://www.siteX.com or vice versa. You can easily do this with .htaccess if you have a LAMP server. Here is the code to put in your .htaccess to redirect from non-www to www (replace example.com with your site name) -
RewriteEngine On
RewriteCond %{HTTP_HOST} ^example.com
RewriteRule (.*) http://www.example.com/$1 [R=301,L]As for handling http://siteX.com/index.html, simply redirect that as well with .htaccess -
RewriteEngine on
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /index.html\ HTTP/
RewriteRule ^index.html$ http://example.com/ [R=301,L]I would also suggest adding a rel="canonical" tag to your pages just in case the search engines come across URLs with parameters, such as index.html?q=1235 then they know to index only the version of the page you designated.
Hope that helps!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Strange google indexing behaviour
Hi all Looking for a second opinion on a strange issue with has occurred on my site. The site is a magento store and because I am using all the default merchant descriptions at the moment I have noindexed the product pages (there are 300k products, the plan is to rewrite the content as we go, starting with most popular sellers). The Gbot is blocked from the pages and all the products have header tag. We forgot to noindex the popular search terms page on the site and as a result google has indexed some search result pages - we may keep this open, not sure yet, We are seeing a very strange thing in the serps. Google has indexed the search result pages, as mentioned above, however, the description and title tag being used do not belong to that page, they belong to the product page the search result links to. If i do a search in google for the indexed pages i get the categories and lots of, what appears to be, product pages. https://www.google.co.uk/search?q=site:arropa.co.uk/store&espv=2&biw=1536&bih=772&ei=LE5xVd3qA4HlUNnggKgH&start=250&sa=N One would assume that a page listed with the title of Ladies 1 Pair Young Trasparenze Mumbai Animal Print . and the description of Come on, program a little of your crazy side! Part of the edgy, sassy Young Trasparenze Medley, these soft touch, nontransparent stockings function a crazy, (along with the price) would be an entry for that individual product. However, clicking on that product opens up a search results page (very slowly as the site is processing an update still - it is not for public use thus far) which can be seen here http://arropa.co.uk/store/catalogsearch/result/?q=+ladies+1+pair+young+trasparenze+mumbai+animal+print+tights+75+off+military+l+ yes, the search result page is for that particular item but nowhere on the page is the title, description and price, nor has it ever been. Am a little puzzled about this and what it would do re duplicate content as im using the manufacturer data at present. Ideally i would like to keep the search results pages open. Any thoughts would be most welcome. Couple of things to note. Im aware the site is too slow for general public use. It will be fully cached once running, as i say, it has 300k+ products so isn't small. Also, am aware that there are no images. They exist but we are moving the images around, hence being down. Always a fun task when there are 25gb of the things!! Many thanks Carl
On-Page Optimization | | WonkyDog0 -
Indexing pages after de-indexing them
I have been de-indexing duplicate content on my website which has almost 40 pages contain duplicate content from other websites. later on the website ranking drop down. so should i re index them or just wait ?
On-Page Optimization | | MohammadSabbagh0 -
Why Can't I Get Indexed?
I cannot seem to get my website indexed by Google! I submitted the sitemap using Google WMT about a month ago but only one page is being indexed. There are very few backlinks to the site, so I don't believe there are any penalties due to over-optimization that would prevent indexing. Also, my robots.txt file is properly configured and is not preventing any pages from being crawled. I've tried using the "Fetch as Google" settings in WMT with no luck. Any ideas?
On-Page Optimization | | socialfirestarter0 -
When You Add a Robots.txt file to a website to block certain URLs, do they disappear from Google's index?
I have seen several websites recently that have have far too many webpages indexed by Google, because for each blog post they publish, Google might index the following: www.mywebsite.com/blog/title-of-post www.mywebsite.com/blog/tag/tag1 www.mywebsite.com/blog/tag/tag2 www.mywebsite.com/blog/category/categoryA etc My question is: if you add a robots.txt file that tells Google NOT to index pages in the "tag" and "category" folder, does that mean that the previously indexed pages will eventually disappear from Google's index? Or does it just mean that newly created pages won't get added to the index? Or does it mean nothing at all? thanks for any insight!
On-Page Optimization | | williammarlow0 -
Google Index Report
Hi, I have just checked my google webmaster tools account and viewed the index status of my website and it produced the attached graph, which show quite a big spike in indexing during July and August 2012. Does this look normal or does it reveal anything peculiar? We did have a new website launched in June 2012 and I re-submitted the sites URL's to google as part of the re-launch and so I am unsure if this may account for the spike. Any advice appreciated. Thanks indexing.png
On-Page Optimization | | UnderMe0 -
My Images Aren't Indexed By Goolge
My site is indexed by google, and there hasn't really been much of a problem with my content, but now I am noticing that none of our images are coming up in the images searches of google. I mean none. I have even typed in the alt text of the image verbatim and nothing shows up. I use wordpress if this helps anyone. Any advice would be awesome, Thanks a lot.
On-Page Optimization | | Caseman0 -
Remove internal site SERPS from Google Index?
1. Internal Serp pages did not have a robots meta tag 2. As a result, client site has thousands (~4,400) of internal site SERP pages in the Google index. 3. We added the NoIndex, Follow attribute to all internal SERPS 4. We Disallowed: domain.com/internal-search-operator in Robots.txt 5. No new SERP pages are being indexed, but the other 4000 something that were already there are still in the index weeks later. 6. The pages are dynamically created and still work, so I can't use the Remove Content tool from google, because the pages don't 404. Is there any way to get these pages out of the index besides just waiting and hoping google eventuall drops them? Thanks
On-Page Optimization | | delegator.com0 -
Got loads of pages, but none indexing?
I have a WordPress site with loads of pages on a url like this http://mysite.com.au However, Google has indexed http://www.mysite.com.au and as a result only indexing 2 pages. How do I fix this? Many thanks Dan
On-Page Optimization | | Pokodot0