Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Moving html site to wordpress and 301 redirect from index.htm to index.php or just www.example.com
-
I found page duplicate content when using Moz crawl tool, see below.
http://www.example.com
Page Authority 40
Linking Root Domains 31
External Link Count 138
Internal Link Count 18
Status Code 200
1 duplicatehttp://www.example.com/index.htm
Page Authority 19
Linking Root Domains 1
External Link Count 0
Internal Link Count 15
Status Code 200
1 duplicateI have recently transfered my old html site to wordpress.
To keep the urls the same I am using a plugin which appends .htm at the end of each page.My old site home page was index.htm. I have created index.htm in wordpress as well but now there is a conflict of duplicate content. I am using latest post as my home page which is index.php
Question 1.
Should I also use redirect 301 im htaccess file to transfer index.htm page authority (19) to www.example.comIf yes, do I use
Redirect 301 /index.htm http://www.example.com/index.php
or
Redirect 301 /index.htm http://www.example.comQuestion 2
Should I change my "Home" menu link to http://www.example.com instead of http://www.example.com/index.htm that would fix the duplicate content, as indx.htm does not exist anymore.Is there a better option?
Thanks
-
About three different homepages: This is my reasoning: Wordpress based website uses index.php as a home page if you use "latest posts" as your home page, Even index.php is not displayed in the url address bar. Does that mean WP sites always have 2 homepages? I have removed index.htm at the moment, but I see that as a problem as I am losing PA value for that page.
Zee: So it sounds like you should create a "dynamic" front page (this link should help you: https://codex.wordpress.org/Creating_a_Static_Front_Page). If for some reason you are unable to remove both duplicate homepages, I'd say deprecate one and 301 redirect it to your main homepage, and implement a rel=canonical tag from the duplicated homepage to your main one (e.g. example.com/index.php -301-> example.com)
I can see that this is a bit messy.. I did this as I wanted to preserve
original htm based site URL's. Are you saying that search engines see
www.example.com/dir1/dir2/page.htm and www.example.com/dir1/dir2/page as
identical URL's.Zee: I can't say for sure, without seeing your site, but if they're indexed (discoverable by search engine crawlers), you could definitely be penalized. If your .htm pages have built up link equity, you'll need to 301 redirect them to their corresponding pages
With my old htm based website I had too many subdirectories. I have
removed some of the subs to make URLs shorter and used 301 redirects in
.htaccess file. I probably should have removed .htm appendixes as well
at this stage, but I did not.
Is there a point to do 301 redirects again? 90% of my pages have been 301 redirected as I have removed some of the old subdirectories. Are you saying that I should again 301 redirect my newly 301 redirected pages?
Zee: No, that'll result in a 301 redirect chain--you should ALWAYS 301 redirect to whatever the most appropriate, final destination is (think of 301 redirects as nonstop flights, never layovers). If I 301 redirect A --> B, but then B needs to be redirected as B --> C, you must update A's to reflect this as well, A --> C, to avoid a redirect chain.
I would like to use example.com as my home page only, I am just trying to transfer PA value of my index.htm page to example.com. Not sure how to do this.
Zee: You may want to canonical this one, in that case (especially if you can confirm that these pages present the same information)--you can implement a rel=canonical on your .htm homepage that points back to your main homepage.
-
Zee, thank you for taking time to answer my questions
Hey gozmoz--I might need a little more information from you in order to help here. It sounds like you've got potentially THREE different homepages (see below). First of all, is that right?
1. example.com
2. example.com/index.htm
3. example.com/index.php
About three different homepages:
This is my reasoning: Wordpress based website uses index.php as a home page if you use "latest posts" as your home page, Even index.php is not displayed in the url address bar.
Does that mean WP sites always have 2 homepages?
I have removed index.htm at the moment, but I see that as a problem as I am losing PA value for that page.
Big red flag to me is the Wordpress plugin you're using that automatically appends .htm to the end of every URL you've transferred to your new WP CMS--why were you unable to keep the URLs as-is, without the .htm? My rec here would be to undo that, and keep the original URLs (without .htm).
I can see that this is a bit messy..
I did this as I wanted to preserve original htm based site URL's.
Are you saying that search engines see
www.example.com/dir1/dir2/page.htm and
www.example.com/dir1/dir2/page
as identical URL's.With my old htm based website I had too many subdirectories. I have removed some of the subs to make URLs shorter and used 301 redirects in .htaccess file. I probably should have removed .htm appendixes as well at this stage, but I did not.
Question 1: As far as this question goes, I do think you'll need to clarify the above before I can make a solid rec for you. Again, my preference would be for you to use your original URLs as the main ones and deprecate these .htm versions if possible. That may require you to 301 redirect the .htm versions to the non-htm counterparts.
Is there a point to do 301 redirects again? 90% of my pages have been 301 redirected as I have removed some of the old subdirectories. Are you saying that I should again 301 redirect my newly 301 redirected pages?
Question 2: what do you mean, example.com/index.htm no longer exists? Have you deleted this page entirely? If so, you may not need to do anything here. I do however, think you need to establish a singular URL to be your homepage, and see no reason you shouldn't use example.com (on its own) as your home, instead of appending /index.htm or /index.php
I would like to use example.com as my home page only, I am just trying to transfer PA value of my index.htm page to example.com. Not sure how to do this.
Regads
Gozmoz -
Hey gozmoz--I might need a little more information from you in order to help here. It sounds like you've got potentially THREE different homepages (see below). First of all, is that right?
1. example.com
2. example.com/index.htm
3. example.com/index.phpBig red flag to me is the Wordpress plugin you're using that automatically appends .htm to the end of every URL you've transferred to your new WP CMS--why were you unable to keep the URLs as-is, without the .htm? My rec here would be to undo that, and keep the original URLs (without .htm).
**Question 1: **As far as this question goes, I do think you'll need to clarify the above before I can make a solid rec for you. Again, my preference would be for you to use your original URLs as the main ones and deprecate these .htm versions if possible. That _may _require you to 301 redirect the .htm versions to the non-htm counterparts.
Question 2: what do you mean, example.com/index.htm no longer exists? Have you deleted this page entirely? If so, you may not need to do anything here. I do however, think you need to establish a singular URL to be your homepage, and see no reason you shouldn't use example.com (on its own) as your home, instead of appending /index.htm or /index.php
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Do bulk 301 redirects hurt seo value?
We are working with a content based startup that needs to 301 redirect a lot of its pages to other websites. Will give you an example to help you understand. If we assume this is the startups domain and URL structure www.ourcompany.com/brand1/article What they want to do is do a 301 redirect of www.ourcompany.com/brand1/ to www.brand1.com I have never seen 301 as a problem to SEO or link juice. But in this case where all the major URLs are getting redirected to other sites i was wondering if it would have a negative effect. Right now they have just 20-30 brands but they are planning to hit a couple of hundreds this year.
Intermediate & Advanced SEO | | aaronfernandez0 -
E-Commerce Site Collection Pages Not Being Indexed
Hello Everyone, So this is not really my strong suit but I’m going to do my best to explain the full scope of the issue and really hope someone has any insight. We have an e-commerce client (can't really share the domain) that uses Shopify; they have a large number of products categorized by Collections. The issue is when we do a site:search of our Collection Pages (site:Domain.com/Collections/) they don’t seem to be indexed. Also, not sure if it’s relevant but we also recently did an over-hall of our design. Because we haven’t been able to identify the issue here’s everything we know/have done so far: Moz Crawl Check and the Collection Pages came up. Checked Organic Landing Page Analytics (source/medium: Google) and the pages are getting traffic. Submitted the pages to Google Search Console. The URLs are listed on the sitemap.xml but when we tried to submit the Collections sitemap.xml to Google Search Console 99 were submitted but nothing came back as being indexed (like our other pages and products). We tested the URL in GSC’s robots.txt tester and it came up as being “allowed” but just in case below is the language used in our robots:
Intermediate & Advanced SEO | | Ben-R
User-agent: *
Disallow: /admin
Disallow: /cart
Disallow: /orders
Disallow: /checkout
Disallow: /9545580/checkouts
Disallow: /carts
Disallow: /account
Disallow: /collections/+
Disallow: /collections/%2B
Disallow: /collections/%2b
Disallow: /blogs/+
Disallow: /blogs/%2B
Disallow: /blogs/%2b
Disallow: /design_theme_id
Disallow: /preview_theme_id
Disallow: /preview_script_id
Disallow: /apple-app-site-association
Sitemap: https://domain.com/sitemap.xml A Google Cache:Search currently shows a collections/all page we have up that lists all of our products. Please let us know if there’s any other details we could provide that might help. Any insight or suggestions would be very much appreciated. Looking forward to hearing all of your thoughts! Thank you in advance. Best,0 -
6 .htaccess Rewrites: Remove index.html, Remove .html, Force non-www, Force Trailing Slash
i've to give some information about my website Environment 1. i have static webpage in the root. 2. Wordpress installed in sub-dictionary www.domain.com/blog/ 3. I have two .htaccess , one in the root and one in the wordpress
Intermediate & Advanced SEO | | NeatIT
folder. i want to www to non on all URLs Remove index.html from url Remove all .html extension / Re-direct 301 to url
without .html extension Add trailing slash to the static webpages / Re-direct 301 from non-trailing slash Force trailing slash to the Wordpress Webpages / Re-direct 301 from non-trailing slash Some examples domain.tld/index.html >> domain.tld/ domain.tld/file.html >> domain.tld/file/ domain.tld/file.html/ >> domain.tld/file/ domain.tld/wordpress/post-name >> domain.tld/wordpress/post-name/ My code in ROOT htaccess is <ifmodule mod_rewrite.c="">Options +FollowSymLinks -MultiViews RewriteEngine On
RewriteBase / #removing trailing slash
RewriteCond %{REQUEST_FILENAME} !-d
RewriteRule ^(.*)/$ $1 [R=301,L] #www to non
RewriteCond %{HTTP_HOST} ^www.(([a-z0-9_]+.)?domain.com)$ [NC]
RewriteRule .? http://%1%{REQUEST_URI} [R=301,L] #html
RewriteCond %{REQUEST_FILENAME} !-f
RewriteCond %{REQUEST_FILENAME} !-d
RewriteRule ^([^.]+)$ $1.html [NC,L] #index redirect
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /index.html\ HTTP/
RewriteRule ^index.html$ http://domain.com/ [R=301,L]
RewriteCond %{THE_REQUEST} .html
RewriteRule ^(.*).html$ /$1 [R=301,L]</ifmodule> The above code do 1. redirect www to non-www
2. Remove trailing slash at the end (if exists)
3. Remove index.html
4. Remove all .html
5. Redirect 301 to filename but doesn't add trailing slash at the end0 -
Several 301 Redirects to Same Page
Hi, I have 3 Pages we won't use anymore in our website. Let's call them url A, url B and url C. To keep their SEO strength on our domain, I've though about redirecting all of them to url D. For what I understand, when 301 redirecting, about 85-90% of the link SEO juice is passed. Then, if I redirect 3 URLs to the same page... does url D receive all the link SEO juices for URLs added up? (approximately)
Intermediate & Advanced SEO | | viatrading1
e.g. future url D juice = 100% current url D juice + 85% url A juice + 85% url B juice + 85% url C juice Is this the best practice, or is there a better way? Cheers,0 -
Question about moving content from one site to another without a 301
I could use a second opinion about moving content from some inactive sites to my main site. Once upon a time, we had a handful of geotargeted websites set up targeting various cities that we serve. This was in addition to our main site, which was mostly targeted to our primary office and ranked great for those keywords. Our main site has plenty of authority, has been around for ages, etc. We built out these geo-targeted sites with some good landing pages and kept them active with regularly scheduled blog posts which were unique and either interesting or helpful. Although we had a little success with these, we eventually saw the light and realized that our main site was strong enough to rank for these cities as well, which made life a whole lot easier, not to mention a lot less spammy. We've got some good content on these other sites that I'd like to use on our main site, especially the blog posts. Now that I've got it through my head that there's no such thing as a duplicate content penalty, I understand that I could just start moving this content over so long as I put a 301 redirect in place where the content used to be on these old sites. Which leads me to my question. Our SEO was careful not to have these other websites pointing to our main site to avoid looking like we were trying to do something shady from a link building perspective. His concern is that these redirects would undermine that effort and having a bunch of redirects from a half dozen sites could end up hurting us somehow. Do you think that is the case? What he is suggesting we do is remove all of the content that we'd like to use and use Webmaster Tools to request that this content be removed from the index. Then, after the sites have been recrawled, we'll check for ourselves to confirm they've been removed and proceed with using the content however we'd like. Thoughts?
Intermediate & Advanced SEO | | LeeAbrahamson0 -
How do you 301 redirect URLs with a hashbang (#!) format? We just lost a ton of pagerank because we thought javascript redirect was the only way! But other sites have been able to do this – examples and details inside
Hi Moz, Here's more info on our problem, and thanks for reading! We’re trying to Create 301 redirects for 44 pages on site.com. We’re having trouble 301 redirecting these pages, possibly because they are AJAX and have hashbangs in the URLs. These are locations pages. The old locations URLs are in the following format: www.site.com/locations/#!new-york and the new URLs that we want to redirect to are in this format: www.site.com/locations/new-york We have not been able to create these redirects using Yoast WordPress SEO plugin v.1.5.3.2. The CMS is WordPress version 3.9.1 The reason we want to 301 redirect these pages is because we have created new pages to replace them, and we want to pass pagerank from the old pages to the new. A 301 redirect is the ideal way to pass pagerank. Examples of pages that are able to 301 redirect hashbang URLs include http://www.sherrilltree.com/Saddles#!Saddles and https://twitter.com/#!RobOusbey.
Intermediate & Advanced SEO | | DA20130 -
Is it safe to redirect our .nl (netherlands) domain that we have just purchased to our .com domain?
Hi all! We've recently developed a German version of our website with German translation and now we have just purchased a .nl domain, but with this one, we want all of the copy to remain in English. Is it ok to redirect our .nl domain to our current .com website or will this give us bad SEO points? Thank you!
Intermediate & Advanced SEO | | donaldsze0 -
301 redirect from .html to non .html?
Previously our site was using this as our URL structure: www.site.com/page.html. A few months ago we updated our URL structure to this: www.site.com/page & we're not using the .html. I've read over this guide & don't see anywhere that discusses this: http://www.seomoz.org/learn-seo/redirection. I've currently got a programmer looking into, but am always a bit weary with their workarounds, as I'd previously had them cause more problems then fix it. Here is the solution he is looking to do: The way that I am doing the redirect is fine. The problem is of where to put the code. The issue is that the files are .html files that need to be redirected to the same url with out a .html on them. I can see if I can add that to the 404 redirect page if there is one inside of there and see if that does the trick. That way if there is no page that exists without the .html then it will still be a 404 page. However if it is there then it will work as normal. I will see what I can find and get back. Any help would be greatly appreciated. Thanks, BJ
Intermediate & Advanced SEO | | seointern0