Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Moving html site to wordpress and 301 redirect from index.htm to index.php or just www.example.com
-
I found page duplicate content when using Moz crawl tool, see below.
http://www.example.com
Page Authority 40
Linking Root Domains 31
External Link Count 138
Internal Link Count 18
Status Code 200
1 duplicatehttp://www.example.com/index.htm
Page Authority 19
Linking Root Domains 1
External Link Count 0
Internal Link Count 15
Status Code 200
1 duplicateI have recently transfered my old html site to wordpress.
To keep the urls the same I am using a plugin which appends .htm at the end of each page.My old site home page was index.htm. I have created index.htm in wordpress as well but now there is a conflict of duplicate content. I am using latest post as my home page which is index.php
Question 1.
Should I also use redirect 301 im htaccess file to transfer index.htm page authority (19) to www.example.comIf yes, do I use
Redirect 301 /index.htm http://www.example.com/index.php
or
Redirect 301 /index.htm http://www.example.comQuestion 2
Should I change my "Home" menu link to http://www.example.com instead of http://www.example.com/index.htm that would fix the duplicate content, as indx.htm does not exist anymore.Is there a better option?
Thanks
-
About three different homepages: This is my reasoning: Wordpress based website uses index.php as a home page if you use "latest posts" as your home page, Even index.php is not displayed in the url address bar. Does that mean WP sites always have 2 homepages? I have removed index.htm at the moment, but I see that as a problem as I am losing PA value for that page.
Zee: So it sounds like you should create a "dynamic" front page (this link should help you: https://codex.wordpress.org/Creating_a_Static_Front_Page). If for some reason you are unable to remove both duplicate homepages, I'd say deprecate one and 301 redirect it to your main homepage, and implement a rel=canonical tag from the duplicated homepage to your main one (e.g. example.com/index.php -301-> example.com)
I can see that this is a bit messy.. I did this as I wanted to preserve
original htm based site URL's. Are you saying that search engines see
www.example.com/dir1/dir2/page.htm and www.example.com/dir1/dir2/page as
identical URL's.Zee: I can't say for sure, without seeing your site, but if they're indexed (discoverable by search engine crawlers), you could definitely be penalized. If your .htm pages have built up link equity, you'll need to 301 redirect them to their corresponding pages
With my old htm based website I had too many subdirectories. I have
removed some of the subs to make URLs shorter and used 301 redirects in
.htaccess file. I probably should have removed .htm appendixes as well
at this stage, but I did not.
Is there a point to do 301 redirects again? 90% of my pages have been 301 redirected as I have removed some of the old subdirectories. Are you saying that I should again 301 redirect my newly 301 redirected pages?
Zee: No, that'll result in a 301 redirect chain--you should ALWAYS 301 redirect to whatever the most appropriate, final destination is (think of 301 redirects as nonstop flights, never layovers). If I 301 redirect A --> B, but then B needs to be redirected as B --> C, you must update A's to reflect this as well, A --> C, to avoid a redirect chain.
I would like to use example.com as my home page only, I am just trying to transfer PA value of my index.htm page to example.com. Not sure how to do this.
Zee: You may want to canonical this one, in that case (especially if you can confirm that these pages present the same information)--you can implement a rel=canonical on your .htm homepage that points back to your main homepage.
-
Zee, thank you for taking time to answer my questions
Hey gozmoz--I might need a little more information from you in order to help here. It sounds like you've got potentially THREE different homepages (see below). First of all, is that right?
1. example.com
2. example.com/index.htm
3. example.com/index.php
About three different homepages:
This is my reasoning: Wordpress based website uses index.php as a home page if you use "latest posts" as your home page, Even index.php is not displayed in the url address bar.
Does that mean WP sites always have 2 homepages?
I have removed index.htm at the moment, but I see that as a problem as I am losing PA value for that page.
Big red flag to me is the Wordpress plugin you're using that automatically appends .htm to the end of every URL you've transferred to your new WP CMS--why were you unable to keep the URLs as-is, without the .htm? My rec here would be to undo that, and keep the original URLs (without .htm).
I can see that this is a bit messy..
I did this as I wanted to preserve original htm based site URL's.
Are you saying that search engines see
www.example.com/dir1/dir2/page.htm and
www.example.com/dir1/dir2/page
as identical URL's.With my old htm based website I had too many subdirectories. I have removed some of the subs to make URLs shorter and used 301 redirects in .htaccess file. I probably should have removed .htm appendixes as well at this stage, but I did not.
Question 1: As far as this question goes, I do think you'll need to clarify the above before I can make a solid rec for you. Again, my preference would be for you to use your original URLs as the main ones and deprecate these .htm versions if possible. That may require you to 301 redirect the .htm versions to the non-htm counterparts.
Is there a point to do 301 redirects again? 90% of my pages have been 301 redirected as I have removed some of the old subdirectories. Are you saying that I should again 301 redirect my newly 301 redirected pages?
Question 2: what do you mean, example.com/index.htm no longer exists? Have you deleted this page entirely? If so, you may not need to do anything here. I do however, think you need to establish a singular URL to be your homepage, and see no reason you shouldn't use example.com (on its own) as your home, instead of appending /index.htm or /index.php
I would like to use example.com as my home page only, I am just trying to transfer PA value of my index.htm page to example.com. Not sure how to do this.
Regads
Gozmoz -
Hey gozmoz--I might need a little more information from you in order to help here. It sounds like you've got potentially THREE different homepages (see below). First of all, is that right?
1. example.com
2. example.com/index.htm
3. example.com/index.phpBig red flag to me is the Wordpress plugin you're using that automatically appends .htm to the end of every URL you've transferred to your new WP CMS--why were you unable to keep the URLs as-is, without the .htm? My rec here would be to undo that, and keep the original URLs (without .htm).
**Question 1: **As far as this question goes, I do think you'll need to clarify the above before I can make a solid rec for you. Again, my preference would be for you to use your original URLs as the main ones and deprecate these .htm versions if possible. That _may _require you to 301 redirect the .htm versions to the non-htm counterparts.
Question 2: what do you mean, example.com/index.htm no longer exists? Have you deleted this page entirely? If so, you may not need to do anything here. I do however, think you need to establish a singular URL to be your homepage, and see no reason you shouldn't use example.com (on its own) as your home, instead of appending /index.htm or /index.php
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Is there any benefit to changing 303 redirects to 301?
A year ago I moved my marketplace website from http to https. I implemented some design changes at the same time, and saw a huge drop in traffic that we have not recovered from. I've been searching for reasons for the organic traffic decline and have noticed that the redirects from http to https URLs are 303 redirects. There's little information available about 303 redirects but most articles say they don't pass link juice. Is it worth changing them to 301 redirects now? Are there risks in making such a change a year later, and is it likely to have any benefits for rankings?
Intermediate & Advanced SEO | | MAdeit0 -
301 redirect hops from non-https and www
It's best practice to minimize the amount of 301 redirect hops. Ideally only one redirect hop. It's also best practice to 301 redirect (or at least canonical) your non-https and/or your non-www (or www) to the canonical protocol/subdomain. The simplest (and possibly the most common) way to implement canonical protocol/subdomain redirects is through a load balancer or before your app processes the request. Both of which will just blanket 301 to the canonical domain/protocol regardless if the path exists or not In which case, you could have: Two hops. i.e. hop #1 http://example.com/foo to https://example.com/foo, hop #2 https://example.com/foo to https://example.com/bar 301 to a 404. Let's say https://example.com/dog never existed, but somebody for whatever reason linked to it (maybe a typo). If I request https://www.example.com/dog, the load balancer would 301 to a 404 page. Either scenario above should be fairly rare. However, you can't control how people link to you. Should I care about either above scenario? I could have my app attempt to check if the page exists before forwarding, but that code could be complicated.
Intermediate & Advanced SEO | | dsbud0 -
E-Commerce Site Collection Pages Not Being Indexed
Hello Everyone, So this is not really my strong suit but I’m going to do my best to explain the full scope of the issue and really hope someone has any insight. We have an e-commerce client (can't really share the domain) that uses Shopify; they have a large number of products categorized by Collections. The issue is when we do a site:search of our Collection Pages (site:Domain.com/Collections/) they don’t seem to be indexed. Also, not sure if it’s relevant but we also recently did an over-hall of our design. Because we haven’t been able to identify the issue here’s everything we know/have done so far: Moz Crawl Check and the Collection Pages came up. Checked Organic Landing Page Analytics (source/medium: Google) and the pages are getting traffic. Submitted the pages to Google Search Console. The URLs are listed on the sitemap.xml but when we tried to submit the Collections sitemap.xml to Google Search Console 99 were submitted but nothing came back as being indexed (like our other pages and products). We tested the URL in GSC’s robots.txt tester and it came up as being “allowed” but just in case below is the language used in our robots:
Intermediate & Advanced SEO | | Ben-R
User-agent: *
Disallow: /admin
Disallow: /cart
Disallow: /orders
Disallow: /checkout
Disallow: /9545580/checkouts
Disallow: /carts
Disallow: /account
Disallow: /collections/+
Disallow: /collections/%2B
Disallow: /collections/%2b
Disallow: /blogs/+
Disallow: /blogs/%2B
Disallow: /blogs/%2b
Disallow: /design_theme_id
Disallow: /preview_theme_id
Disallow: /preview_script_id
Disallow: /apple-app-site-association
Sitemap: https://domain.com/sitemap.xml A Google Cache:Search currently shows a collections/all page we have up that lists all of our products. Please let us know if there’s any other details we could provide that might help. Any insight or suggestions would be very much appreciated. Looking forward to hearing all of your thoughts! Thank you in advance. Best,0 -
Redirected Old Pages Still Indexed
Hello, we migrated a domain onto a new Wordpress site over a year ago. We redirected (with plugin: simple 301 redirects) all the old urls (.asp) to the corresponding new wordpress urls (non-.asp). The old pages are still indexed by Google, even though when you click on them you are redirected to the new page. Can someone tell me reasons they would still be indexed? Do you think it is hurting my rankings?
Intermediate & Advanced SEO | | phogan0 -
301 Redirect of subdomain?
Fellow Mozzers, I'm having a hard time wrapping my brain around a redirect issue and thought it was worth posing the question to the Moz community. I did a search first but couldn't find the exact answer I was looking for. How does a 301 redirect work when you redirect a sub domain example.homepage.com to www.homepage.com but you keep the sub directories of example.homepage.com/page-1 active and are trying to rank them? I'm dealing with a current project where this is happening and this doesn't make sense to me, to redirect the subdomain if you're also trying to rank/create search traffic for pages, sub directories on example.homepage.com. This also get's into the debate of if a sub domain site is viewed as it's own website and therefore has to rank itself. If this is true, it seems like we're kind of killing the authority of the site by redirecting it. Additionally, www.homepage.com has a much stronger link profile than example.homepage.com I hope this makes sense. Any thoughts are appreciated. Thanks for your time.
Intermediate & Advanced SEO | | SMG-Texas0 -
Remove URLs that 301 Redirect from Google's Index
I'm working with a client who has 301 redirected thousands of URLs from their primary subdomain to a new subdomain (these are unimportant pages with regards to link equity). These URLs are still appearing in Google's results under the primary domain, rather than the new subdomain. This is problematic because it's creating an artificial index bloat issue. These URLs make up over 90% of the URLs indexed. My experience has been that URLs that have been 301 redirected are removed from the index over time and replaced by the new destination URL. But it has been several months, close to a year even, and they're still in the index. Any recommendations on how to speed up the process of removing the 301 redirected URLs from Google's index? Will Google, or any search engine for that matter, process a noindex meta tag if the URL's been redirected?
Intermediate & Advanced SEO | | trung.ngo0 -
.htaccess 301 Redirect Help! Specific Redirects and Blanket Rule
Hi there, I have the following domains: OLD DOMAIN: domain1.co.uk NEW DOMAIN: domain2.co.uk I need to create a .htaccess file that 301 redirects specific, individual pages on domain1.co.uk to domain2.co.uk I've searched for hours to try and find a solution, but I can't find anything that will do what I need. The pages on domain1.co.uk are all kinds of filenames and extensions, but they will be redirected to a Wordpress website that has a clean folder structure. Some example URL's to be redirected from the old website: http://www.domain1.co.uk/charitypage.php?charity=357 http://www.domain1.co.uk/adopt.php http://www.domain1.co.uk/register/?type=2 These will need to be redirected to the following URL types on the new domain: http://www.domain2.co.uk/charities/ http://www.domain2.co.uk/adopt/ http://www.domain2.co.uk/register/ I would also like a blanket/catch-all redirect from anything else on www.domain1.co.uk to the homepage of www.domain2.co.uk if there isn't a specific individual redirect in place. I'm literally tearing my hair out with this, so any help would be greatly appreciated! Thanks
Intermediate & Advanced SEO | | Townpages0 -
Wordpress blog in a subdirectory not being indexed by Google
HI MozzersIn my websites sitemap.xml, pages are listed, such as /blog/ and /blog/textile-fact-or-fiction-egyptian-cotton-explained/These pages are visible when you visit them in a browser and when you use the Google Webmaster tool - Fetch as Google to view them (see attachment), however they aren't being indexed in Google, not even the root directory for the blog (/blog/) is being indexed, and when we query:site: www.hilden.co.uk/blog/ It returns 0 results in Google.Also note that:The Wordpress installation is located at /blog/ which is a subdirectory of the main root directory which is managed by Magento. I'm wondering if this causing the problem.Any help on this would be greatly appreciated!AnthonyToTOHuj.png?1
Intermediate & Advanced SEO | | Tone_Agency0