Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Redirecting Canonical 301s and Magento Website
- 
					
					
					
					
 I have an issue with a client's website where it has 3700+ pages, but roughly half of them are duplicates. Thankfully, the only difference between the original and the duplictes is the "?print" at the end of each URL (I suppose this is Magento's way of making a printable page version of the same page. I don't know, I didn't build it.) My questions is, how can I get all the pages like this http://www.mycompany.com/blah.html?print to redirect to pages like this... http://www.mycompany.com/blah.html Also, do they NEED to be Canonical, or will a 301 redirect be sufficient. Also, after having done this, if anybody knows, is there a way I can turn that feature off in Magento, because we're expanding our product line, and I don't want to have to keep chasing after these "?print" pages after the fact. 
- 
					
					
					
					
 Late to this game, but just in case you're still waiting on your dev... Magento has an automated add-on system called Magento Connect, and you can access it from your admin (unless the original installer disabled it on your account). You can just use that to install Yoast's plugin. Check out http://www.magentocommerce.com/magento-connect/canonical-url-for-magento.html Aside from that, if you are using something after 1.4, you should have canonical built into your store (it's in the config section). If you're not using something after 1.4, consider upgrading. It's not painless, but anything prior to 1.4.1 is pretty rough to use. I'm not surprised you've got bugs and general sadness. 
- 
					
					
					
					
 Hehe, hey now, not all us developers are lazy  You know your system better than any of us do. My 1. and 2. are just the best-case order in which to get things done. Do what works for you and your site. 
- 
					
					
					
					
 like I said, I will email your solution concerning the plugin to my web admin guy, but the reason why I'm kind of reticent to do that is it's more a matter of bureacracy (to be sensitive to his time constraints) rather than technical or lack of know-how. I want to get it done right, but I also want to get it done in a timely manner. But I will forward this to him. Thanks you sir. 
- 
					
					
					
					
 I don't understand why you don't just use the rel="canonical" plugin I mentioned above... ? 
- 
					
					
					
					
 thank you sir....I'll try to avoid the htaccess route then. 
- 
					
					
					
					
 Yeah I guess this is the only way to go. Now I just got to get the webmaster to get around to it. (sigh)... 
- 
					
					
					
					
 Yes to your first questions. Here's the process for each (as I see it): 1. Fix/remove the ability for system to generate ?print URL's and implement canonical tags; open beer and wait 'til search engines sort things out. Nothing more you can do here. 2. No fix to system so we still have ?print URL's. In this case, setup the 301's in your .htaccess file; however, as long as the system is still generating these ?print URLs, you will have to keep the redirects in the .htaccess in tact, permanently. Untested: RewriteEngine On RewriteBase / RewriteRule ^([^/]*).html?print$ /$1.html [L,NC,R=301] 
- 
					
					
					
					
 I guess my question now is, just doing the htaccess route is a bad idea? in both 301s AND canonicals or just 301s? I guess I'm not looking for easy, but economical. Thanks for your responses. 
- 
					
					
					
					
 I agree with THB on this, use rel="canonical" you simply want to say to the search engines "Hey this is the preferred URL for my content". 301's are for saying "Hey this page has permanently moved to a new page/site" I would use the rel="canonical" plugin I posted for you above, it will automatically add the canonical tag for you, job done. 
- 
					
					
					
					
 Yeah, I know, right. The problem is, I found this out only AFTER I bought the PRO version and mapped out the entirety of the site. Some of those ?print URLs are now indexed in the SEs. So I agree with nipping this problem in the bud (or the root, whichever one prefers), but I still need to know how to do it via the htaccess. In other words, I have to go backwards and take care of the rankings, THEN figure out how to turn it off (and I can go to the Magento forums for that). 
- 
					
					
					
					
 So, assuming this works, can I eventually remove the ".html?print" versions of the pages after the SEs have changed the URLs in their indices? also, I'm not getting the impression it's going to save me time on specifying different pages (it may save time, I'm not sure), but in the chance it doesn't or the plugin fails to do as advertised, what is the htaccess option for this? Because at least, in this case, I can see the immediacy in it AND you can do canonical rewrites FROM the htaccess. My situation is, I'm not THAT advanced in wildcards to make this happen (otherwise, I'd do it myself via just trial and error until it works) AND I myself don't have access to the site (the webmaster does, whose part time) and I have a choice between "Hey, here's several (or one line) of code to put into htaccess to resolve this problem" OR "can you go through and implement this plug to do the Canonical redirects on every page, oh and by the way, please back up first." So it's not merely JUST a technical problem or a know-how problem, it's also a bureaucratic problem that can mean the difference between getting it done in a few minutes, and it could take two weeks to make happen depending on this person's perception. 
- 
					
					
					
					
 Just to clarify. If there is in fact no difference between the pages (as you originally stated), then please just use the canonical tag. As much as you might want to setup 301 redirects, they would not be the way to go in this case. Trust me. Otherwise, here is what I would do, honestly: find out why the ?print is causing information to be displayed improperly in some cases. If it's accessing the same db tables using the same queries, then that shouldn't be happening. I'd fix that, and implement the canonical tags, and wait it out. That would be the easiest approach and most beneficial with the least impact to your site and any rankings. If this is something your not capable of fixing (not sure if you're proficient with coding, etc), then you can setup 301's as a 'hack', but they should not be left in permanently as the process in which I stated just above is the best way to resolve the issue. In order to assist you with any .htaccess markup, you'll need to provide some examples of your URL's, and whether they have any common identifiers. 
- 
					
					
					
					
 OK - I was basing my answers on what you said, "the only difference between the original and the duplictes is the "?print" at the end of each URL" If there is in fact different content on each page, and the ?print page is the one with the errors, then you should remove the ability for ?print pages to even be generated in the first place instead of having them constantly redirecting user/bots. Forever 301 re-directions can hurt you down the road. Once you've removed the ability for users/bots to find and access the ?print pages, then setup the 301's and insert the canonical tags. 
- 
					
					
					
					
 It looks like this plugin will add the rel="canonical" tag for you automatically: http://yoast.com/tools/magento/canonical/ View the source code after you have installed it to confirm it's working as expected. 
- 
					
					
					
					
 I was answering this Kjay's response while this one was coming down the pipeline. I get you on the fact if they were TRULY identical, but the reason I wanted to do the 301s is because more than likely, the Magento engine is faulty, and I've found situaitons where the prices are different between the two versions, or the images and text haven't been updated, etc. etc., hence, the need for a 301 
- 
					
					
					
					
 I guess my question is, if I use JUST the canonical, then the SEs will get around to changing the address, but will still go to the "?print" pages until that time. Also, the Magento help aide on that said I had to do it with EACH individual page. It's going to be especially time-consuming to have to go back out into the admin, go back into the admin, and check to see EVERY time the page that I'm canonizing is the right html version. I think this is where accessing the htaccess will save me a bunch of time (I still have to change the title tags on the remaining original 1500 pages, as well as find out from the Magento site, to access the H1 tags in the templates). If I use the basic 301 redirect, I get the benefit of the immediate redirect, but I fail to see the downside of having to "endure" the 301 redirect other than additional rules for the browser to access the server. I eventually want to get RID of these "?print" pages because I'm getting the feeling that prices won't update as reliably on the ".html?print" version of the pages, update images (which we HAVE had trouble with in the past) etc. etc. And there's also the possibility that people may still access those ?print pages even if I did just do a canonical. It's just better to admin and SEO 1500+ pages as opposed to 1500+ pages and their duplicates. I guess, what I'm looking for is, more than likely, the syntax command that's going to include a wildcard function to accomodate everything between "http://www.mycompany.com/" and ".html?print" or ".html". What would that look like? 
- 
					
					
					
					
 Agreed  
- 
					
					
					
					
 Ya, this is what I was talking about. Just a standard canonical html tag inserted into the framework. That will clear everything up for you (might take a wee bit, but Google will recognize it right away). No need for .htaccess whatsoever since the content is identical. If the content were different, ie. the ?print page showed a completely different style format, then sure, setup some 301's to get the user's to the right page. But not needed for your situation. 
- 
					
					
					
					
 I would just add: rel="canonical" href="http://www.mycompany.com/blah.html" /> No need to add 301's. This might be useful it's Magento specific: http://www.magentocommerce.com/wiki/adding_canonical_url_to_cms_pages 
- 
					
					
					
					
 Okay, so if I were in the htaccess file, what would it look like? would it be a Query string rewrite? RewriteEngine On _ RewriteBase /_ RewriteCond %{QUERY_STRING} ^(*)$ RewriteRule ^()html?print$ http://www.mycompany.com/()html$ [R=301] or just a straight one line redirect Redirect 301 /()html?print http://www.mycompany.com/().html 
- 
					
					
					
					
 Canonical will suffice as it is basically a 301 anyways. Cleaner too as there is no actual redirect for the user, or bot, to endure. You can also set it up in Google Webmaster Tools under 'Site Configuration > URL Parameters' to ignore that parameter; however, using the canonical tag will more than suffice in this case. 
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
- 
		
		Moz ToolsChat with the community about the Moz tools. 
- 
		
		SEO TacticsDiscuss the SEO process with fellow marketers 
- 
		
		CommunityDiscuss industry events, jobs, and news! 
- 
		
		Digital MarketingChat about tactics outside of SEO 
- 
		
		Research & TrendsDive into research and trends in the search industry. 
- 
		
		SupportConnect on product support and feature requests. 
Related Questions
- 
		
		
		
		
		
		301 Redirect in breadcrumb. How bad is it?
 Hi all, How bad is it to have a link in the breadcrumb that 301 redirects? We had to create some hidden category pages in our ecommerce platform bigcommerce to create a display on our category pages in a certain format. Though whilst the category page was set to not visable in bigcommerce admin the URL still showed in the live site bread crumb. SO, we set a 301 redirect on it so it didnt produce a 404. However we have lost a lot of SEO ground the past few months. could this be why? is it bad to have a 301 redirect in the breadrcrumb. Intermediate & Advanced SEO | | oceanstorm0
- 
		
		
		
		
		
		If my website do not have a robot.txt file, does it hurt my website ranking?
 After a site audit, I find out that my website don't have a robot.txt. Does it hurt my website rankings? One more thing, when I type mywebsite.com/robot.txt, it automatically redirect to the homepage. Please help! Intermediate & Advanced SEO | | binhlai0
- 
		
		
		
		
		
		Fast/Easy Way to Implement Canonical tags in Bulk in Magento CMS?
 Hello Amazing SEO Community! Quick Q for a client with a TON of duplicate content. (yikes!) My client is currently undertaking a large SEO project around canonical tagging for their thousands of duplicate pages. Currently, one product sits on multiple URLs and they are being indexed as different pages (with the same content). The issue is found across all products and other pages, and across their international sites as well. One core challenge they face now is lack of time/resources from their developer side. The solution we see to the duplicate content is to manually add a canonical tag to each of our tens of thousands of pages. Their content management system is Magento. Has anyone ever tackled canonicalization for a large site that uses Magento? Any more efficient solutions to manual tagging is ideal. Thanks in advance for your input. -Bonnie Intermediate & Advanced SEO | | accpar0
- 
		
		
		
		
		
		Blog subdomain not redirecting
 Over the last few weeks I have been focused on fixing high and medium priority issues, as reported by the Moz crawler, after a recent transition to WordPress. I've made great progress, getting the high priority issues down from several hundred (various reasons, but many duplicates for things like non-www and www versions) to just five last week. And then there's this weeks report. For reasons I can't fathom, I am suddenly getting hundreds of duplicate content pages of the form http://blog.<domain>.com</domain> (being duplicates with the http://www.<domain>.com</domain> versions). I'm really unclear on why these suddenly appeared. I host my own WordPress site ie WordPress.org stuff. In Options / General everything refers to http://www.<domain>.com</domain> and has done for a number of weeks. I have no idea why the blog versions of the pages have suddenly appeared. FWIW, the non-www version of my pages still redirect to the www version, as I would expect. I'm obviously pretty concerned by this so any pointers greatly appreciated. Thanks. Mark Intermediate & Advanced SEO | | MarkWill0
- 
		
		
		
		
		
		How important is the optional <priority>tag in an XML sitemap of your website? Can this help search engines understand the hierarchy of a website?</priority>
 Can the <priority>tag be used to tell search engines the hierarchy of a site or should it be used to let search engines know which priority to we want pages to be indexed in?</priority> Intermediate & Advanced SEO | | mycity4kids0
- 
		
		
		
		
		
		Redirect at Registrar or Server
 Hi folks, I have run into a situation were a new client has 3 TLDs (e.g. mycompany.com, mycompany.org and mycompany.biz), all with the same content. They are on a Windows IIS environment, which I am not familiar with. Until now, all of my clients have been Linux/Apache environment, so I always dealt with these issues utilizing htaccess. Currently all resolve to the same IP, but the URL remains the same in the browser address field (e.g. if you type-in mycompany.org - it remains as such). We want the .org and .biz version to 301 Redirect to the .com TLD. I am wondering what the best practice might be in this situation? Could we simply redirect at the registrar level or would implementation at the server level be best? If so, I would really appreciate an example from someone with experience implementing redirects on IIS. Thank you! Intermediate & Advanced SEO | | SCW0
- 
		
		
		
		
		
		Wildcard Redirects & Canonical Tags
 I have an interesting situation. Current URLs Example1: www.domain.com/red-widgets-cid-1234.html Intermediate & Advanced SEO | | NakulGoyal
 www.domain.com/red-widgets-cid-1234-1.html
 www.domain.com/red-widgets-cid-1234-1-1.html Canonical on All Above URLs:
 www.domain.com/red-widgets-cid-1234.html New URL:
 www.domain.com/red-widgets-cid-4567.html Current URLs Example2: www.domain.com/red-widgets-cid-1234+10.html
 www.domain.com/red-widgets-cid-1234+10-1.html
 www.domain.com/red-widgets-cid-1234+10-1-1.html Canonical on All Above URLs:
 www.domain.com/red-widgets-cid-1234+10.html New URL:
 www.domain.com/red-widgets-cid-6789.html Current URLs Example3: www.domain.com/red-widgets-cid-1234+10+5.html
 www.domain.com/red-widgets-cid-1234+10+5-1.html
 www.domain.com/red-widgets-cid-1234+10+5-1-1.html Canonical on All Above URLs:
 www.domain.com/red-widgets-cid-1234+10+5.html New URL:
 www.domain.com/american-red-widgets-cid-6789+5.html I want to make sure all variations of the above URL redirect to the new URLs. However, as you see in Example 3, we are dealing with variables that are passed on. (+5 in this case). Question 1: What wildcard 301 redirect / regular expression can I use to tackle these ? Question 2: If we redirect www.domain.com/red-widgets-cid-1234+10+5.html to www.domain.com/red-widgets-cid-6789+5.html and www.domain.com/red-widgets-cid-6789+5.html contains the canonical tag www.domain.com/american-red-widgets-cid-6789+5.html, any concerns or red flags here ?0
- 
		
		
		
		
		
		Splitting one Website into 2 Different New Websites with 301 redirects, help?
 Here's the deal. My website stbands.com does fairly well. The only issue it is facing a long term branding crisis. It sells custom products and sporting goods. We decided that we want to make a sporting goods website for the retail stuff and then a custom site only focusing on the custom stuff. One website transformed and broken into 2 new ones, with two new brand names. The way we are thinking about doing this is doing a lot of 301 redirects, but what do we do with the homepage (stbands.com) and what is the best practice to make sure we don't lose traffic to the categories, etc.? Which new website do we 301 the homepage to? It's rough because for some keywords we rank 3 or 4 times on the first page. Scary times, but something must be done for the long term. Any advise is greatly appreciated. Thank you in advance. We are set for a busy next few months 🙂 Intermediate & Advanced SEO | | Hyrule0
 
			
		 
			
		 
			
		 
			
		 
					
				 
					
				 
					
				 
					
				 
					
				 
					
				 
					
				