Magento Layered Navigation & Duplicate Content
-
Hello Dear SeoMoz,
I would like to ask your help with something that I am not sure off. Our ecommerce web site is built with Magento. I have found many problems so far and I know that there will be many more in the future. Currently, I am trying to find the best way to deal with the duplicate content that is produced from the layered navigation (size, gender etc). I have done a lot of research so far in order to understand which might be the best practice and I found the following practices:
- **Block layered navigation URLSs from the Google Webmaster Tools (**Apparently this works for Google Only).
- Block these URLs with the robots.txt file
- Make links no-follow
- **Make links JavaScript from Magento ***
- Avoid including these links in the xml site map.
- Avoid including these link in the A-Z Product Index.
- Canonical tag
- Meta Tags (noindex, nofollow)
Question
If I turn the layered navigation links into JavaScript links from the Magento Admin, the layered navigation links are still found by the crawlers but they look like that:
|
instead of:
http://www.mysite.com/girls-basics.html?gender_filte...
|
Can these new URLS (http://www.mysite.com/# ) solve the duplicate content problems with the layered navigation or do I need to implement other practices too to make sure that everything is done right.
Kind Regards
Stefanos Anastasiadis
-
I'm not sure if you guys found a solution to this but I've used Mageworx with my Magento sites and it seems to handle everything I need. I do have to do some Mod-rewrites but nothing too much for a developer to handle.
-
From what I can gather about Magento is the Layered Nav can create seemingly endless URL's. Even if you were to use one of the modules created to make them 'friendly', you would still technically have reems of duplicate pages...right? All nicely re-written but effectively with the same titles and meta...
You may be able to put a wildcard disallow in the robots file for the parameter 'dir=' , which is associated with all the filters. I dont know how well this will work or if Google may on occasision ignore this or find a way into the layered pages anyway? Does anyone know? What if the spider entered the site through a direct link to filtered page...would the robots.txt file go by the way side in this instance?
You could in theory also use WMT to dictate that Google does not index pages with the 'dir=' parameter. Again, I am not sure as to the success rate using this.
Its one of those areas that has many open and unaswered discussions but nothing definitive anywhere to address the issue. Yet Magento is very popular and as you look at people sites who use it you can see they have some how found a way to sort this out. Id love to be a fly on the wall in their office!
-
Stefanos:
Hi! Did you ever find an answer to this question? I have a Magento install as well and need some advanced technical SEO. Are you working with a Magento consultant at all?
Thanks!
Lynn
-
Thanks a lot for your reply. I already know this extension but it is not what I am looking for.
-
I don't know if you stumble upon this Extension,
but it may resolve your problems.
http://www.magentocommerce.com/magento-connect/EcommerceTeam/extension/4420/layered_navigation_seo
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google AMP or CDN?
Hello. I'm running a CMS that cannot currently support both CDN and Google AMP. I would have to choose one or the other. Does anyone have any insight on which may be the better choice until I can figure out how to have both? I installed CDN first to reduce the time it took for my pages/images to load. I'd like to have AMP because it can do the same, and perhaps be a little more Google friendly (their product). I would appreciate any thoughts. Thanks! Steve
On-Page Optimization | | recoil0 -
Duplicate products - is this fix acceptable?
Hey Mozzers, Questions around this have been asked time and time again. But i have a specific example I would like some advice on. I have 2 products, Product 1: https://goo.gl/Gzo1WC
On-Page Optimization | | ATP
Product 2: https://goo.gl/VbrHQJ As you can see, the products are almost identical bar some technical specifications. The owner of the business wants them listing as 2 products, combining them into a single listing with configurable options is not an option. As such I have simply made one a canonical of the other. Whilst not ideal this seems to be the best "SEO" fix. Option 2: My second option is to rewrite the descriptions to they are different - not too hard on this product and a future options when i have more time, however.... I am presented with a similar problem for another product where there are 23 versions of the same product, i cannot rewrite the same info this many times. They are different sizes, ranges, capacities, resolutions and accuracies and must be listed separately but contain all the same features and basic product information. The basic info is too important not to talk about, and talking about all the technical specs would be too much and teaching the customers likely to buy them to suck eggs. As such I have taken the 23 products and broken them down into 5 similar groups of 2 to 6 products. I have then picked 1 product from each group and written a unique description and changed all similar products in its group to match choosing 1product in each group as the canonical for all the others. So 23 same products become 5 unique products with 18 duplicated products pointing to them as canonicals. Any product pointing to another only differs in technical info, 95% of the page is the same. Whilst obviously not ideal, Is this an acceptable use of canonicals?0 -
Content in Tabs
I speed read an article recently and forgot to save it regarding Contents on a page in tabs. Is it correct that now Google is rendering the entire page it's better not to have content in tabs hidden by Javascript? As it stands at the moment, we've got the tabs set-up so that the main part of the page containing the keyword rich text is in a tab and not the first thing presented to the user
On-Page Optimization | | Ham19790 -
"Issue: Duplicate Page Content " in Crawl Diagnostics - but these pages are noindex
Saw an issue back in 2011 about this and I'm experiencing the same issue. http://moz.com/community/q/issue-duplicate-page-content-in-crawl-diagnostics-but-these-pages-are-noindex We have pages that are meta-tagged as no-everything for bots but are being reported as duplicate. Any suggestions on how to exclude them from the Moz bot?
On-Page Optimization | | Deb_VHB0 -
Duplicate Content Issues with Forum
Hi Everyone, I just signed up last night and received the crawl stats for my site (ShapeFit.com). Since April of 2011, my site has been severely impacted by Google's Panda and Penguin algorithm updates and we have lost about 80% of our traffic during that time. I have been trying to follow the guidelines provided by Google to fix the issues and help recover but nothing seems to be working. The majority of my time has been invested in trying to add content to "thin" pages on the site and filing DMCA notices for copyright infringement issues. Since this work has not produced any noticeable recovery, I decided to focus my attention on removing bad backlinks and this is how I found SEOmoz. My question is about duplicate content. The crawl diagnostics showed 6,000 errors for duplicate page content and the same for duplicate page title. After reviewing the details, it looks like almost every page is from the forum (shapefit.com/forum). What's the best way to resolve these issues? Should I completely block the "forum" folder from being indexed by Google or is there something I can do within the forum software to fix this (I use phpBB)? I really appreciate any feedback that would help fix these issues so the site can hopefully start recovering from Panda/Penguin. Thank you, Kris
On-Page Optimization | | shapefit0 -
E-commerce site product descriptions and duplicate content
Hi everyone. I'm developing an e-commerce site using Prestashop and concerned about the issue of duplicate content among product descriptions. My main concerns are: If there are 500 or more products and those product descriptions are obtained from a manufacturer or supplier's website hence running into external duplicate content issues. Internal duplicate content is also an issue, if there are multiple similar products and each product has the same description across several pages. What would be the best approach to eliminate the possibility of incurring a duplicate content penalty due to similar product descriptions? I've already considered the suggestion of noindex-ing the complete range of products to help protect from duplicate content penalties and having unique articles written in the site blog discussing products instead linking to certain products on the site. Another consideration I had was noindex-ing all product pages except pages for featured products in the store and rewriting descriptions for a set amount of those featured products regularly (this will still have the problem of internal duplicate content across pages if similar product descriptions are rewritten). The product range is intended to be very large so I'm really seeking an alternative solution from the insane task of rewriting many product descriptions. Any suggestions to make SEO work efficient are very much welcome and appreciated. Thank you!
On-Page Optimization | | valuepets0 -
Duplicated Content Column in excel
I'd like to see all duplicated content URLs in excel. But when I do the export to csv, and then use text to columns, I end up with an empty duplicated content column. The URLs should be in column AF in excel, but this column is empty. Can somebody help me on this?
On-Page Optimization | | jdclerck0 -
Website Content
Is it bad to have html pages on a blog? I converted a completely HTML site to wordpress, but havd hundreds of article pages that are still html.
On-Page Optimization | | azguy0