Magento Layered Navigation & Duplicate Content
-
Hello Dear SeoMoz,
I would like to ask your help with something that I am not sure off. Our ecommerce web site is built with Magento. I have found many problems so far and I know that there will be many more in the future. Currently, I am trying to find the best way to deal with the duplicate content that is produced from the layered navigation (size, gender etc). I have done a lot of research so far in order to understand which might be the best practice and I found the following practices:
- **Block layered navigation URLSs from the Google Webmaster Tools (**Apparently this works for Google Only).
- Block these URLs with the robots.txt file
- Make links no-follow
- **Make links JavaScript from Magento ***
- Avoid including these links in the xml site map.
- Avoid including these link in the A-Z Product Index.
- Canonical tag
- Meta Tags (noindex, nofollow)
Question
If I turn the layered navigation links into JavaScript links from the Magento Admin, the layered navigation links are still found by the crawlers but they look like that:
|
instead of:
http://www.mysite.com/girls-basics.html?gender_filte...
|
Can these new URLS (http://www.mysite.com/# ) solve the duplicate content problems with the layered navigation or do I need to implement other practices too to make sure that everything is done right.
Kind Regards
Stefanos Anastasiadis
-
I'm not sure if you guys found a solution to this but I've used Mageworx with my Magento sites and it seems to handle everything I need. I do have to do some Mod-rewrites but nothing too much for a developer to handle.
-
From what I can gather about Magento is the Layered Nav can create seemingly endless URL's. Even if you were to use one of the modules created to make them 'friendly', you would still technically have reems of duplicate pages...right? All nicely re-written but effectively with the same titles and meta...
You may be able to put a wildcard disallow in the robots file for the parameter 'dir=' , which is associated with all the filters. I dont know how well this will work or if Google may on occasision ignore this or find a way into the layered pages anyway? Does anyone know? What if the spider entered the site through a direct link to filtered page...would the robots.txt file go by the way side in this instance?
You could in theory also use WMT to dictate that Google does not index pages with the 'dir=' parameter. Again, I am not sure as to the success rate using this.
Its one of those areas that has many open and unaswered discussions but nothing definitive anywhere to address the issue. Yet Magento is very popular and as you look at people sites who use it you can see they have some how found a way to sort this out. Id love to be a fly on the wall in their office!
-
Stefanos:
Hi! Did you ever find an answer to this question? I have a Magento install as well and need some advanced technical SEO. Are you working with a Magento consultant at all?
Thanks!
Lynn
-
Thanks a lot for your reply. I already know this extension but it is not what I am looking for.
-
I don't know if you stumble upon this Extension,
but it may resolve your problems.
http://www.magentocommerce.com/magento-connect/EcommerceTeam/extension/4420/layered_navigation_seo
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content errors
I have multiple duplicate content errors in my crawl diagnostics. The problem is though that i already took care of these problems with the canonical tag but MOZ keeps saying there is a problem. For example this page http://www.letspump.dk/produkter/56-aminosyre/ has a canonical tag, but moz still says it has an error. Why is that?
On-Page Optimization | | toejklemme0 -
Duplicate Content when Using "visibility classes" in responsive design layouts? - a SEO-Problem?
I have text in the right column of my responsive layout which will show up below the the principal content on small devices. To do this I use visibility classes for DIVs. So I have a DIV with with a unique style text that is visible only on large screen sizes. I copied the same text into another div which shows only up only on small devices while the other div will be hidden in this moment. Technically I have the same text twice on my page. So this might be duplicate content detected as SPAM? I'm concerned because hidden text on page via expand-collapsable textblocks will be read by bots and in my case they will detect it twice?Does anybody have experiences on this issue?bestHolger
On-Page Optimization | | inlinear0 -
Removing syndicated duplicate content from website - what steps do I need to take to make sure Google knows?
Hey all, So I've made the decision to cancel the service that provides my blog with regular content / posts, since it seems that having duplicate content on my site isn't doing me any favors. So I'm on a Wordpress system - I'll be exporting the posts so I have them for reference, and then deleting the posts. There are like 150 or so - What steps should I take to ensure that Google learns of the changes I've made? Or do I not need to do anything at all in that department? Also - I guess I've assumed that the best decision would be to 'remove' the content from my blog. IS that the best way to go? Or should I leave it in place and start adding unique content? (my guess is that I need to remove it...) Thanks for your help, Kurt
On-Page Optimization | | KurtBullock0 -
Duplicated Content Column in excel
I'd like to see all duplicated content URLs in excel. But when I do the export to csv, and then use text to columns, I end up with an empty duplicated content column. The URLs should be in column AF in excel, but this column is empty. Can somebody help me on this?
On-Page Optimization | | jdclerck0 -
Best practice to solve this Unique duplicate page content issue?
I just got Seomoz Pro (it's awesome!), and when I did a campaign for my website I discovered that I have a big issue with duplicate page content (as well as titles). The Crawl Diagnostics Summary told me I have 196 Crawl Errors Found (I had a total of 362 pages crawled on my site), and as much as 160 of these was duplicate page content. Which to me sounds like a big problem, correct me if I'm wrong (I'm very new to SEO). So our website is an ecommerce that sells greeting cards. The unique part about our platform is that we offer the customer to make a customization of the cards.
On-Page Optimization | | danielpett
Let me walk you through each step a customer takes so you fully understand: They find a card they like and visit the product page of that card (just like on any ecommerce store.) They then decide they want to buy it. There is no "Add to cart" button, they will instead click on a "customize the card" button. 3) This takes them to a step by step process of customizing the card. They change the name on the front of the greeting card so it says for example: "Happy Birthday Katy!". And then adds a personal text on the inside of the card. They then add an delivery address and when it should be delivered. After that they proceed to checkout and it's all done. This is my website (it's in Swedish): loveday.se - it will take you to a product page so that you can click the green button and see what I mean with the customization pages. Hopefully it helps even though it's in Swedish. My issue starts at the customization part of the site (the bolded step above), as I can see the permalinks in the diagnostics I got.
This step-by-step process looks exactly the same with every card in the store. Same call-to-action headline, same descriptive text etc. The only difference is a JPEG-file with the unique greeting card design. So, what is your take on this? Let me know if I was unclear about something. Any help or advice is greatly appreciated.0 -
Duplicate content
Hello, I have two pages showing dulicate content. They are: http://www.cedaradirondackchairs.net/ http://www.cedaradirondackchairs.net/index Not sure how to resolve this issue. Any help would be greatly appreciated! Thanks.
On-Page Optimization | | Ronb10230 -
What should I do with these duplicate mass production?
Hi, I'm reviewing somebodies site and just realized that it's overflown with duplicates. Like these: <colgroup><col width="3496"></colgroup>
On-Page Optimization | | jjtech
| www.joannalark.com/store/products/24"-Sting.html |
| www.joannalark.com/store/products/24"-Sting.html?setCurrencyId=1 |
| www.joannalark.com/store/products/24"-Sting.html?setCurrencyId=6 |
| www.joannalark.com/store/products/24"-Sting.html?setCurrencyId=7 | It also produces something like this: | <colgroup><col width="3496"></colgroup>
| www.joannalark.com/store/pages/pages/pages/pages/pages.php?pageid=8 |
| www.joannalark.com/store/pages/pages/pages/pages/pages/pages.php?pageid=8 |
| www.joannalark.com/store/pages/pages/pages/pages/pages/pages/pages.php?pageid=8 |
| www.joannalark.com/store/pages/pages/pages/pages/pages/pages/pages/pages.php?pageid=8 |
| www.joannalark.com/store/pages/pages/pages/pages/pages/pages/pages/pages/pages.php?pageid=8 |
| www.joannalark.com/store/pages/pages/pages/pages/pages/pages/pages/pages/pages/pages.php?pageid=8 | |
|
|
|
|
| I don't know what to do with that and would appreciate any help Thanks, JJ <colgroup><col width="3496"></colgroup>
| |
| |
| |
| |
| |
| |0 -
Is the www and non www isue realy seen by Google as duplicate content?
I realy don't understand how Google could posibly devaluate a link because the site displays the same content with www and without www. I mean did somebody recently saw a devaluation of a domain because of this isue? I somehow can not belive this because it is the standard when geting a new webspace that the new website display the same content with and without www. Is a redirect realy necessary?
On-Page Optimization | | MichaelJanik0