Magento and Duplicate content
-
I have been working with Magento over the last few weeks and I am becoming increasingly frustrated with the way it is setup. If you go to a product page and remove the sub folders one by one you can reach the same product pages causing duplicate content. All magento sites seem to have this weakness. So use this site as an example because I know it is built on magento,
http://www.gio-goi.com/men/clothing/tees/throve-t-short.html?cid=756
As you remove the tees then the clothing and men sub folders you can still reach the product page. My first querstion is how big an issue is this and two does anyone have any ideas of how to solve it?
Also I was wondering how does google treat question marks in urls? Should you try and avoid them unless you are filtering?
Thanks
-
Gregster,
I assume that you have found an answer to your question by now. However, I wanted to offer up what looks to be an extremely in depth and comprehensive walkthrough on Magento SEO from yoast.com. They have several sections on duplicate content, as well as a canonical plugin you may find useful.
http://yoast.com/articles/magento-seo/
Best of Luck!
-
"I recommend you nofollow the login, search, and cart pages through XML layout. That will cross off another 500 pages or so." Not nofollow. Don't use nofollow . This is for untrusted links - so should not be used for internal links.
It's Noindex. And then use the canonical tag if 301 Redirects are not an option. To make life more complicated, you need to be careful not to do use noindex and canonical tag simultaneously.
-
Hi Kevin,
I would be interested to talk more with you about this issue. What does your custom extension do that others don't?
Thanks again.
-
Hi Gregster. I feel your pain. Having worked on Magento for the past three years, I've come across a lot of "issues" you'd expect a top-tier e-commerce solution provider to have under control.
I've written about getting canonical URLs in CMS pages here, something that many Magento SEO extensions don't do. I also had a custom SEO extension created and would be happy to share with you. No cost. Just use it.
I don't know if you have multiple languages, but that alone will create an exponential amount of duplicate content from dynamic parameters. Go into your WMT and set those parameters to be ignored. If you aren't sure how to do that, it's well documented here and on Google, Yahoo, and Bing webmaster sites.
I recommend you nofollow the login, search, and cart pages through XML layout. That will cross off another 500 pages or so.
One last mention is that RocketTheme has created a pretty neat extension that will get rid of the p parameter altogether by using JS to switch from grid and list views. Or you could just select in your admin to only allow either grid or list instead of both.
Any more questions just ask.
-
Hi,
Magento is surely a "beast"... the way to solve your problem, as far as I understood it, is to use the rel="canonical", in order to show to the Search Engines what URL they have to consider in case of duplicated content.
The solutions?
- or you have very good devs skills (or a developer very fond of Magento);
- or you have to rely to the many extensions existing.
Very well know is the Yoast extension, but it seems it can give serious problem on the lastest version of Magento.
Another SEO extension is SEO Suite Pro Magento Extension (which exists also in a Ultimate version), Very good extension, but not for free.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content, although page has "noindex"
Hello, I had an issue with some pages being listed as duplicate content in my weekly Moz report. I've since discussed it with my web dev team and we decided to stop the pages from being crawled. The web dev team added this coding to the pages <meta name='robots' content='max-image-preview:large, noindex dofollow' />, but the Moz report is still reporting the pages as duplicate content. Note from the developer "So as far as I can see we've added robots to prevent the issue but maybe there is some subtle change that's needed here. You could check in Google Search Console to see how its seeing this content or you could ask Moz why they are still reporting this and see if we've missed something?" Any help much appreciated!
Technical SEO | | rj_dale0 -
ViewState and Duplicate Content
Our site keeps getting duplicated content flagged as an issue... however, the pages being grouped together have very little in common on-page. One area which does seem to recur across them is the ViewState. There's a minimum of 150 lines across the ones we've investigated. Could this be causing the reports?
Technical SEO | | RobLev0 -
Duplicate Content in Dot Net Nuke
Our site is built on Dot Net Nuke. SEOmoz shows a very large amount of duplicate content because at the beginning each page got an extension in the following format: www.domain.com/tabid/110/Default.aspx The site additionally exists without the tabid... part. Our web developer says an easy fix with a canonical tag or 301 redirect is not possible. Does anyone have DNN experience and can point us in the right direction? Thanks, Ricarda
Technical SEO | | jsillay0 -
"Daily Special" = Duplicate Content?
I believe this has been addresses and answered previously, but despite searching the Q&A archives, I was unable to find the question and answer. So, please be gentle and patient: We have an eCommerce site with several hundred products, most of which use the structure: www.mysite.com/subcategory/itemA.html. We wish to feature itemA as a "daily special" item, and our Magento developer has recommended: www.mysite.com/internet-daily-special/**itemA.html ** Because itemA.html is the same page—albeit following a different path—will Google see this as duplicate content? Thanks.
Technical SEO | | RScime250 -
How can i see the pages that cause duplicate content?
SEOmoz PRO is giving me back duplicate content errors. However, i don't see how i can get a list of pages that are duplicate to the one shown. If i don't know which pages/urls cause the issue i can't really fix it. The only way would be placing canonical tags but that's not always the best solution. Is there a way to see the actual duplicate pages?
Technical SEO | | 5MMedia0 -
Multiple URLs in CMS - duplicate content issue?
So about a month ago, we finally ported our site over to a content management system called Umbraco. Overall, it's okay, and certainly better than what we had before (i.e. nothing - just static pages). However, I did discover a problem with the URL management within the system. We had a number of pages that existed as follows: sparkenergy.com/state/name However, they exist now within certain folders, like so: sparkenergy.com/about-us/service-map/name So we had an aliasing system set up whereby you could call the URL basically whatever you want, so that allowed us to retain the old URL structure. However, we have found that the alias does not override, but just adds another option to finding a page. Which means the same pages can open under at least two different URLs, such as http://www.sparkenergy.com/state/texas and http://www.sparkenergy.com/about-us/service-map/texas. I've tried pointing to the aliased URL in other parts of the site with the rel canonical tag, without success. How much of a problem is this with respect to duplicate content? Should we bite the bullet, remove the aliased URLs and do 301s to the new folder structure?
Technical SEO | | ufmedia0 -
Duplicate Content Issue
Hi Everyone, I ran into a problem I didn't know I had (Thanks to the seomoz tool) regarding duplicate content. my site is oxford ms homes.net and when I built the site, the web developer used php to build it. After he was done I saw that the URL's looking like this "/blake_listings.php?page=0" and I wanted them like this "/blakes-listings" He changed them with no problem and he did the same with all 300 pages or so that I have on the site. I just found using the crawl diagnostics tool that I have like 3,000 duplicate content issues. Is there an easy fix to this at all or does he have to go in and 301 Redirect EVERY SINGLE URL? Thanks for any help you can give.
Technical SEO | | blake-766240 -
Aspx filters causing duplicate content issues
A client has a url which is duplicated by filters on the page, for example: - http://www.example.co.uk/Home/example.aspx is duplicated by http://www.example.co.uk/Home/example.aspx?filter=3 The client is moving to a new website later this year and is using an out-of-date Kentico CMS which would need some development doing to it in order to enable implementation of rel canonical tags in the header, I don't have access to the server and they have to pay through the nose everytime they want the slightest thing altering. I am trying to resolve this duplicate content issue though and am wondering what is the best way to resolve it in the short term. The client is happy to remove the filter links from the page but that still leaves the filter urls in Google. I am concerned that a 301 redirect will cause a loop and don't understand the behaviour of this type of code enough. I hope this makes sense, any advice appreciated.
Technical SEO | | travelinnovations0