How do I avoid this issue of duplicate content with Google?
-
I have an ecommerce website which sells a product that has many different variations based on a vehicle’s make, model, and year. Currently, we sell this product on one page “www.cargoliner.com/products.php?did=10001” and we show a modal to sort through each make, model, and year. This is important because based on the make, model, and year, we have different prices/configurations for each. For example, for the Jeep Wrangler and Jeep Cherokee, we might have different products:
Ultimate Pet Liner - Jeep Wrangler 2011-2013 - $350
Ultimate Pet Liner - Jeep Wrangler 2014 - 2015 - $350
Utlimate Pet Liner - Jeep Cherokee 2011-2015 - $400
Although the typical consumer might think we have 1 product (the Ultimate Pet Liner), we look at these as many different types of products, each with a different configuration and different variants.
We do NOT have unique content for each make, model, and year. We have the same content and images for each. When the customer selects their make, model, and year, we just search and replace the text to make it look like the make, model, and year. For example, when a custom selects 2015 Jeep Wrangler from the modal, we do a search and replace so the page will have the same url (www.cargoliner.com/products.php?did=10001) but the product title will say “2015 Jeep Wrangler”.
Here’s my problem:
We want all of these individual products to have their own unique urls (cargoliner.com/products/2015-jeep-wrangler) so we can reference them in emails to customers and ideally we start creating unique content for them. Our only problem is that there will be hundreds of them and they don’t have unique content other than us switching in the product title and change of variants. Also, we don’t want our url www.cargoliner.com/products.php?did=10001 to lose its link juice.
Here’s my question(s):
My assumption is that I should just keep my url: www.cargoliner.com/products.php?did=10001 and be able to sort through the products on that page. Then I should go ahead and make individual urls for each of these products (i.e. cargoliner.com/products/2015-jeep-wrangler) but just add a “nofollow noindex” to the page.
Is this what I should do?
How secure is a “no-follow noindex” on a webpage? Does Google still index?
Am I at risk for duplicate content penalties?
Thanks!
-
Hi Don,
Using these generic blocks for multiple products is an awesome idea, but don't you think it'll end up creating almost duplicate content section on all these product pages?
Does bots consider duplicate content across the page, or section wise?
-
The only thing I'd add to this is to that nofollow isn't secure enough. I would recommend blocking the individual product pages that you don't want search engines to find. Thanks Donford for the detailed response.
Craig
-
Thanks for your response Don. I have a followup question for you.
I did understand your your t-shirt example but I think it's even more refined in our case. Since there are so many different variants, we do have to count them as different products, and we would like to avoid creating custom content right away (there's just so much we would have to create). I'm still thinking our perfect solution would be custom links but with a canonical tag pointing back to "www.cargoliner.com/products.php?did=10001".
Here's my solution based on your advise:
I'm going to keep my main page where people can sort to find there make, model, and year (www.cargoliner.com/products.php?did=10001).
Then I plan to have specific product pages per make, model, and year which will have duplicate content except for the make, model, and year searched and replaced. ie Jeep Wrangler 2015-2016 will be "www.cargoliner/products/jeep-wrangler-2015-2016" and in that page I will have a canonical tag point back to "www.cargoliner.com/products.php?did=10001" so I don't get hit with duplicate content.
After this and over the next 6-9 months, I plan to fill in all of the custom content for each product. Then I plan to remove the canonical tag once I have custom content on that page.
Does this sound like the correct approach?
-
Hi Kirby,
What you have here is a common hurdle to many online businesses. Just like a company selling T-shirts, they may have a smiley face t-shirt in s, m, l, xl, xxl and in 5 different colors. So how does one optimize their content to be found when somebody searches for XL red smiley face t-shirt?
You can take one of two approaches.
Option 1: You can optimize a page for the "main item" in my example it would be "smiley face t-shirt". Then try to get the long tailed keywords on the page by listing colors, sizes, on the page a couple times. The goal of this page would be to rank #1 for the broad keyword Smiley Face T-Shirt, and high on longer keywords like XL Smiley Face T-Shirts, and Red Smiley Face T-shirt. With this approach you would use parameters for items, so you could generate a unique url, but you would canonical it back to the main page.
So the main page would look like this:
url: thetshirtShop.us/mens-tees/smilely-face-tshirt
Title: Smiley Face T-Shirt, Men's Small Through XL
H1: Men's Smiley Face T-Shirt, S,M,L,XL, 2XL, Select A Color & SizeThen for each option you would have your parameters (example red, XL)
url: thetshirtShop.us/mens-tees/smilely-face-tshirt?size=XL&color=red
Canonical: thetshirtShop.us/mens-tees/smiely-face-tshirtThat is the basic structure, but be sure in your content you include the color's and sizes at least twice so when the page is crawled the crawlers see the sizes and colors.
Option 2: which in my opinion is the better of the two would be to create unique content for each item. I know this may seem like it would be difficult or time consuming, and to a degree it is. Again how much different is a red t-shirt from a green one? Aside from the color not much, but there is opportunity here.
Example option 2 color red / xl
url: thetshirtShop.us/mens-tees/smilely-face-tshirt?size=XL&color=red
Title: Red Smiley Face T-Shirt Men's Size XL
H1: Men's Red Smilely Face T-Shirt Size XLContent Block (Smiley Face T)
By artist Harvest Ross Ball the smiley face t-shirt puts a smile on your chest and a positive attitude by on-lookers. Staying true to the original creator the smiley face tee has been digital optimized and transferred to the tee using our proprietary screen process that insures the image will last. This is one of best selling iconic tees and available in colors, yellow, red, black, white and green. Men's U.S standard sizes, small, medium, large, extra large, and extra extra large.Content Block (Color-Red):
The T-Shirt Shop using the finest natural dyes in creating our vibrant red color for {Smiley Face T-Shirt}. Using red hibiscus, sumac berries and beets. Our red color is formulated to last for thousands of washes with no fading or wash bleeding. The end result we get true red {Smiley Face T-Shirt} which is quality tested to be HEX #FF0000 RGB 255,0,0 +- within 2 shades.Content Block (Size XL):
Our Men's Size XL for our {Red} {Smiley Face T-Shirt} follows U.S sizing standards; width 24" (61 cm), length 31.25" (80cm). Sleeve length for the {Red} {Smiley Face T-Shirt} is 9" (22cm), and can accommodate up to a 16" (40cm) bicep comfortably. All our T's are pre-washed and pre-shrunk to ensure our high sizing standards are meet.Okay, so let me break down what I did here. I created 2 different "specific" code blocks with dynamic elements inside them noted by the { } brackets. Along with the original description block. You can reuse these blocks for other t-shirts, and still achieve a unique page. For example:
Smiley Face T (red/xl) Content Blocks = Smiley Face T, Red, XL
Smiley Face T (green/large) Content Blocks = Smiley Face T, Green, LargeUsing these code blocks with dynamic insertion of product details you can achieve a unique page. However you need to get creative in writing these blocks, and do some back end coding or extra work in getting them uploaded to the thousands of different products properly. Just to note I have no knowledge of t-shirts or how best to market them, I just used this for an example to help you identify possible areas of creativity in your product.
In summary I have laid out 2 options for this problem. You can use parameters with canonical and a single page, or get creative and make some unique content pages. Each option will achieve unique urls for your email campaign, and follow SEO best practices.
Hope this helps,
Don
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Who gets punished for duplicate content?
What happens if two domains have duplicate content? Do both domains get punished for it, or just one? If so, which one?
Technical SEO | | Tobii-Dynavox0 -
Image Indexing Issue by Google
Hello All,My URL is: www.thesalebox.comI have Submitted my image Sitemap in google webmaster tool on 10th Oct 2013,Still google could not indexing any of my web images,Please refer my sitemap - www.thesalebox.com/AppliancesHomeEntertainment.xml and www.thesalebox.com/Hardware.xmland my webmaster status and image indexing status are below, Can you please help me, why my images are not indexing in google yet? is there any issue? please give me suggestions?Thanks!
Technical SEO | | CommercePundit0 -
Duplicate Content Issues
We have some "?src=" tag in some URL's which are treated as duplicate content in the crawl diagnostics errors? For example, xyz.com?src=abc and xyz.com?src=def are considered to be duplicate content url's. My objective is to make my campaign free of these crawl errors. First of all i would like to know why these url's are considered to have duplicate content. And what's the best solution to get rid of this?
Technical SEO | | RodrigoVaca0 -
Bad Duplicate content issue
Hi, for grappa.com I have about 2700 warnings of duplicate page content. My CMS generates long url like: http://www.grappa.com/deu/news.php/categoria=latest_news/idsottocat=5 and http://www.grappa.com/deu/news.php/categoria%3Dlatest_news/idsottocat%3D5 (this is a duplicated content). What's the best solution to fix this problem? Do I have to set up a 301 redirect for all the duplicated pages or insert the rel=canonical or rel=prev,next ? It's complicated becouse it's a multilingual site, and it's my first time dealing with this stuff. Thanks in advance.
Technical SEO | | nico860 -
Duplicate Content Issue with
Hello fellow Moz'rs! I'll get straight to the point here - The issue, which is shown in the attached image, is that for every URL ending in /blog/category/name, it has a duplicate page of /blog/category/name/?p=contactus. Also, its worth nothing that the ?p=contact us are not in the SERPs but were crawled by SEOMoz and they are live and duplicate. We are using Pinnacle cart. Is there a way to just stop the crawlers from ?p=contactus or? Thank you all and happy rankings, James
Technical SEO | | JamesPiper0 -
Duplicate content and http and https
Within my Moz crawl report, I have a ton of duplicate content caused by identical pages due to identical pages of http and https URL's. For example: http://www.bigcompany.com/accomodations https://www.bigcompany.com/accomodations The strange thing is that 99% of these URL's are not sensitive in nature and do not require any security features. No credit card information, booking, or carts. The web developer cannot explain where these extra URL's came from or provide any further information. Advice or suggestions are welcome! How do I solve this issue? THANKS MOZZERS
Technical SEO | | hawkvt10 -
Google crawl index issue with our website...
Hey there. We've run into a mystifying issue with Google's crawl index of one of our sites. When we do a "site:www.burlingtonmortgage.biz" search in Google, we're seeing lots of 404 Errors on pages that don't exist on our site or seemingly on the remote server. In the search results, Google is showing nonsensical folders off the root domain and then the actual page is within that non-existent folder. An example: Google shows this in its index of the site (as a 404 Error page): www.burlingtonmortgage.biz/MQnjO/idaho-mortgage-rates.asp The actual page on the site is: www.burlingtonmortgage.biz/idaho-mortgage-rates.asp Google is showing the folder MQnjO that doesn't exist anywhere on the remote. Other pages they are showing have different folder names that are just as wacky. We called our hosting company who said the problem isn't coming from them... Has anyone had something like this happen to them? Thanks so much for your insight!
Technical SEO | | ILM_Marketing
Megan0 -
Duplicate content
This is just a quickie: On one of my campaigns in SEOmoz I have 151 duplicate page content issues! Ouch! On analysis the site in question has duplicated every URL with "en" e.g http://www.domainname.com/en/Fashion/Mulberry/SpringSummer-2010/ http://www.domainname.com/Fashion/Mulberry/SpringSummer-2010/ Personally my thoughts are that are rel = canonical will sort this issue, but before I ask our dev team to add this, and get various excuses why they can't I wanted to double check i am correct in my thinking? Thanks in advance for your time
Technical SEO | | Yozzer0