Duplicate content via dynamic URLs where difference is only parameter order?
-
I have a question about the order of parameters in an URL versus duplicate content issues. The URLs would be identical if the parameter order was the same.
E.g.
www.example.com/page.php?color=red&size=large&gender=male versus
www.example.com/page.php?gender=male&size=large&color=redHow smart is Google at consolidating these, and do these consolidated pages incur any penalty (is their combined “weight” equal to their individual selves)?
Does Google really see these two pages as DISTINCT, or does it recognize that they are the same because they have the exact same parameters?
Is this worth fixing in or does it have a trivial impact?
If we have to fix it and can't change our CMS, should we set a preferred, canonical order for these URLs or 301 redirect from one version to the other?
Thanks a million!
-
To be fair to Highland, I do think canonical is a good bet here, but I just have to comment that I don't think Google handles these kinds of URLs very well. They should, in theory, but in my experience they rarely do. The problem with order variants is that you can easily spin 100s or 1000s of them and create serious indexation and ranking problems.
For this particular example, the canonical tag is probably best, but there may be cases where certain parameters have no particular value (like a "sort by" parameter). Those are sometimes better off blocked.
I cover a bunch of examples in my mega-post on duplicate content:
http://www.seomoz.org/blog/duplicate-content-in-a-post-panda-world
-
Agreed with Highland, this seems exactly the kind of problem canonical can fix. I wouldn't go down the road of 301ing because for parameters that simple you like aren't going to run into problems. The rule of thumb is you should act if you have more than two parameters in the URL (not sure where I read that), but I've seen Google 'figure out' up to 4 for some of my sites.
Another thing to check out is Google webmaster tools, you can set certain keywords and url parameters there to help Google 'learn' how to crawl your site. This Google blog posting might help too:
http://googlewebmastercentral.blogspot.com/2008/09/dynamic-urls-vs-static-urls.html
-
Google should recognize the difference but, just to be safe, I would add a canonical to your page so you don't have anything to worry about.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Where to Place Quality Content in Order to Create Links?
Assuming we have retained a an award winning journalist to write articles/blog posts about our business. Assuming the content is useful and engaging. Where would be the best place to publish it to create high quality backlinks? 1. Our website blog
Intermediate & Advanced SEO | | Kingalan1
2. Social media sites like our LinkedIn or Facebook pages.
3. Sending completed articles to websites that might potentially have an interest in publishing them.
4. Publishing the articles on our website and then promoting them with Adwords and Facebook to demographics that would find them interesting and link to them.
5. Combination of publishing an article on our website and posting a related article on social media and linking it back to the original article on our website.
6. Place a custom written article of extremely high quality on affiliate website run by the HOTH or a competitor. But before publishing check the affiliate website on AHREFS and Link Research Tools to ensure that the metrics are not at all spammy (decent domain rating). Which of the above options (or combination of) would most likely result in backlinks of good quality? Assume the quality of the writing is excellent. If pitching the content to other websites (#3) would work, how would we identify these websites? Thanks,
Alan0 -
Duplicate Content
Let's say a blog is publishing original content. Now let's say a second blog steals that original content via bot and publishes it as it's own. Now further assume the original blog doesn't notice this for several years. How much damage could this do to blog A for Google results? Any opinions?
Intermediate & Advanced SEO | | CYNOT0 -
SEO for video content that is duplicated accross a larger network
I have a website with lots of content (high quality video clips for a particular niche). All the content gets fed out 100+ other sites on various domains/subdomains which are reskinned for a given city. So the content on these other sites is 100% duplicate. I still want to generate SEO traffic though. So my thought is that we: a) need to have canonical tags from all the other domains/subdomains that point back to the original post on the main site b) probably need to disallow search engine crawlers on all the other domains/subdomains Is this on the right track? Missing anything important related to duplicate content? The idea is that after we get search engines crawling the content correctly, from there we'd use the IP address to redirect the visitor to the best suited domain/subdomain. any thoughts on that approach? Thanks for your help!
Intermediate & Advanced SEO | | PlusROI0 -
What URL parameter settings in GWT to choose for search results parameter?
Hello,we're about to disallow search results from crawling in robots.txt, but in GWT we have to specify URL parameters. URLs with 'search' parameter look like these: http://www.example.com/?search=keyword So in GWT we're setting the following parameter: search Question, what settings to set for it?
Intermediate & Advanced SEO | | poiseo0 -
Duplicated Content with Index.php
Good Afternoon, My website uses Joomla CMS and has the htaccess rewrite code enabled to ensure the use of search engine friendly URLs (SEF's). While browsing the crawl diagnostics I have found that Moz considers the /index.php URL a duplicate to our root. I will always under the impression that the htaccess rewrite took care of that issue and obviously I would like to address it. I attempted to create a 301 redirect from the index.php URL to the root but ran into an issue when attempting to login to the admin portion of the website as the redirect sent me back to the homepage. I was curious if anyone had advice for handling the index.php duplication issue, specifically with Joomla. Additionally, I have confirmed that in Google Webmasters, under URL parameters, the index.php parameter is set as 'Representative URL'.
Intermediate & Advanced SEO | | BrandonEML0 -
URL Parameter & crawl stats
Hey Guys,I recently used the URL parameter tool in WBT to mark different urls that offers the same content.I have the parameter "?source=site1" , "?source=site2", etc...It looks like this: www.example.com/article/12?source=site1The "source parameter" are feeds that we provide to partner sites and this way we can track the referral site with our internal analytics platform.Although, pages like:www.example.com/article/12?source=site1 have canonical to the original page www.example.com/article/12, Google indexed both of the URLs
Intermediate & Advanced SEO | | Mr.bfz
www.example.com/article/12?source=site1andwww.example.com/article/12Last week I used the URL parameter tool to mark "source" parameter "No, this parameter doesnt effect page content (track usage)" and today I see a 40% decrease in my crawl stats.In one hand, It makes sense that now google is not crawling the repeated urls with different sources but in the other hand I thought that efficient crawlability would increase my crawl stats.In additional, google is still indexing same pages with different source parameters.I would like to know if someone have experienced something similar and by increasing crawl efficiency I should expect my crawl stats to go up or down?I really appreciate all the help!Thanks!0 -
Dropped ranking - Penguin penalty or duplicate content issue?
Just this weekend a page that had been ranking well for a competitive term fell completely out of the rankings. There are two possible causes and I'm trying to figure out which it is, so I can take action. I found out that I had accidentally put a canonical on another page that was for the same page as the one that dropped out of the rankings. If there are two pages with the same canonical tag with different content, will google drop both of them from the index? The other possibility is that this is a result of the recent Penguin update. The page that dropped has a high amount of exact anchor text. As far as I can tell, there were no other pages with any penalties from the Penguin update. One last question: The page completely dropped from the search index. If this were a Penguin issue, would it have dropped out completely,or just been penalized with a drop in position? If this is a result of the conflicting canonical tags, should I just wait for it to reindex, or should I request a reconsideration of the page?
Intermediate & Advanced SEO | | gametv0 -
Pop Up Pages Being Indexed, Seen As Duplicate Content
I offer users the opportunity to email and embed images from my website. (See this page http://www.andertoons.com/cartoon/6246/ and look under the large image for "Email to a Friend" and "Get Embed HTML" links.) But I'm seeing the ensuing pop-up pages (Ex: http://www.andertoons.com/embed/5231/?KeepThis=true&TB_iframe=true&height=370&width=700&modal=true and http://www.andertoons.com/email/6246/?KeepThis=true&TB_iframe=true&height=432&width=700&modal=true) showing up in Google. Even worse, I think they're seen as duplicate content. How should I deal with this?
Intermediate & Advanced SEO | | andertoons0