Canonical for stupid _GET parameters or not? [deep technical details]
-
Hi,
Im currently working on www.kupwakacje.pl which is something like travel agency. People can search for holidays and buy/reserve them. I do know plenty of problems on my website, and thx to seomoz hopefully I will be able to fix them but one is crucial and it's kind of hard to fix I think. The search engine is provided by external party in form of simple API which is in the end responding with formatted HTML - which is completly stupid and pointless, but that's not the main problem. Let's dive in:
So for example the visitor goes to homepage, selects Egypt and hit search button. He will be redirected to
and this is not a joke
'wczasy-egipt' is my invention obviously and it means 'holidays-egypt'. I've tried to at least have 'something' in the url that makes google think it's related to Egypt indeed. Rest which is the complicated ep3[] thingy is a bunch of encoded parameters. This thing renders in first step a list of hotels, in next one hotel specific offer and in next one the reservation page. Problem is that all those links generated by this so-called API are only changing subparameters in ep3[] parameter so for example clicking on a single hotel changes to url to:
www.kupwakacje.p/wczasy-egipt/?url=wczasy-egipt/&ep3[]=%3Fsid%3Db5onrj4hdnspb5eku4s2iqm1g3lomq91%26l ang%3Dpl%26drt%3D30%26sd%3D10.06.2011%26ed%3D30.12.1999%26px%3D99999 %26dsr%3D11%253A%26ds%3D11%253A%26sp%3D
which is obviously looking not very different to the first one. what I would like to know is shall i make all pages starting with 'wczasy-egipt' a rel-canonical to the first one (www.kupwakacje.pl/wczasy-egipt) or shoudn't I? google recognizes the webpage according to webmasters central, and recognizes the url but responses with mass duplicate content. What about positioning my website for the hotel names - so long tail optimalization?
I know it's a long and complicated post, thx for reading and I would be very happy with any tip or response.
-
Also, here's a blog post from SEOmoz discussing the idea of Google, internal search results pages, and thin content: http://www.seomoz.org/blog/fat-pandas-and-thin-content
"Google has often taken a dim view of internal search results (sometimes called “search within search”, although that term has also been applied to Google’s direct internal search boxes). Essentially, they don’t want people to jump from their search results to yours – they want search users to reach specific, actionable information.
While Google certainly has their own self-interest in mind in some of these cases, it’s true that internal search can create tons of near duplicates, once you tie in filters, sorts, and pagination. It’s also arguable that these pages create a poor search experience for Google users.
The Solution
This can be a tricky situation. On the one hand, if you have clear conceptual duplicates, like search sorts, you should consider blocking or NOINDEXing them. Having the ascending and descending version of a search page in the Google index is almost always low value.
Likewise, filters and tags can often create low-value paths to near duplicates.
Search pagination is a difficult issue and beyond the scope of this post, although I’m often in favor of NOINDEXing pages 2+ of search results. They tend to convert poorly and often look like duplicates." -
Yeah, the iframe idea seems to be the easiest to implement and would give you a nice amount of control over both the URLs and the content on the pages. Generally Google tries to avoid indexing other sites' internal search results pages, so if you can add content around the iframe that helps make those search pages unique, that will help.
-
ok, will try all of these advices to be honest. I'm 99% sure I can't do much about the GET parameters, but will check.
Second thing which is making some kind of static pages and linking them with an iframe response seems really nice idea and is definetely doable. I will dive into that.
Third one is the most obvious one but I doubt I will manage to do it (even though I'm really not a bad developer ;)) there are about 30 parameters which need to be rewritten probably. It might be a better idea just to overwrite a few main ones (like which step user is at, which direction, which hotel etc). But can apache decode javascript?
hmm..
Thx for answers so far!
-
First, I'd look for a way to shorten the URL via the API. There are a TON of blank variables in that URL so I'm guessing the API has everything turned on, even though you're not pulling results for all those variables. If you can, get it to return data on only the things being searched for.
Next, if the API is just too unmanageable, I'd look into building static pages that pull search results into them via an iFrame. That way you could control all the URLs and content for several hundred popular searches, have nice clean URLs, but still have the dynamic search results as a portion of the page.
A last option, if possible, would be to setup URL rewrites to change the popular searches into normal sounding pages, but that could be difficult and cause things to break if the API changes suddenly or throws more random variables into the mix.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Canonical Tags for Legacy Duplicate Content
I've got a lot of duplicate pages, especially products, and some are new but most have been like this for a long time; up to several years. Does it makes sense to use a canonical tag pointing to one master page for each product. Each page is slightly different with a different feature and includes maybe a sentence or two that is unique but everything else is the same.
Technical SEO | | AmberHanson0 -
Should I keep writing about the same using rel canonical?
Hi, The service we provide has not so many searches per month. A long tail keyword that describes the service well has at the most 400 searches per month. We wrote a post for this keyword and we ranked number 1 for many months. Now we're on page 2 and I the truth is we stopped writing blog posts because we were raking well for our best keywords. I added a few new posts and lost ranking on my top keywords so I gave up, deleted them and recover the rankings for the keywords I wanted the most. The problem is that I have lost these positions and I know we're supposed to be updating the blog regularly. What would you suggest? Should we keep writing about the same thing and use rel canonical? There aren't that many keywords related to what we offer. I appreciate any ideas.
Technical SEO | | Naix0 -
How to solve Parameter Issue causing Duplicate Content
Hi everyone, My site home page comes up in SERP with following url www.sitename/?referer=indiagrid My question is:- Should I disallow using robots.txt.? or 301 redirect to the home page Other issue is i have few dynamic generated URL's for a form http://www.www.sitename/career-form.php?position=SEO Executive I am using parameter "position" in URL Parameter in GWT. But still my pages are indexed that is leading to duplicate page content. Please help me out.
Technical SEO | | himanshu3019890 -
Rel=canonical for similar (not exact) content?
Hi all, We have a software product and SEOMOZ tools are currently reporting duplicate content issues in the support section of the website. This is because we keep several versions of our documentation covering the current version and previous 3-4 versions as well. There is a fair amount of overlap in the documentation. When a new version comes out, we simply copy the documentation over, edit it as necessary to address changes and create new pages for the new functionality. This means there is probably an 80% or so overlap from one version to the next. We were previously blocking Google (using robots.txt) from accessing previous versions of the sofware documentation, but this is obviously not ideal from an SEO perspective. We're in the process of linking up all the old versions of the documenation to the newest version so we can use rel=canonical to point to the current version. However, the content isn't all exact duplicates. Will we be penalized by Google because we're using rel=canonical on pages that aren't actually exact duplicates? Thanks, Darren.
Technical SEO | | dgibbons0 -
Adding Rel Canonical to multiple pages
Hi, Our CMS generates a lot of duplicate content, (Different versions of every page for 3 different font sizes). There are many other reasons why we should drop this current CMS and go with something else, and we are in the process of doing that. But for now, does anyone know how would I do the following: I've created a spreadsheet that contains the following: Column 1: rel="canonical" tag for URL Column 2: Duplicate Content URL # 1 Column 3: Duplicate Content URL # 2 Column 4: Duplicate Content URL # 3 I want to add the tag from column 1 into the head of every page from column 2,3, and 4. What would be a fast way to do this considering that I have around 1800 rows. Check the screenshot of the builtwith.com result to see more information about the website if that helps. Farris bxySL
Technical SEO | | jdossetti0 -
Whats the quickest way of diagnosing a canonical problem
Salut from from positivley tropical 10 degrees C wetherby UK 🙂 Ok here goes... on this site http://www.cedarcourthotels.co.uk/ there is a canonical problem but I'm interested to know if my method of spotting a canonical problem is the most efficient. In the case of Cedar Court I started with http://www.cedarcourthotels.co.uk/ then i entered this https://www.cedarcourthotels.co.uk/ and noted they were pointing to the same home page. My question is there a quicker way of diagnosing a canonical problem or is it a case of knowcking out w's and adding s etc. Thanks in advance, David
Technical SEO | | Nightwing0 -
Google +1 not recognizing rel-canonical
So I have a few pages with the same content just with a different URL. http://nadelectronics.com/products/made-for-ipod/VISO-1-iPod-Music-System http://nadelectronics.com/products/speakers/VISO-1-iPod-Music-System http://nadelectronics.com/products/digital-music/VISO-1-iPod-Music-System All pages rel-canonical to:
Technical SEO | | kevin4803
http://nadelectronics.com/products/made-for-ipod/VISO-1-iPod-Music-System My question is... why can't google + (or facebook and twitter for that matter) consolidate all these pages +1. So if the first two had 5 +1 and the rel-canonical page had 5 +1's. It would be nice for all pages to display 15 +1's not 5 on each. It's my understanding that Google +1 will gives the juice to the correct page. So why not display all the +1's at the same time. Hope that makes sense.0 -
Effect of rel canonical on links
Has anyone done any experimentation on how Google treats links that are on a page that is being "rel canonical'd" to another page? For eg, example.com/b has a canonical pointing to example.com/a How does Google treat the internal links that are on page example.com/b?
Technical SEO | | Burgo0