Canonical for stupid _GET parameters or not? [deep technical details]
-
Hi,
I'm currently working on www.kupwakacje.pl, which is something like a travel agency. People can search for holidays and buy or reserve them. I know of plenty of problems on my website, and thanks to SEOmoz I will hopefully be able to fix them, but one is crucial and, I think, kind of hard to fix. The search engine is provided by an external party in the form of a simple API that ultimately responds with formatted HTML, which is completely stupid and pointless, but that's not the main problem. Let's dive in:
So, for example, the visitor goes to the homepage, selects Egypt and hits the search button. He will be redirected to
and this is not a joke
'wczasy-egipt' is my invention, obviously, and it means 'holidays-egypt'. I've tried to at least have something in the URL that makes Google think it is indeed related to Egypt. The rest, the complicated ep3[] thingy, is a bunch of encoded parameters. It renders a list of hotels in the first step, a hotel-specific offer in the next one, and the reservation page in the one after that. The problem is that all the links generated by this so-called API only change subparameters inside the ep3[] parameter, so, for example, clicking on a single hotel changes the URL to:
www.kupwakacje.pl/wczasy-egipt/?url=wczasy-egipt/&ep3[]=%3Fsid%3Db5onrj4hdnspb5eku4s2iqm1g3lomq91%26lang%3Dpl%26drt%3D30%26sd%3D10.06.2011%26ed%3D30.12.1999%26px%3D99999%26dsr%3D11%253A%26ds%3D11%253A%26sp%3D
which obviously doesn't look very different from the first one. What I would like to know is: should I give all pages starting with 'wczasy-egipt' a rel=canonical pointing to the first one (www.kupwakacje.pl/wczasy-egipt), or shouldn't I? Google recognizes the website according to Webmaster Central, and recognizes the URL, but responds with mass duplicate content. And what about positioning my website for the hotel names, i.e. long-tail optimization?
I know it's a long and complicated post. Thanks for reading; I would be very happy with any tip or response.
-
Also, here's a blog post from SEOmoz discussing the idea of Google, internal search results pages, and thin content: http://www.seomoz.org/blog/fat-pandas-and-thin-content
"Google has often taken a dim view of internal search results (sometimes called “search within search”, although that term has also been applied to Google’s direct internal search boxes). Essentially, they don’t want people to jump from their search results to yours – they want search users to reach specific, actionable information.
While Google certainly has their own self-interest in mind in some of these cases, it’s true that internal search can create tons of near duplicates, once you tie in filters, sorts, and pagination. It’s also arguable that these pages create a poor search experience for Google users.
The Solution
This can be a tricky situation. On the one hand, if you have clear conceptual duplicates, like search sorts, you should consider blocking or NOINDEXing them. Having the ascending and descending version of a search page in the Google index is almost always low value.
Likewise, filters and tags can often create low-value paths to near duplicates.
Search pagination is a difficult issue and beyond the scope of this post, although I’m often in favor of NOINDEXing pages 2+ of search results. They tend to convert poorly and often look like duplicates." -
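To make the rule of thumb in the quoted post concrete, here is a minimal Python sketch of deciding per search URL whether to emit a robots noindex tag. The parameter names `sort`, `order` and `page` are assumptions for illustration only; the API discussed in this thread hides its parameters inside ep3[], so treat this as the shape of the logic rather than something to drop in.

```python
from urllib.parse import urlsplit, parse_qs

# Hypothetical parameter names; a real search API will use its own.
SORT_KEYS = {"sort", "order"}
PAGE_KEY = "page"


def should_noindex(url):
    """Return True for sorted variants and for pages 2+ of search results."""
    query = parse_qs(urlsplit(url).query)
    # Any sort/order parameter marks a conceptual duplicate of the base search.
    if SORT_KEYS.intersection(query):
        return True
    # Pagination: page 1 stays indexable, later pages do not.
    page = query.get(PAGE_KEY, ["1"])[0]
    return page.isdigit() and int(page) > 1


def robots_meta(url):
    """Return the meta tag to put in <head>, or an empty string."""
    if should_noindex(url):
        return '<meta name="robots" content="noindex">'
    return ""
```

For example, `robots_meta("http://www.example.com/search?country=egypt&page=3")` yields the noindex tag, while the same URL without `page` yields an empty string.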
Yeah, the iframe idea seems to be the easiest to implement and would give you a nice amount of control over both the URLs and the content on the pages. Generally Google tries to avoid indexing other sites' internal search results pages, so if you can add content around the iframe that helps make those search pages unique, that will help.
-
OK, I will try all of this advice, to be honest. I'm 99% sure I can't do much about the GET parameters, but I will check.
The second thing, making some kind of static pages and tying them to an iframe response, seems like a really nice idea and is definitely doable. I will dive into that.
The third one is the most obvious, but I doubt I will manage it (even though I'm really not a bad developer ;)); there are about 30 parameters that would probably need to be rewritten. It might be a better idea to rewrite just a few main ones (which step the user is at, which direction, which hotel, etc.). But can Apache decode JavaScript?
hmm..
Thanks for the answers so far!
-
First, I'd look for a way to shorten the URL via the API. There are a TON of blank variables in that URL so I'm guessing the API has everything turned on, even though you're not pulling results for all those variables. If you can, get it to return data on only the things being searched for.
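If the API itself can't be told to return less, one way to see what is actually in the URL is to decode the ep3[] payload and throw away the blanks. This is only a rough Python sketch: the payload is the one from the hotel URL quoted in the question, and treating `sid` as a session identifier that can be ignored is a guess on my part, not something the API documents.

```python
from urllib.parse import parse_qsl, unquote, urlencode

# ep3[] payload copied from the hotel URL in the question.
EP3 = (
    "%3Fsid%3Db5onrj4hdnspb5eku4s2iqm1g3lomq91%26lang%3Dpl%26drt%3D30"
    "%26sd%3D10.06.2011%26ed%3D30.12.1999%26px%3D99999"
    "%26dsr%3D11%253A%26ds%3D11%253A%26sp%3D"
)

# Assumption: these keys never change what the page shows.
IGNORED_KEYS = {"sid"}


def strip_blank_params(encoded_payload):
    """Decode an ep3[] payload and drop blank values and ignored keys."""
    # The payload is a percent-encoded query string with a leading "?".
    inner = unquote(encoded_payload).lstrip("?")
    pairs = parse_qsl(inner, keep_blank_values=True)
    kept = [(k, v) for k, v in pairs if v != "" and k not in IGNORED_KEYS]
    return urlencode(kept)


print(strip_blank_params(EP3))
```

Whatever survives this filter is the set of parameters that really distinguishes one page from another, which is also the set worth keeping in a canonical URL.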
Next, if the API is just too unmanageable, I'd look into building static pages that pull search results into them via an iFrame. That way you could control all the URLs and content for several hundred popular searches, have nice clean URLs, but still have the dynamic search results as a portion of the page.
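As a rough idea of what generating those static pages could look like, here is a small Python sketch that fills an HTML template with your own title and intro text and embeds the search results in an iframe. The template, the function name and the example URLs are all made up for illustration, and the self-referencing rel=canonical is my own addition rather than something the API requires.

```python
from html import escape

# Hypothetical page template: unique content first, search results in an iframe.
PAGE_TEMPLATE = """<!DOCTYPE html>
<html>
<head>
<title>{title}</title>
<link rel="canonical" href="{canonical}">
</head>
<body>
<h1>{title}</h1>
<p>{intro}</p>
<iframe src="{search_url}" title="{title}"></iframe>
</body>
</html>
"""


def build_static_page(title, intro, canonical, search_url):
    """Render one static landing page around an embedded search result."""
    # escape() keeps ampersands and quotes in URLs from breaking the markup.
    return PAGE_TEMPLATE.format(
        title=escape(title),
        intro=escape(intro),
        canonical=escape(canonical),
        search_url=escape(search_url),
    )
```

Calling `build_static_page("Holidays in Egypt", "Our own text about Egypt.", "http://www.example.com/wczasy-egipt/", "http://www.example.com/search?a=1&b=2")` returns the full HTML for one page; a loop over a list of popular searches would produce the few hundred pages mentioned above.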
A last option, if possible, would be to set up URL rewrites to turn the popular searches into normal-sounding pages, but that could be difficult and could cause things to break if the API changes suddenly or throws more random variables into the mix.
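The rewrite idea boils down to a lookup from a clean path to the handful of parameters that matter. Here is a minimal Python sketch of that mapping; the parameter names `direction`, `step` and `hotel` are invented for illustration (the real API's names are buried in ep3[]), and anything the table doesn't know returns None so it can fall through to the existing URLs untouched.

```python
from urllib.parse import urlencode

# Hypothetical table of popular searches: clean slug -> destination value.
DIRECTIONS = {
    "wczasy-egipt": "egypt",
}


def rewrite(path):
    """Map a clean path to an API query string, or None if it is unknown."""
    parts = [p for p in path.strip("/").split("/") if p]
    if not parts or len(parts) > 2 or parts[0] not in DIRECTIONS:
        return None
    # One path segment means the hotel list for that destination.
    params = {"direction": DIRECTIONS[parts[0]], "step": "list"}
    # A second segment names a single hotel.
    if len(parts) == 2:
        params.update(step="hotel", hotel=parts[1])
    return urlencode(params)
```

Keeping the table small and explicit limits the damage if the API changes: only the popular searches depend on it, and everything else keeps working as before.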