Does a sitemap override Google parameter handling?
-
This question might seem silly, but I'll ask anyway.
We have an eCommerce site with a ton of duplicate content, mostly caused by faceted navigation. In researching ways to reduce the clutter, I've decided to use Google parameter handling to stop Googlebot from crawling pages with certain parameters, like: sort order, page #, etc...
Now my question:
If I set all of these parameters so that Googlebot doesn't crawl the grids, how will they ever find the individual product pages? We do upload a sitemap with all of the product pages. Does this solve my issue? Or, should I handle the duplicate content with noindex, follow tag?
Or, is there an even better way?
Thanks
-
Hello John,
This is a very good question, and something people don't often think about when blocking the navigational paths on their site from being crawled.
Depending on how fast your category pages load and how many products are on each of them, you may consider a View All Canonical page: http://googlewebmastercentral.blogspot.com/2011/09/view-all-in-search-results.html
There are many different ways to handle faceted navigation problems, including javascrpt, GWT parameter handling, robots meta, robots.txt, rel canonical... and combinations of these. The right approach should be customized for your specific needs. When possible, I prefer to allow Google to crawl and index down to a certain level of faceting, similar to allowing them into sub-categories (though it depends entirely on your taxonomy) but not tertiary (i.e. sub-sub) categories. For the next couple of levels I might allow them to crawl, but not index. And once it gets down to 4 or 5 levels deep (e.g. /?category=1&size=5&color=blue&price=low&this=that&so-on=so-forth...) I just block them from being both indexed and crawled (i.e. Meta NOINDEX,NOFOLLOW or robots.txt block) to save crawl budget by avoiding spider traps.
With all of that said, if you are giving Google an XML sitemap that contains the indexable URLs to all of your products they should have no problem indexing them, regardless of whether or not they can crawl all the way through your faceted navigation.
-
I would recommend you to use 'Canonical Link'
You can find more here:
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Product Pages not indexed by Google
We built a website for a jewelry company some years ago, and they've recently asked for a meeting and one of the points on the agenda will be why their products pages have not been indexed. Example: http://rocks.ie/details/Infinity-Ring/7170/ I've taken a look but I can't see anything obvious that is stopping pages like the above from being indexed. It has a an 'index, follow all' tag along with a canonical tag. Am I missing something obvious here or is there any clear reason why product pages are not being indexed at all by Google? Any advice would be greatly appreciated. Update I was told 'that each of the product pages on the full site have corresponding page on mobile. They are referred to each other via cannonical / alternate tags...could be an angle as to why product pages are not being indexed.'
Intermediate & Advanced SEO | | RobbieD910 -
Does including your site in Google News (and Google) Alerts helps with SEO?
Based on the following article http://homebusiness.about.com/od/yourbusinesswebsite/a/google-alerts.htm in order to check if you are included you need to run site:domain.com and click the news search tab. If you are not there then... I ran the test on MOZ and got no results which surprised me. Next step according to :https://support.google.com/news/publisher/answer/40787?hl=en#ts=3179198 is to submit your site for inclusion. Should I? Will it help? P.S.
Intermediate & Advanced SEO | | BeytzNet
This is a followup question to the following: http://moz.com/community/q/what-makes-a-site-appear-in-google-alerts-and-does-it-mean-anything0 -
What URL parameter settings in GWT to choose for search results parameter?
Hello,we're about to disallow search results from crawling in robots.txt, but in GWT we have to specify URL parameters. URLs with 'search' parameter look like these: http://www.example.com/?search=keyword So in GWT we're setting the following parameter: search Question, what settings to set for it?
Intermediate & Advanced SEO | | poiseo0 -
Google images
Hi, I am working on a website with a large number (millions) of images. For the last five months Ihave been trying to get Google Images to crawl and index these images (example page: http://bit.ly/1ePQvyd). I believe I have followed best practice in the design of the page, naming of images etc. Whilst crawlng and indexing of the pages is going reasonably well with the standard crawler, the image bot has only crawled about half a million images and indexed only about 40,000. Can anyone suggest what I could do to increase this number 100 fold? Richard
Intermediate & Advanced SEO | | RichardTay0 -
Why, oh why does Google hate us?
My URL is:
Intermediate & Advanced SEO | | candylotus
www.drupalgeeks.org I have tried my very best to cover all the usual SEO items.... But nothing... We are a legitimate company offering a legitimate service. Ideally we would come up in the results for: "Drupal Developers"
"Drupal Development" and
"Drupal Designers" Yet, we cannot break the top 50 for any of these.... We have:
Optimized Meta Tags
Written Quality Content
Maintained a Social Presence in Twitter, LinkedIn, Facebook, Pinterest, Youtube, and Google Plus
Blogged with some consistency
Guest Blogged
Canonicalization
And dozens of other things And we are all around nice people and a good company....... Why oh why? Can anyone take a look and see if there is something blatantly obvious I am missing? Are we using "drupal" too much?? Thank you in advanced for any assistance. Candice1 -
How does google count a menu on each page
Hello, Just wondering how google treats the TOp and bottom menu that you see on each page of a website ? Does it count it on all the pages in terms of link juice, or is it just there for user experience and only what it counts are the links in the content of a page or on the side ? Thank you,
Intermediate & Advanced SEO | | seoanalytics0 -
How to handle web server downtime?
We have a client who is taking their web server down Saturday morning from 1am - 7am for planned maintenance. Initially, we thought to have all requests return a 503 (service unavailable) response but the web server itself will be down so we are not able to have it return any response codes. Updating the DNS on the registrar will have too much lag time while it propogates out so we aren't sure exactly how to handle this. I had thought possibly of using a second DNS, or a service like DynDNS but that seems like a large amount of effort to set up just for some planned downtime. I have to imagine that Google understands planned website/server downtime every once in a great while. This client has pretty good rankings for some incredibly competitive terms so we want to do all that we can to make sure those rankings are preserved. What are some other potential solutions? We could totally just be overthinking this but we'd rather be safe than sorry... Thanks in advance!
Intermediate & Advanced SEO | | MichaelWeisbaum0 -
Duplicate Listings on Google Maps
About 3 weeks ago google created a duplicate listing for our law firm on google maps. In building links I have tried very hard to ensure that our address and company name was always listed identically. Our correct firm name and address is Feldman Feldman & Associates, PC 2221 Camino Del Rio South, Suite 201 inevitably somehow the new listing stated Camino Del Rio S, Ste 201 All of our reviews moved over to this new profile, I claimed it, changed it to make it the same reported it to Google. Google merged them. Now Google has created another profile this time the firm name and address matches ours exactly (South and Suite both spelled out), but all of the reviews have moved over except for the most recent one(s). I have claimed it again and reported it to google, changed the address. Google then created another listing. Our page rank for keywords has been hurt by this. any idea why this keeps happening suggestions? Here are the two pages. This is our original listing http://maps.google.com/maps/place?hl=en&cid=468564492130231259 This is the new one google self created that stole all our reviews, but is ranked very poorly for the keyword searches. http://maps.google.com/maps/place?&cid=468564492130231259
Intermediate & Advanced SEO | | jfeld2220