Google indexing my website's Search Results pages. Should I block this?
-
After running the SEOmoz crawl test, i have a spreadsheet of 11,000 urls of which 6381 urls are search results pages from our website that have been indexed.
I know I've read that /search should be blocked from the engines, but can't seem to find that information at this point. Does anyone have facts behind why they should be blocked? Or not blocked?
-
Since you already released these out to the wild, I would analyze which search results pages are bringing in traffic and use that analysis to create new category pages on your site. I would certainly block the search parameter in the Webmaster tools and in robots.txt.. Most internal search results pages have little content value and the engines now look at your site as a whole and if a certain percentage of the site is low quality, the whole site will be penalized.
-
Jenny,
Take a look at this post in the forums on indexing issues with site search - http://www.seomoz.org/q/block-search-engines-from-urls-created-by-internal-search-engine.
Allowing site search to be indexed can result in a ton of duplicate content on your site. I recommend taking the meta noindex approach.
-
well the simple answer for you is Google allocate a crawl budget based on multiple factors.
with your current setting the crawlers and wondering and going after these search page that add no value to the web. and losing alot of your budget on these search pages, where i would definitely direct these crawlers to crawl the content and update it whenever u add or update a page.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Any SEO disadvantages with creating pages under a directory page which doesn't exists?
Hi, Let's say we are going to create pages in the URL path www.website.com/directory/sub-pages/. In case this page www.website.com/directory/ doesn't exists or redirected; will the pages created in this URL path like stated above have any issues in-terms of SEO? We will link these pages from somewhere in the website and planning to redirect the /directory/ to homepage. Suggestions please.
Algorithm Updates | | vtmoz1 -
Non-indexed or indexed top hierarchy pages get high PageRank at Google?
Hi, We are creating some pages just to capture leads from blog-posts. We created few pages at top hierarchy like website.com/new-page/. I'm just wondering if these pages will take away more PageRank. Do we need to create these pages at low hierarchy like website.com/folder/new-page to avoid passing more PageRank? Is this is how PR distributed even now and it's same for indexed or non-indexed pages? Thanks
Algorithm Updates | | vtmoz0 -
Please help explain this (Question about search results)
What's up SEO's, I'm new the SEO world and had a quick question. I just installed the MOZBAR and did a google search: "What is Google Voice" I attached an image of the results I received. Can someone explain how MacWorld's article outranked Google's when both Google's Page Authority and Domain Authority are so much stronger than MacWorlds. This is in addition to google having many more links. This is basic, but any insight will be very helpful. Thanks guys! [Screen%20Shot%202014-02-18%20at%206.08.15%20PM.png](file:///Users/jackfarrell/Desktop/Screen%20Shot%202014-02-18%20at%206.08.15%20PM.png)
Algorithm Updates | | Petbrosia1 -
Struggling with Google Bot Blocks - Please help!
I own a site called www.wheretobuybeauty.com.au After months and months we still have a serious issue with all pages having blocked URLs according to Google Webmaster Tools. The 404 errors are returning a 200 header code according to the email below. Do you agree that the 404.php code should be changed? Can you do that please ? The current state: Google webmaster tools Index Status shows: 26,000 pages indexed 44,000 pages blocked by robots. In late March, we implemented a change recommended by an SEO expert and he provided a new robots.txt file, advised that we should amend sitemap.xml and other changes. We implemented those changes and then setup a re-index of the site by google. The no of blocked URLs eventually reduced in May and June to 1,000 for a few days – but now the problem has rapidly returned. The no of pages that are displayed in a google search request of www.google.com.au where the query was ‘site:wheretobuybeauty.com.au’ is 37,000: This new site has been re-crawled over last 4 weeks. About the site This is a Linux php site and has the following: 55,000 URLs in sitemap.xml submitted successfully to webmaster tools robots.txt file has been modified several times: Firstly we had none Then we created one but were advised that it needed to have this current content: User-agent: * Disallow: Sitemap: http://www.wheretobuybeauty.com.au/sitemap.xml
Algorithm Updates | | socialgrowth0 -
Are you still seeing success with EMD's?
I am curious if any other SEO's are still seeing success with exact matching domains. I am not seeing ANY changes to any of my clients rankings since the "Exact Match Domain" filter came about in September. Also while I have conducted SERP audits in my neck of the woods I am noticing EMD's are still doing very well. What are you seeing?
Algorithm Updates | | clarktbell0 -
Stop google indexing CDN pages
Just when I thought I'd seen it all, google hits me with another nasty surprise! I have a CDN to deliver images, js and css to visitors around the world. I have no links to static HTML pages on the site, as far as I can tell, but someone else may have - perhaps a scraper site? Google has decided the static pages they were able to access through the CDN have more value than my real pages, and they seem to be slowly replacing my pages in the index with the static pages. Anyone got an idea on how to stop that? Obviously, I have no access to the static area, because it is in the CDN, so there is no way I know of that I can have a robots file there. It could be that I have to trash the CDN and change it to only allow the image directory, and maybe set up a separate CDN subdomain for content that only contains the JS and CSS? Have you seen this problem and beat it? (Of course the next thing is Roger might look at google results and start crawling them too, LOL) P.S. The reason I am not asking this question in the google forums is that others have asked this question many times and nobody at google has bothered to answer, over the past 5 months, and nobody who did try, gave an answer that was remotely useful. So I'm not really hopeful of anyone here having a solution either, but I expect this is my best bet because you guys are always willing to try.
Algorithm Updates | | loopyal0 -
Google's reaction to site updates
Hi, Is it safe to assume as soon as Google indexes updates I've made to my site that any ranking changes the updates effected will happen at that same time, or is there ever a lag time before these changes ( if any ) take effect?
Algorithm Updates | | minutiae0 -
If a page one result for a keyword is mostly directories, do I have a chance to rank for this keyword?
I feel like although directories carry a lot of weight and links, I'd think that my client would be able to gain a top position, since none of the others are competitor pages, nor are the directories engaging.
Algorithm Updates | | randallseo0