The Moz Q&A Forum

    Moz Q&A is closed.

    After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content (many posts will still be possible to view), we have locked both new posts and new replies.

    How to stop URLs that include query strings from being indexed by Google

    Intermediate & Advanced SEO
    • McTaggart

      Hello Mozzers

      Would you use rel=canonical, robots.txt, or Google Webmaster Tools to stop search engines from indexing URLs that include query strings/parameters? Or perhaps a combination?

      I guess it would be a good idea to stop the search engines from crawling these URLs, because the content they display tends to be duplicate content and of low value to users.

      I would be tempted to use a combination of canonicalization and robots.txt for every page I do not want crawled or indexed, yet perhaps Google Webmaster Tools is the best way to go, or just as effective? And I suppose some use meta robots tags too.

      Does Google take a position on being blocked from web pages?

      Thanks in advance, Luke

      • CleverPhD

        Without a specific example, I can only outline a couple of options here. I am going to assume that you have an ecommerce site where parameters are being used for sort functions on search results or for different options on a given product.

        I know you may not be able to do this, but using parameters in this case is a bad idea to start with. If you can (and I know this can be difficult), find a way to rework the site so that it functions without parameters.

        You could use canonicals, but Google would still crawl all those pages and then follow the canonical link to work out which page is canonical. That is a big waste of Google's time. Why waste Googlebot's time on crawling a bunch of pages that you do not want crawled anyway? I would rather Googlebot focus on crawling your most important pages.
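To make the canonical option concrete, here is a minimal sketch of how a parameterized URL could be mapped to its canonical form and rendered as a link element. It assumes the canonical page is simply the same path with the query string removed, which will not hold for every site; the example.com URL and both function names are made up for illustration.

```python
import html
from urllib.parse import urlsplit, urlunsplit


def canonical_url(url: str) -> str:
    """Return the URL with its query string and fragment removed.

    Assumption: the parameter-free path is the page you want to rank.
    """
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


def canonical_link_tag(url: str) -> str:
    """Build the <link rel="canonical"> element for the page's <head>."""
    href = html.escape(canonical_url(url), quote=True)
    return f'<link rel="canonical" href="{href}">'


print(canonical_link_tag("https://www.example.com/shoes?sort=price&color=red"))
```

Note that Googlebot still has to fetch the parameterized page to see this tag, which is exactly the crawl cost described above.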

        You can use the robots.txt file to stop Google from crawling sections of your site. The only issue is that if some of your pages with a bunch of parameters in them are ranking, you would lose that traffic once you tell Google to stop crawling them.
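As a rough illustration of the robots.txt option, the sketch below holds a rule that disallows any URL containing a query string and checks sample paths against it. It relies on the `*` and `$` wildcards that Google documents for robots.txt. The matcher is a simplified stand-in written for this example, not Google's parser: it ignores Allow rules, user-agent groups, and rule precedence.

```python
import re

# A robots.txt that blocks crawling of every URL with a query string.
ROBOTS_TXT = """\
User-agent: *
Disallow: /*?
"""


def rule_to_regex(rule: str) -> "re.Pattern[str]":
    """Translate a path rule using * and $ wildcards into a regex."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    pattern = ".*".join(re.escape(piece) for piece in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def disallow_rules(robots_txt: str) -> list:
    """Collect the non-empty Disallow values (all groups lumped together)."""
    rules = []
    for line in robots_txt.splitlines():
        if line.lower().startswith("disallow:"):
            value = line.split(":", 1)[1].strip()
            if value:
                rules.append(value)
    return rules


def is_blocked(path_and_query: str, rules: list) -> bool:
    """True if any Disallow rule matches from the start of the path."""
    return any(rule_to_regex(r).match(path_and_query) for r in rules)


rules = disallow_rules(ROBOTS_TXT)
for candidate in ("/shoes", "/shoes?sort=price"):
    print(candidate, "blocked" if is_blocked(candidate, rules) else "crawlable")
```

Running a list of your currently ranking URLs through a check like this before deploying the rule is one way to spot the traffic risk mentioned above.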

        It is not that Google does not "like" being blocked by robots.txt, or that it does not "like" the use of the canonical tag. These are directives that Google follows in a particular way, so implementing them incorrectly or in the wrong sequence can cause negative results: you have told Google to do something without fully understanding what will happen.

        Here is what I would do. This is the long version, for long-term success.

        1. Look at Google Analytics (or other Analytics) and Moz tools and see what pages are ranking and sending you traffic. Make note of your results.

        2. Think of the simplest way that you could organize your site that would be logical to your users and would allow Google to crawl every page you deem important. Creating a hierarchical sitemap is a good way to do this. How does this relate to what you found in #1?

        3. Rework your URL structure to reflect what you found in #2 without using parameters. If you have to use parameters, then make sure Google can crawl your basic sitemap without using any of the parameters. Then use robots.txt to block the crawling of any parameterized URLs on your site. You have now ensured that Google can crawl and rank the pages without parameters, and that no important page or page information is hidden on a page that uses parameters.

        There are other reasons not to use parameters (e.g. clean URLs are easier for users to remember, tend to be shorter, etc.), so think about whether you want to get rid of them.

        4. 301 redirect all your main traffic pages from the old URL structure to the new URL structure. Show 404s for all the remaining old pages, including the ones with parameters. That way all the good pages will move to the new URL structure and the bad ones will go away.
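The redirect step could be sketched as a lookup table: old URLs that earned traffic get a 301 to their new home, and every other old URL gets a 404. The paths in `REDIRECT_MAP` and the function name are hypothetical, and the sketch assumes it is only ever called for requests to the old URL structure.

```python
from typing import Optional, Tuple

# Hypothetical mapping from old, traffic-earning URLs (path plus query)
# to their replacements in the new, parameter-free structure.
REDIRECT_MAP = {
    "/catalog.php?cat=12": "/shoes/",
    "/catalog.php?cat=12&item=345": "/shoes/trail-runner/",
}


def respond_to_old_url(path_and_query: str) -> Tuple[int, Optional[str]]:
    """Return (status code, redirect target) for a request to an old URL."""
    target = REDIRECT_MAP.get(path_and_query)
    if target is not None:
        return 301, target  # permanent redirect for pages worth keeping
    return 404, None  # all other old URLs are left to drop out


print(respond_to_old_url("/catalog.php?cat=12"))
print(respond_to_old_url("/catalog.php?cat=12&sort=price"))
```

The list of keys would come from the traffic review in step 1, so nothing that currently ranks is left out of the map by accident.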

        Now, if you are stuck using parameters, I would do a variant of the above. Still check whether any important or well-ranked pages use parameters, and consider whether a canonical on those pages can point Google to the page that should rank. For all the other pages, I would use the noindex directive to get them out of the Google index, then later use robots.txt to block Google from crawling them. Do this in sequence: if you block Google first, it will never see the noindex directive.

        Now, everything I said above is generally "correct", but depending on your situation, things may need to be tweaked. I hope the information I gave helps you work out the best options for your site and your customers.
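The sequencing can be sketched as two small decisions: which response header a parameterized URL should carry while it is being removed from the index, and when it becomes safe to add the robots.txt block. `X-Robots-Tag: noindex` is a header Google supports as an alternative to the meta robots tag; the keep-list, the function names, and the idea of driving the switch from a count of still-indexed URLs are assumptions made for this example.

```python
from urllib.parse import urlsplit

# Hypothetical: well-ranked parameterized URLs that get a canonical
# instead of noindex.
KEEP_WITH_CANONICAL = {"/shoes?color=red"}


def robots_headers(url: str) -> dict:
    """Phase 1: headers to send while Google can still crawl the page."""
    parts = urlsplit(url)
    if not parts.query:
        return {}  # parameter-free pages stay indexable
    if f"{parts.path}?{parts.query}" in KEEP_WITH_CANONICAL:
        return {}  # handled with rel=canonical instead
    return {"X-Robots-Tag": "noindex"}


def next_step(indexed_param_urls: int) -> str:
    """Phase 2 gate: only block crawling once the noindex has taken effect.

    indexed_param_urls is a count you would check yourself, for example
    in Search Console; it is not computed here.
    """
    if indexed_param_urls > 0:
        return "keep serving noindex"
    return "add robots.txt Disallow"


print(robots_headers("https://www.example.com/shoes?sort=price"))
print(next_step(indexed_param_urls=40))
```

Blocking in robots.txt while any of those URLs are still indexed would stop Googlebot from ever fetching the noindex header, which is the ordering problem described above.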

        Good luck!
