    The Moz Q&A Forum

    Moz Q&A is closed.

    After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

    How to block "print" pages from indexing

    Technical SEO
    • dreadmichael
      dreadmichael last edited by

      I have a fairly large FAQ section and every article has a "print" button. Unfortunately, this is creating a page for every article which is muddying up the index - especially on my own site using Google Custom Search.

      Can you recommend a way to block this from happening?

      Example Article:

      http://www.knottyboy.com/lore/idx.php/11/183/Maintenance-of-Mature-Locks-6-months-/article/How-do-I-get-sand-out-of-my-dreads.html

      Example "Print" page:

      http://www.knottyboy.com/lore/article.php?id=052&action=print

      • NakulGoyal
        NakulGoyal @dreadmichael last edited by

        Donnie, I agree. However, we had the same problem on a website, and here's what we did: we used the canonical tag.

        Over a period of 3-4 weeks, all those print pages disappeared from the SERP. Now if I take a print URL and do a cache: for that page, it shows me the web version of that page.

        So yes, I agree the question was about blocking the pages from getting indexed. There's no real recipe here; it's about finding the right solution. Before the canonical tag, robots.txt was the only solution. But now, with canonical available (provided one has the time and resources to implement it, versus adding one line of text to robots.txt), you can effectively 301 the pages without having to stop or restrict the spiders from crawling them.

        Absolutely no offence to your solution in any way. Both are indeed workable solutions. The best part is that your robots.txt solution takes 30 seconds to implement, since you provided the actual disallow code :), so it's better.

        • dreadmichael
          dreadmichael @SEODinosaur last edited by

          Thanks Jennifer, will do! So much good information.

          • Dr-Pete
            Dr-Pete Staff @SEODinosaur last edited by

            Sorry, but I have to jump in - do NOT use all of those signals simultaneously. You'll make a mess, and they'll interfere with each other. You can try Robots.txt or NOINDEX on the page level - my experience suggests NOINDEX is much more effective.

            Also, do not nofollow the links yet - you'll block the crawl, and then the page-level cues (like NOINDEX) won't work. You can nofollow later. This is a common mistake and it will keep your fixes from working.
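The warning above comes down to one rule: a page-level cue such as NOINDEX only works if crawlers can still reach and read the page. Here is a rough sketch of that check in Python. The function name `audit_signals` and its three boolean inputs are hypothetical, made up for illustration; it encodes the advice in this thread, not any search engine's documented behavior.

```python
def audit_signals(blocked_by_robots_txt: bool,
                  has_noindex: bool,
                  links_nofollowed: bool) -> list:
    """Return warnings for signal combinations that can cancel each other out.

    Hypothetical helper: the inputs describe how a single print page is
    currently configured.
    """
    warnings = []
    if has_noindex and blocked_by_robots_txt:
        # A blocked page is not crawled, so its meta tags are never seen.
        warnings.append(
            "robots.txt blocks crawling, so the noindex tag may never be read")
    if has_noindex and links_nofollowed:
        # Nofollowed links can keep crawlers from revisiting the page.
        warnings.append(
            "nofollowed links may keep crawlers from reaching the noindex tag")
    return warnings


# Everything applied at once: both conflicts are reported.
for warning in audit_signals(blocked_by_robots_txt=True,
                             has_noindex=True,
                             links_nofollowed=True):
    print(warning)

# NOINDEX alone, with the page still crawlable: no warnings.
print(audit_signals(blocked_by_robots_txt=False,
                    has_noindex=True,
                    links_nofollowed=False))
```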

            • jennita
              jennita @SEODinosaur last edited by

              Josh, please read my and Dr. Pete's comments below. Don't nofollow the links, but do use the meta noindex,follow on the page.

              • Dr-Pete
                Dr-Pete Staff @SEODinosaur last edited by

                Rel-canonical, in practice, does essentially de-index the non-canonical version. Technically, it's not a de-indexation method, but it works that way.

                • dreadmichael
                  dreadmichael @SEODinosaur last edited by

                  You are right Donnie. I've "good answered" you too.

                  I've gone ahead and updated my robots.txt file. As soon as I am able, I will use noindex on the page, nofollow on the links, and rel=canonical.

                  This is just what I needed, a quick fix until I can make a more permanent solution.

                  • SEODinosaur
                    SEODinosaur @dreadmichael last edited by

                    You're welcome : )

                    • SEODinosaur
                      SEODinosaur @SEODinosaur last edited by

                      Although you are correct... there is still more than one way to skin a chicken.

                      • SEODinosaur
                        SEODinosaur @dreadmichael last edited by

                        But the spiders still crawl the page and read the canonical link; with robots.txt, they will not.

                        • SEODinosaur
                          SEODinosaur @NakulGoyal last edited by

                          Yes, but rel=canonical does not block a page; it only tells Google which of two pages to follow. The question was how to block, not how to tell Google which link to follow. I believe you gave credit to the wrong answer.

                          http://en.wikipedia.org/wiki/Canonical_link_element

                          This is not fair. lol

                          • Dr-Pete
                            Dr-Pete Staff @jennita last edited by

                            I have to agree with Jen - Robots.txt isn't great for getting indexed pages out. It's good for prevention, but tends to be unreliable as a cure. META NOINDEX is probably more reliable.

                            One trick - DON'T nofollow the print links, at least not yet. You need Google to crawl and read the NOINDEX tags. Once the ?print pages are de-indexed, you could nofollow the links, too.

                            • NakulGoyal
                              NakulGoyal @dreadmichael last edited by

                              Yes, it's strongly recommended. It should be fairly simple to populate this tag with the "full" URL of the article based on the article ID. This approach will not only help you get rid of the duplicate content issue; a canonical tag also essentially works like a 301 redirect. So from a search engine's perspective, you are 301'ing your print pages to the real web URLs without redirecting the actual users who are browsing the print pages if they need to.
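As a rough illustration of populating the tag from the article ID, here is a minimal Python sketch. The lookup table `ARTICLE_URLS`, the ID `"42"`, the example.com URL and the function name `canonical_tag` are all placeholders, not details of the Lore CMS; a real site would pull the URL from wherever the CMS stores it.

```python
from html import escape

# Hypothetical lookup from article ID to the article's full web URL.
# In practice this would come from the CMS database, not a hard-coded dict.
ARTICLE_URLS = {
    "42": "http://www.example.com/faq/article/example-question.html",
}


def canonical_tag(article_id: str) -> str:
    """Build the <link rel="canonical"> tag for the given article ID."""
    url = ARTICLE_URLS[article_id]
    # Escape the URL so characters such as & or " cannot break the attribute.
    return f'<link rel="canonical" href="{escape(url, quote=True)}" />'


# The same tag would be emitted on both the web page and its print version.
print(canonical_tag("42"))
```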

                              • dreadmichael
                                dreadmichael @NakulGoyal last edited by

                                Ya it is actually really useful. Unfortunately they are out of business now - so I'm hacking it on my own.

                                I will take your advice. I've shamefully never used rel=canonical before, so now is a good time to start.

                                • jennita
                                  jennita @SEODinosaur last edited by

                                  True, but using robots.txt does not keep them out of the index. Only using "noindex" will do that.

                                  • dreadmichael
                                    dreadmichael last edited by

                                    Thanks Donnie. Much appreciated!

                                    • NakulGoyal
                                      NakulGoyal last edited by

                                      I actually remember Lore from a while ago. It's an interesting, easy to use FAQ CMS.

                                      Anyways, I would also recommend implementing canonical tags for any possible duplicate content issues. So whether it's the print or the web version, each one of them will contain a canonical tag pointing to the web URL of that article in the <head> section of your website.

                                      <link rel="canonical" href="http://www.knottyboy.com/lore/idx.php/11/183/Maintenance-of-Mature-Locks-6-months-/article/How-do-I-get-sand-out-of-my-dreads.html" />
                                      • SEODinosaur
                                        SEODinosaur @dreadmichael last edited by

                                        http://www.seomoz.org/learn-seo/robotstxt

                                        • SEODinosaur
                                          SEODinosaur @dreadmichael last edited by

                                          Try This.

                                          User-agent: *
                                          Disallow: /*&action=print
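To see which URLs this rule would catch, here is a small Python sketch of wildcard matching as Google describes it for robots.txt paths (`*` matches any run of characters, a trailing `$` anchors the end, otherwise the pattern is a prefix match). The function name `rule_matches` is made up for illustration. Python's built-in `urllib.robotparser` is not used here because it does not interpret `*` wildcards in paths.

```python
import re


def rule_matches(pattern: str, url_path: str) -> bool:
    """Return True if a wildcard-style robots.txt path pattern matches url_path."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start, which gives the prefix-match behavior.
    return re.match(regex, url_path) is not None


RULE = "/*&action=print"
print_page = "/lore/article.php?id=052&action=print"
article_page = ("/lore/idx.php/11/183/Maintenance-of-Mature-Locks-6-months-"
                "/article/How-do-I-get-sand-out-of-my-dreads.html")

print(rule_matches(RULE, print_page))    # True
print(rule_matches(RULE, article_page))  # False
```

Note that the rule depends on the literal `&`: a URL where `action=print` is the first query parameter (for example `article.php?action=print`) would not match it.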

                                          • SEODinosaur
                                            SEODinosaur @jennita last edited by

                                            There's more than one way to skin a chicken.

                                            • jennita
                                              jennita last edited by

                                              Rather than using robots.txt, I'd add a noindex,follow tag to the page instead: <meta name="robots" content="noindex,follow">. This code goes into the <head> tag of each print page. It will ensure that the pages don't get indexed but that the links are followed.
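Once the tag is in place, it is worth confirming that each print page really serves it. Below is a minimal Python sketch using only the standard library. The class name `RobotsMetaParser`, the helper `robots_directives` and the sample HTML are hypothetical; in practice the HTML would be fetched from a live print URL.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the directives from every <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {key.lower(): (value or "") for key, value in attrs}
        if attr.get("name", "").lower() == "robots":
            # content looks like "noindex,follow"; split it into directives.
            for part in attr.get("content", "").split(","):
                if part.strip():
                    self.directives.add(part.strip().lower())


def robots_directives(html: str) -> set:
    """Return the set of robots meta directives found in an HTML document."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


# Hypothetical print page markup, used here instead of a live fetch.
SAMPLE_PRINT_PAGE = """<html><head>
<title>Print view</title>
<meta name="robots" content="noindex,follow">
</head><body>...</body></html>"""

found = robots_directives(SAMPLE_PRINT_PAGE)
print("noindex" in found, "nofollow" in found)  # True False
```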

                                              • dreadmichael
                                                dreadmichael @SEODinosaur last edited by

                                                That would be great. Do you mind giving me an example?

                                                • SEODinosaur
                                                  SEODinosaur last edited by

                                                  You can block every page that ends in action=print in robots.txt.
