Skip to content
    Moz logo Menu open Menu close
    • Products
      • Moz Pro
      • Moz Pro Home
      • Moz Local
      • Moz Local Home
      • STAT
      • Moz API
      • Moz API Home
      • Compare SEO Products
      • Moz Data
    • Free SEO Tools
      • Domain Analysis
      • Keyword Explorer
      • Link Explorer
      • Competitive Research
      • MozBar
      • More Free SEO Tools
    • Learn SEO
      • Beginner's Guide to SEO
      • SEO Learning Center
      • Moz Academy
      • SEO Q&A
      • Webinars, Whitepapers, & Guides
    • Blog
    • Why Moz
      • Digital Marketers
      • Agency Solutions
      • Enterprise Solutions
      • Small Business Solutions
      • The Moz Story
      • New Releases
    • Log in
    • Log out
    • Products
      • Moz Pro

        Your all-in-one suite of SEO essentials.

      • Moz Local

        Raise your local SEO visibility with complete local SEO management.

      • STAT

        SERP tracking and analytics for enterprise SEO experts.

      • Moz API

        Power your SEO with our index of over 44 trillion links.

      • Compare SEO Products

        See which Moz SEO solution best meets your business needs.

      • Moz Data

        Power your SEO strategy & AI models with custom data solutions.

      NEW Keyword Suggestions by Topic
      Moz Pro

      NEW Keyword Suggestions by Topic

      Learn more
    • Free SEO Tools
      • Domain Analysis

        Get top competitive SEO metrics like DA, top pages and more.

      • Keyword Explorer

        Find traffic-driving keywords with our 1.25 billion+ keyword index.

      • Link Explorer

        Explore over 40 trillion links for powerful backlink data.

      • Competitive Research

        Uncover valuable insights on your organic search competitors.

      • MozBar

        See top SEO metrics for free as you browse the web.

      • More Free SEO Tools

        Explore all the free SEO tools Moz has to offer.

      NEW Keyword Suggestions by Topic
      Moz Pro

      NEW Keyword Suggestions by Topic

      Learn more
    • Learn SEO
      • Beginner's Guide to SEO

        The #1 most popular introduction to SEO, trusted by millions.

      • SEO Learning Center

        Broaden your knowledge with SEO resources for all skill levels.

      • On-Demand Webinars

        Learn modern SEO best practices from industry experts.

      • How-To Guides

        Step-by-step guides to search success from the authority on SEO.

      • Moz Academy

        Upskill and get certified with on-demand courses & certifications.

      • MozCon

        Save on Early Bird tickets and join us in London or New York City

      Unlock flexible pricing & new endpoints
      Moz API

      Unlock flexible pricing & new endpoints

      Find your plan
    • Blog
    • Why Moz
      • Digital Marketers

        Simplify SEO tasks to save time and grow your traffic.

      • Small Business Solutions

        Uncover insights to make smarter marketing decisions in less time.

      • Agency Solutions

        Earn & keep valuable clients with unparalleled data & insights.

      • Enterprise Solutions

        Gain a competitive edge in the ever-changing world of search.

      • The Moz Story

        Moz was the first & remains the most trusted SEO company.

      • New Releases

        Get the scoop on the latest and greatest from Moz.

      Surface actionable competitive intel
      New Feature

      Surface actionable competitive intel

      Learn More
    • Log in
      • Moz Pro
      • Moz Local
      • Moz Local Dashboard
      • Moz API
      • Moz API Dashboard
      • Moz Academy
    • Avatar
      • Moz Home
      • Notifications
      • Account & Billing
      • Manage Users
      • Community Profile
      • My Q&A
      • My Videos
      • Log Out

    The Moz Q&A Forum

    • Forum
    • Questions
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. Home
    2. SEO Tactics
    3. Intermediate & Advanced SEO
    4. PDF for link building - avoiding duplicate content

    Moz Q&A is closed.

    After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

    PDF for link building - avoiding duplicate content

    Intermediate & Advanced SEO
    4
    14
    3132
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with question management privileges can see it.
    • BobGW
      BobGW last edited by

      Hello,

      We've got an article that we're turning into a PDF. Both the article and the PDF will be on our site. This PDF is a good, thorough piece of content on how to choose a product.

      We're going to strip out all of the links to our in the article and create this PDF so that it will be good for people to reference and even print. Then we're going to do link building through outreach since people will find the article and PDF useful.

      My question is, how do I use rel="canonical" to make sure that the article and PDF aren't duplicate content?

      Thanks.

      1 Reply Last reply Reply Quote 0
      • Marcus_Miller
        Marcus_Miller @BobGW last edited by

        Hey Bob

        I think you should forget about any kind of perceived conventions and have whatever you think works best for your users and goals.

        Again, look at unbounce, that is a custom landing page with a homepage link (to share the love) but not the general site navigation.

        They also have a footer to do a bit more link love but really, do what works for you.

        Forget conventions - do what works!

        Hope that helps
        Marcus

        1 Reply Last reply Reply Quote 0
        • BobGW
          BobGW @BobGW last edited by

          I see, thanks! I think it's important not to have the ecommerce navigation on the page promoting the pdf. What would you say is ideal as far as the graphical and navigation components of the page with the PDF on it - what kind of navigation and graphical header should I have on it?

          1 Reply Last reply Reply Quote 0
          • Marcus_Miller
            Marcus_Miller @BobGW last edited by

            Yep, check the HTTP headers with webbug or there are a bunch of browser plugins that will let you see the headers for the document.

            That said, I would push to drive the links to the page though rather than the document itself and just create a nice page that houses the document and make that the link target.

            You could even make the PDF link only available by email once they have singed up or some such as canonical is only a directive and you would still be better getting those links flooding into a real page on the site.

            You could even offer up some HTML to make this easier for folks to link to that linked to your main page. If you take a look at any savvy infographics etc folks will try to draw a link into a page rather than the image itself for the very same reasons.

            If you look at something like the Noobs Guide to Online Marketing from Unbounce then you will see something like this as the suggested linking code:

            [](<strong>http://unbounce.com/noob-guide-to-online-marketing-infographic/</strong>)

            [The Noob Guide to Online Marketing - Infographic](<strong>http://unbounce.com/noob-guide-to-online-marketing-infographic/</strong>)

            [](<strong>http://unbounce.com/noob-guide-to-online-marketing-infographic/</strong>)

            Unbounce – The DIY Landing Page Platform

            So, the image is there but the link they are pimping is a standard page:

            http://unbounce.com/noob-guide-to-online-marketing-infographic/

            They also cheekily add an extra homepage link in as well with some keywords and the brand so if folks don't remove that they still get that benefit.

            Ultimately, it means that when links flood into the site they benefit the whole site rather than just promote one PDF.

            Just my tuppence! 
            Marcus

            1 Reply Last reply Reply Quote 0
            • BobGW
              BobGW @Marcus_Miller last edited by

              Thanks for the code Marcus.

              Actually, the pdf is what people will be linking to. It's a guide for websites. I think the PDF will be much easier to promote than the article.I assume so anyway.

              Is there a way to make sure my canonical code in htaccess is working after I insert the code?

              Thanks again,

              Bob

              Marcus_Miller BobGW 3 Replies Last reply Reply Quote 0
              • Marcus_Miller
                Marcus_Miller last edited by

                Hey Bob

                There is a much easier way to do this and simply have your PDFs that you don't want indexed in a folder that you block access to in robots.txt. This way you can just drop PDFs into articles and link to them knowing full well these pages will not be indexed.

                Assuming you had a PDF called article.pdf in a folder called pdfs/ then the following would prevent indexation.

                User-agent: * Disallow: /pdfs/

                Or to just block the file itself:

                User-agent: *
                Disallow: /pdfs/yourfile.pdf Additionally, There is no reason not to add the canonical link as well and if you find people are linking directly to the PDF then having this would ensure that the equity associated with those links was correctly attributed to the parent page (always a good thing).

                Header add Link '<http: www.url.co.uk="" pdfs="" article.html="">; </http:> rel="canonical"'

                Generally, there are better ways to block indexation than with robots.txt but in the case of PDFs, we really don't want these files indexed as they make for such poor landing pages (no navigation) and we certainly want to remove any competition or duplication between the page and the PDF so in this case, it makes for a quick, painless and suitable solution.

                Hope that helps!
                Marcus

                BobGW 1 Reply Last reply Reply Quote 2
                • BobGW
                  BobGW @BobGW last edited by

                  Thanks ThompsonPaul,

                  Say the pdf is located at

                  domain.com/pdfs/white-papers.pdf

                  and the article that I want to rank is at

                  domain.com/articles/article.html

                  do I simply add this to my htaccess file?:

                  Header add Link "<http: www.domain.com="" articles="" article.html="">; rel="canonical""</http:>

                  1 Reply Last reply Reply Quote 0
                  • ThompsonPaul
                    ThompsonPaul @BobGW last edited by

                    You can insert the canonical header link using your site's .htaccess file, Bob. I'm sure Hostgator provides access to the htaccess file through ftp (sometimes you have to turn on "show hidden files") or through the file manager built into your cPanel.

                    Check tip #2 in this recent SEOMoz blog article for specifics:
                    seomoz.org/blog/htaccess-file-snippets-for-seos

                    Just remember too - you will want to do the same kind of on-page optimization for the PDF as you do for regular pages.

                    • Give it a good, descriptive, keyword-appropriate, dash-separated file name. (essential for usability as well, since it will become the title of the icon when saved to someone's desktop)
                    • Fill out the metadata for the PDF, especially the Title and Description. In Acrobat it's under File -> Properties -> Description tab (to get the meta-description itself, you'll need to click on the Additional Metadata button)

                    I'd be tempted to build the links to the html page as much as possible as those will directly help ranking, unlike the PDF's inbound links which will have to pass their link juice through the canonical, assuming you're using it. Plus, the visitor will get a preview of the PDF's content and context from the rest of your site which which may increase trust and engender further engagement..

                    Your comment about links in the PDF got kind of muddled, but you'll definitely want to make certain there are good links and calls to action back to your website within the PDF - preferably on each page. Otherwise there's no clear "next step" for users reading the PDF back to a purchase on your site. Make sure to put Analytics tracking tags on these links so you can assess the value of traffic generated back from the PDF - otherwise the traffic will just appear as Direct in your Analytics.

                    Hope that all helps;

                    Paul

                    1 Reply Last reply Reply Quote 2
                    • BobGW
                      BobGW @BobGW last edited by

                      Can I just use htaccess?

                      See here: http://www.seomoz.org/blog/how-to-advanced-relcanonical-http-headers

                      We only have one pdf like this right now and we plan to have no more than five.

                      Say the pdf is located at

                      domain.com/pdfs/white-papers.pdf

                      and the article that I want to rank is at

                      domain.com/articles/article.pdf

                      do I simply add this to my htaccess file?:

                      Header add Link "<http: www.domain.com="" articles="" article.pdf="">; rel="canonical""</http:>

                      1 Reply Last reply Reply Quote 0
                      • BobGW
                        BobGW @BobGW last edited by

                        How do I know if I can do an HTTP header request? I'm using shared hosting through hostgator.

                        1 Reply Last reply Reply Quote 0
                        • DoRM
                          DoRM @BobGW last edited by

                          PDF seem to not rank as well as other normal webpages.  They still rank do not get me wrong, we have over 100 pdf pages that get traffic for us. The main version is really up to you, what do you want to show in the search results.  I think it would be easier to rank for a normal webpage though.  If you are doing a rel="canonical"  it will pass most of the link juice, not all but most.

                          1 Reply Last reply Reply Quote 0
                          • DoRM
                            DoRM @BobGW last edited by

                            PDF seem to not rank as well as other normal webpages.  They still rank do not get me wrong, we have over 100 pdf pages that get traffic for us. The main version is really up to you, what do you want to show in the search results.  I think it would be easier to rank for a normal webpage though.  If you are doing a rel="canonical"  it will pass most of the link juice, not all but most.

                            1 Reply Last reply Reply Quote 1
                            • BobGW
                              BobGW @DoRM last edited by

                              Thank you DoRM,

                              I assume that the PDF is what I want to be the main version since that is what I'll be marketing, but I could be wrong? What if I get backlinks to both pages, will both sets of backlinks count?

                              DoRM BobGW ThompsonPaul 6 Replies Last reply Reply Quote 0
                              • DoRM
                                DoRM last edited by

                                Indicate the canonical version of a URL by responding with the Link rel="canonical" HTTP header. Addingrel="canonical" to the head section of a page is useful for HTML content, but it can't be used for PDFs and other file types indexed by Google Web Search. In these cases you can indicate a canonical URL by responding with the Link rel="canonical" HTTP header, like this (note that to use this option, you'll need to be able to configure your server):

                                Link: <http: www.example.com="" downloads="" white-paper.pdf="">; rel="canonical"</http:> 
                                

                                Google currently supports these link header elements for Web Search only.

                                You can read more her http://support.google.com/webmasters/bin/answer.py?hl=en&answer=139394

                                BobGW 1 Reply Last reply Reply Quote 1
                                • 1 / 1
                                • First post
                                  Last post

                                Got a burning SEO question?

                                Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.


                                Start my free trial


                                Browse Questions

                                Explore more categories

                                • Moz Tools

                                  Chat with the community about the Moz tools.

                                • SEO Tactics

                                  Discuss the SEO process with fellow marketers

                                • Community

                                  Discuss industry events, jobs, and news!

                                • Digital Marketing

                                  Chat about tactics outside of SEO

                                • Research & Trends

                                  Dive into research and trends in the search industry.

                                • Support

                                  Connect on product support and feature requests.

                                • See all categories

                                Related Questions

                                • ycnetpro101

                                  Duplicate content in Shopify - subsequent pages in collections

                                  Hello everyone! I hope an expert in this community can help me verify the canonical codes I'll add to our store is correct. Currently, in our Shopify store, the subsequent pages in the collections are not indexed by Google, however the canonical URL on these pages aren't pointing to the main collection page (page 1), e.g. The canonical URL of page 2, page 3 etc are used as canonical URLs instead of the first page of the collections. I have the canonical codes attached below, it would be much appreciated if an expert can urgently verify these codes are good to use and will solve the above issues? Thanks so much for your kind help in advance!! -----------------CODES BELOW--------------- <title><br /> {{ page_title }}{% if current_tags %} – tagged "{{ current_tags | join: ', ' }}"{% endif %}{% if current_page != 1 %} – Page {{ current_page }}{% endif %}{% unless page_title contains shop.name %} – {{ shop.name }}{% endunless %}<br /></title>
                                  {% if page_description %} {% endif %} {% if current_page != 1 %} {% else %} {% endif %}
                                  {% if template == 'collection' %}{% if collection %}
                                  {% if current_page == 1 %} {% endif %}
                                  {% if template == 'product' %}{% if product %} {% endif %}
                                  {% if template == 'collection' %}{% if collection %} {% endif %}

                                  Intermediate & Advanced SEO | | ycnetpro101
                                  0
                                • Kingalan1

                                  Reasonable Cost for Link Building Service

                                  We need about 5-10 high quality links to our website created every month. We need the link targets researched and outreach done to these sites. The sites most be legitimate and high quality; decent domain authority, real sites, not phony low quality sites. Sites that would show traffic in similarweb.com with decent metrics. We absolutely want to avoid any link building schemes that could get us penalized. I have been told that such a project would take a qualified SEO about 8-10 hours per months (more during the additional month of research, less afterward). As such, what is a reasonable cost for these 5-10 links per month? $300, $500, $700, more? I only want to work with a highly experienced SEO, native english speaker with extensive experience. What is fair? I don't want to overpay or to under pay. Thanks, Alan

                                  Intermediate & Advanced SEO | | Kingalan1
                                  0
                                • yacpro13

                                  Duplicate content on URL trailing slash

                                  Hello, Some time ago, we accidentally made changes to our site which modified the way urls in links are generated. At once, trailing slashes were added to many urls (only in links). Links that used to send to
                                  example.com/webpage.html Were now linking to
                                  example.com/webpage.html/ Urls in the xml sitemap remained unchanged (no trailing slash). We started noticing duplicate content (because our site renders the same page with or without the trailing shash). We corrected the problematic php url function so that now, all links on the site link to a url without trailing slash. However, Google had time to index these pages. Is implementing 301 redirects required in this case?

                                  Intermediate & Advanced SEO | | yacpro13
                                  1
                                • marcandre

                                  Woocommerce SEO & Duplicate content?

                                  Hi Moz fellows, I'm new to Woocommerce and couldn't find help on Google about certain SEO-related things. All my past projects were simple 5 pages websites + a blog, so I would just no-index categories, tags and archives to eliminate duplicate content errors. But with Woocommerce Product categories and tags, I've noticed that many e-Commerce websites with a high domain authority actually rank for certain keywords just by having their category/tags indexed. For example keyword 'hippie clothes' = etsy.com/category/hippie-clothes (fictional example) The problem is that if I have 100 products and 10 categories & tags on my site it creates THOUSANDS of duplicate content errors, but If I 'non index' categories and tags they will never rank well once my domain authority rises... Anyone has experience/comments about this? I use SEO by Yoast plugin. Your help is greatly appreciated! Thank you in advance. -Marc

                                  Intermediate & Advanced SEO | | marcandre
                                  1
                                • BobAnderson

                                  Tabs and duplicate content?

                                  We own this site http://www.discountstickerprinting.co.uk/ and just a little concerned as I right clicked open in new tab on the tab content section and it went to a new page For example if you right click on the price tab and click open in new tab you will end up with the url
                                  http://www.discountstickerprinting.co.uk/#tabThree Does this mean that our content is being duplicated onto another page? If so what should I do?

                                  Intermediate & Advanced SEO | | BobAnderson
                                  0
                                • AxialDev

                                  How do I geo-target continents & avoid duplicate content?

                                  Hi everyone, We have a website which will have content tailored for a few locations: USA: www.site.com
                                  Europe EN: www.site.com/eu
                                  Canada FR: www.site.com/fr-ca Link hreflang and  the GWT option are designed for countries. I expect a fair amount of duplicate content; the only differences will be in product selection and prices. What are my options to tell Google that it should serve www.site.com/eu in Europe instead of www.site.com? We are not targeting a particular country on that continent. Thanks!

                                  Intermediate & Advanced SEO | | AxialDev
                                  0
                                • sbaylor

                                  Artist Bios on Multiple Pages: Duplicate Content or not?

                                  I am currently working on an eComm site for a company that sells art prints. On each print's page, there is a bio about the artist followed by a couple of paragraphs about the print. My concern is that some artists have hundreds of prints on this site, and the bio is reprinted on every page,which makes sense from a usability standpoint, but I am concerned that it will trigger a duplicate content penalty from Google. Some people are trying to convince me that Google won't penalize for this content, since the intent is not to game the SERPs. However, I'm not confident that this isn't being penalized already, or that it won't be in the near future. Because it is just a section of text that is duplicated, but the rest of the text on each page is original, I can't use the rel=canonical tag. I've thought about putting each artist bio into a graphic, but that is a huge undertaking, and not the most elegant solution. Could I put the bio on a separate page with only the artist's info and then place that data on each print page using an <iframe>and then put a noindex,nofollow in the robots.txt file?</p> <p>Is there a better solution?  Is this effort even necessary?</p> <p>Thoughts?</p></iframe>

                                  Intermediate & Advanced SEO | | sbaylor
                                  0
                                • rball1

                                  Increasing Internal Links But Avoiding a Link Farm

                                  I'm looking to create a page about Widgets and all of the more specific names for Widgets we sell: ABC Brand Widgets, XYZ Brand Widgets, Big Widgets, Small Widgets, Green Widgets, Blue Widgets, etc. I'd like my Widget page to give a brief explanation about each kind of Widget with a link deeper into my site that gives more detail and allows you to purchase. The problem is I have a lot of Widgets and this could get messy: ABC Green Widgets, Small XYZ Widgets, many combinations. I can see my Widget page teetering on being a link farm if I start throwing in all of these combos. So where should I stop? How much do I do? I've read more than 100 links on a page being considered a link farm, is that a hardline number or a general guideline?

                                  Intermediate & Advanced SEO | | rball1
                                  0

                                Get started with Moz Pro!

                                Unlock the power of advanced SEO tools and data-driven insights.

                                Start my free trial
                                Products
                                • Moz Pro
                                • Moz Local
                                • Moz API
                                • Moz Data
                                • STAT
                                • Product Updates
                                Moz Solutions
                                • SMB Solutions
                                • Agency Solutions
                                • Enterprise Solutions
                                Free SEO Tools
                                • Domain Authority Checker
                                • Link Explorer
                                • Keyword Explorer
                                • Competitive Research
                                • Brand Authority Checker
                                • Local Citation Checker
                                • MozBar Extension
                                • MozCast
                                Resources
                                • Blog
                                • SEO Learning Center
                                • Help Hub
                                • Beginner's Guide to SEO
                                • How-to Guides
                                • Moz Academy
                                • API Docs
                                About Moz
                                • About
                                • Team
                                • Careers
                                • Contact
                                Why Moz
                                • Case Studies
                                • Testimonials
                                Get Involved
                                • Become an Affiliate
                                • MozCon
                                • Webinars
                                • Practical Marketer Series
                                • MozPod
                                Connect with us

                                Contact the Help team

                                Join our newsletter
                                Moz logo
                                © 2021 - 2025 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                                • Accessibility
                                • Terms of Use
                                • Privacy

                                Looks like your connection to Moz was lost, please wait while we try to reconnect.