I have a page where you can download a PDF of the material - should I exclude the PDF from the search engines?
-
In my niche, there is a controversial research article that is very popular. I am writing a rebuttal to this article and giving another point of view.
My article has the potential to be really good link bait for my site.
The original article is often printed out to be shown to professionals in my niche. My hope is that people will do the same with mine. So, I plan to have a PDF version of my article available on my page. The article that is visible on my site (i.e. non PDF) will be a graphic rich article that is easy for the reader to go through. I plan to have the PDF have all of the same text, but it won't have as many graphics - it will look more like a scientific research article.
So, should I exclude the pdf from search engines so that it isn't duplicate content? Or does that even matter seeing as it is a duplicate of my own content? I want people to link to the main article, not the pdf.
Any tips would be greatly appreciated!
-
Thank you! This is exactly the kind of information I needed!
I was thinking contacting webmasters who published the original article to tell them about mine. But now, perhaps what I will do is not just contact them but attach a copy of the pdf for them to use.
-
Do not exclude.
People will link to it.
PDF documents can rank in the SERPs if you complete the properties portion of the document. The title in the properties will serve as a title tag for Google SERPs.
PDF documents can accumulate pagerank and pass that pagerank though any links in the PDF document. (Be sure to place a few links to your website in the PDF. Because....pdf, .ppt, .xls and many other file times display in my google webmaster tools backlinks).
Encourage other webmasters to download your pdf and post it on their server and link to it from their website. That will give you backlinks from their domain. You can get a kickass number of backlinks from this. (I usually don't advocate giving content away but I have seen success from "whitepapers" like this. You might consider offering them a "branded" copy of the document to post on their own site - you would add their branding for them.)
Its a good idea to lock the .pdf document so that others can't change it. They can always make their own document from your content but don't make it too easy for them.
I have used .pdfs and have not seen a duplicate content problem from them. However, the content of the pdf is not exactly the same as what is on an .html page of my site. It sounds like you are planning to have richer content on your site than in the .pdf so I would not worry about dupe content. Just be sure that there is a significant difference.
-
I don't think there's a problem with hosting the PDF. Just make sure you've got strong branding in the PDF and links back to your online article. People will most likely pass your PDF around to others and you want them to come visit the source --> YOU.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Do you know any website you can get in touch with bloggers?
I would like to get in touch with some good bloggers who would be happy to write a blog about my company services. Is there any website where is the list of blogs so you can get in touch with them or buy blog post on their blogs? Thanks Lukas
Content Development | | Lukas-ST0 -
I work on a uk decorating website with five of our own bloggers all of which reside on the home page of the website on their own separete blogging urls as sub domains - is this a good idea or would google not like this from an seo point of view?
Should blogs that are part of an overall content site be on separate sites and link in or is it ok to promote them as content on the home page of the site and take users off to their own url to view the site. Is this good practise for seo?
Content Development | | Pday0 -
Page Content?
So I have review pages for websites on my site, each website has a review around 400-500 words. Recently I had my writers write 2 additional articles on each site but about something they have there. My thinking was interlinking them allowing them to rank individually etc. However now after looking around etc.. I see that content that is upwards of 1000 words or more might be more powerful and the way this is all written etc.. I could easily put it all on one page.... So my question is do I go with 3 pages or 1 page. I can see strength in both
Content Development | | dueces0 -
Can you help me with my options on publishing others' news releases on my site?
I wish to add a "News" section to a highly-read, highly ranked blog I have. The News pieces will not be in the same flow as my regular posts. I'm contemplating what the best way to do this is, and would like some advice, please. I see these options: Option 1. Pay textbroker type people to rewrite news releases and post them into the news flow. Pro: indexable content. Con: expense. Option 2: Have a Submit News form on the site for vendors to submit their news stories. I would have to ask them to rewrite their stories to avoid dup content. Pros: Easy for me, no cost. Cons: Will still get dup content I bet, a lot of companies won't take the time to do it, and I will have no control over quality. (I really doubt this option will work). Option 3: Post news releases from companies in their raw format, and mark them as no index (even if I don't noindex, they won't move up the SERPs anyway, so why not just noindex them). Pros: very easy, all the news I want. Cons: not creating any indexable content. Bonus question: If I do Option #3, and I place an adsense ad on the page, will it work the same as if it was an indexed, non-duplicate content page? Your thoughts?
Content Development | | bizzer0 -
Does a Google Map on the contact page help with SEO?
In regards to ranking organically for local search results (not google places), I'm wondering if there is any benefit to having a Google Map on my Contact page with our location pinned? If so, how important do you think it is?
Content Development | | pharcydeabc0 -
How to handle product pages with similar information
We have thousands of product pages with similar information but differentiating variables such as length/width. Example: http://www.savvyboater.com/store/p/2100-Cover-for-V-Hull-Fishing-Boat-with-Side-Console-O-B-14-X-74-.aspx http://www.savvyboater.com/store/p/2101-Cover-for-V-Hull-Fishing-Boat-with-Side-Console-O-B-15-X-76-.aspx http://www.savvyboater.com/store/p/2102-Cover-for-V-Hull-Fishing-Boat-with-Side-Console-O-B-16-X-92-.aspx http://www.savvyboater.com/store/p/2103-Cover-for-V-Hull-Fishing-Boat-with-Side-Console-O-B-17-X-92-.aspx We built individual products instead of grouped products because we recommend specific part numbers for specific make, model, year boats through our finder tool. These pages have recently started showing up in SEOmoz as duplicate content and we are looking for solutions to solve it. We have considered creating a "parent page" that lists all sizes and then using a rel canonical on each individual page to tell google that the parent page is the preferred page. Any thoughts or other ideas on this?
Content Development | | ironpac0 -
Posts vs Pages and Rankings Differ Greatly
I use wordpress for most of my sites and generally have a post 'news' section. What I've noticed is that just about every time a post will always rank much higher and much faster than a 'page'. As long as I don't let it get buried in the news archives it continues to rank well, better than if I were to create a 'page'. Is there any sort of reason this might occur? I'd like to be able to just create 'pages' but at this point in time it makes no sense.
Content Development | | GYMSN0