I have a page where you can download a PDF of the material - should I exclude the PDF from the search engines?
-
In my niche, there is a controversial research article that is very popular. I am writing a rebuttal to this article and giving another point of view.
My article has the potential to be really good link bait for my site.
The original article is often printed out to be shown to professionals in my niche. My hope is that people will do the same with mine. So, I plan to have a PDF version of my article available on my page. The article that is visible on my site (i.e. non PDF) will be a graphic rich article that is easy for the reader to go through. I plan to have the PDF have all of the same text, but it won't have as many graphics - it will look more like a scientific research article.
So, should I exclude the pdf from search engines so that it isn't duplicate content? Or does that even matter seeing as it is a duplicate of my own content? I want people to link to the main article, not the pdf.
Any tips would be greatly appreciated!
-
Thank you! This is exactly the kind of information I needed!
I was thinking contacting webmasters who published the original article to tell them about mine. But now, perhaps what I will do is not just contact them but attach a copy of the pdf for them to use.
-
Do not exclude.
People will link to it.
PDF documents can rank in the SERPs if you complete the properties portion of the document. The title in the properties will serve as a title tag for Google SERPs.
PDF documents can accumulate pagerank and pass that pagerank though any links in the PDF document. (Be sure to place a few links to your website in the PDF. Because....pdf, .ppt, .xls and many other file times display in my google webmaster tools backlinks).
Encourage other webmasters to download your pdf and post it on their server and link to it from their website. That will give you backlinks from their domain. You can get a kickass number of backlinks from this. (I usually don't advocate giving content away but I have seen success from "whitepapers" like this. You might consider offering them a "branded" copy of the document to post on their own site - you would add their branding for them.)
Its a good idea to lock the .pdf document so that others can't change it. They can always make their own document from your content but don't make it too easy for them.
I have used .pdfs and have not seen a duplicate content problem from them. However, the content of the pdf is not exactly the same as what is on an .html page of my site. It sounds like you are planning to have richer content on your site than in the .pdf so I would not worry about dupe content. Just be sure that there is a significant difference.
-
I don't think there's a problem with hosting the PDF. Just make sure you've got strong branding in the PDF and links back to your online article. People will most likely pass your PDF around to others and you want them to come visit the source --> YOU.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
My keywords have low search volume - is it still worth starting a blog?
I'm thinking of starting a new blog, but when I did my keyword research I found that my keywords all have low search volume (under 100 searches per month, with the occasional keyword having 480 searches a month). Is this a deal breaker? Any recommendations would be great - thanks everyone!
Content Development | | Trevorneo1 -
Lonely lonely pages
On my site I have tons of blog posts that have never been visited. (Falls on floor in tears). I of course know why. The content is mediocre in most cases and when it was average to good I didn't market it more. My question is should I go and just scrub the non visited pages or spend the time making these pages better and work on making the content above average? My competition above me do not have as many pages and their ranking is purely (I have researched this to death) from links from sites they have developed - with good authority.
Content Development | | GrangeWeb1 -
Authorship showing in SERPs for non-blog pages
Hi, A few months ago we set up authorship for on our blog articles for multiple authors, which has helped driving extra traffic to our blog posts. Today, I did a search for one of most important search terms and one of our non-blog pages is showing in the first page of the results with one of our authors headshot next to it. Technically we have not not set it up to do this, the page is on a different CMS to our blog (which is wordpress). I'm not complaining because I think this is a positive outcome, but does anyone have an idea why it has done this? I was under the impression that only blog article pages could have authorship set up. Thanks, Stu
Content Development | | Stuart260 -
How to make a bad posting drop down the search engines
we have had a competitor who has been stealing a lot of content from us and now they have used seo tactics to write a bad article about one of our sites that they have been taking content from, and what we would like to do is to use seo techniques to drop their article down the rankings. We are consulting a solicitor to take action against them as we are fed up of the content being stolen and the other techniques they are using, can anyone give me ideas on the best way to drop this post down the rankings. I was thinking about articles and press releases and blog posts to get higher up in the rankings and do a lot of these to drop their article down the rankings. Our solicitor has already taken action against them in the past and that cost us a fortune and it looks like we are going to have to spend a fortune again, so any help on using seo techniques to drop their article down the rankings would be great.
Content Development | | ClaireH-1848860 -
Do comments count as page content, as it relates to the length of content on a page?
I understand Google likes long content, and I make all my pages at least 500 words of unique and good content. But there is something I am curious about. Do they also count comments as content? The reason I'm asking is that I'm considering creating a Q&A site, where I'd control the questions, making sure they would be good ones and not duplicates, and then have people add answers. In reality, I'd be populating most the questions as first, and most definitely supplying a very good and long answer to questions. The answers would likely be in the form of comments, with highest ranked answers at top. So, I'm wondering what Google would think of a 100 word question, with a several hundred word answer in a comment, often followed by some other comments after that. Would it be a 100 word page or a 500+ word page?
Content Development | | bizzer0 -
Content being copied from our product page hurting our site overall?
On our product pages, he have short descriptions and some bulleted lists. Resellers of our products, and many other sites who are not resellers are copying this content, often verbatim. While I'm not as concerned for the product pages themselves as we're hoping the category pages will rank, does this duplication of our content hurt our site overall? FWIW, our brand name is in our domain and often also shown on these sites that copy the content.
Content Development | | minutiae0 -
Block Low Quality Pages?
What are your thoughts on blocking (in robots.txt) and/or noindexing low-quality pages to defend against Panda, assuming you can't remove, redirect, or add quality content to it? Also, assume there are no external links pointing to these low-quality pages, no social shares, and zero incoming organic traffic. Has anyone had experience with this as a solution to Panda?
Content Development | | poolguy0 -
Should I Have No Index, No Follow On Blog Category & Tag Pages?
At some point in the past I read or was told that No Index, No Follow tags on category and tag pages were a good thing on a standard WordPress blog in order to prevent duplicate content issues. Is this still true or was it ever true?
Content Development | | eTundra0