Index pdf files but redirecto to site
-
Hi,
One of our clients has tons of PDFs (manuals, etc.) and frequently gets good rankings for the direct PDF link. While we're happy about the PDFs attracting users' attention, we'd like to redirect them to the site where the original PDF link is published and avoid that people open the pdf directly.
In short, we'd like to index the PDFs, but show to users the pdf link within a site - how should we proceed to do that?
Thanks,
GM
-
Thanks for the follow-up ... if it weren't for phrases like
- The page displayed to all users who visit from Google must be identical to the content that is shown to Googlebot.
I'd be quite comfortable with that ... in the meantime, however, I might try some pdf2html conversion tools to see if there is a viable way to present PDF-information on a HTML page and block the PDF link for robots.
Regards,
Gert
-
Hi Gret,
After further research, it might not be considered as cloacking that much as the Google First Click Free for Web Search system works the same way and check the HTTP referer.
For more details, read the official Google Webmaster Central blog post about it here :
http://googlewebmastercentral.blogspot.com/2008/10/first-click-free-for-web-search.htmlBest regards,
Guillaume Voyer. -
Thanks for your detailed reply, Guillaume,
I guess the possible "cloaking troubles" with this strategy are probably too risky for our project. However, I like the "click here" idea, we'll check if we can automate that somehow to drag users reading the PDFs back to our site.
-
Hi Gert,
Technically, this is not possible unless you use cloaking to display the PDF to the search engines and redirect the users to a different page.
What you could do to avoid cloacking is to include a banner at the top of your PDF with something like "Click here to see all our related PDFs" that would link to your website, this way users might be interested in going to your website.
Otherwise, you could detect the referer with htaccess and redirect the user to the user if he is coming from google, but this might be considered as cloaking. Here's an example :
RewriteEngine On
RewriteCond %{HTTP_REFERER} (.)google.(.)
RewriteRule ^pdf/(.*).pdf /pdf-list [R=302]If you are running a apache server and you put this in your .htaccess file, the first line activate mod_rewrite, the second line check if the referer matches anythinggoogle.anything and the third line redirect all .pdf files in the pdf folder to the /pdf-list page if the referer matches.
Best regards,
Guillaume Voyer.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Website is not getting indexed
Hi,
Content Development | | Aman0022
Hope you all are doing great!
I have created a dog blog a few weeks back which talks about all things about dogs (http://pawspulse.com/). I am publishing couple of articles everyday which are more than 5k words long with proper keyword research but still Google is not indexing my content. My content is systematically categorized in proper categories related to dog guides, nutrition, accessories, dog breeds etc.
Can anyone help me how to get the website index faster fully. Any help will be much appreciated. Thanks0 -
Sitemap - 200 out of 2100 pages indexed
I submitted the .xml sitemap in Google Webmaster Tools and only 200 out of 2100 pages were indexed.
Content Development | | Madlena
Why is that and what can I do ?0 -
In my website all the pages are not indexed by google..what to do for the same
In my website http://www.dubins.ae, all the pages are not indexed by google. How to make sure that all the pages are indexed by google?
Content Development | | Muna0 -
Smaller Index
Hi guys, We are a price comparison website with thousands of webpages. Most of them are product webpages with not so good quality content. Only price information and product image, no product details nor costumers reviews. We are planing to focus on less product categories by adding reviews, details, better images etc... and I would like to know if I should maintain the other "not-so-good" products in other categories or if I should remove it from index to leverage domain average content quality. Our index size is 200k pages and we are planning to focus on 10k pages max. Thanks for your help.
Content Development | | Kuantokusta0 -
This article on making money with your site has to be out of date, doesnt it
Came across this article on making money with your site but surely it must be out of date as a number of the recommendations breaks google guidelines. Here is the link to read the full article but below is a breakdown http://bloggerspassion.com/make-money-online-websites/ 1. Paid Blogging with Sponsored Reviews Publishers are able to make money selling paid reviews on their blogs and advertisers are able to create buzz, traffic and backlinks with SponsoredReviews website. My total earnings with this paid review website are $1631.95. You need to have 3 months old blog with 10 quality posts published to get accepted. 2. Make Money Selling Text Link Ads Bloggers are able to make good earnings selling text links and advertisers are able to get lots of high quality backlinks and traffic for their websites. This helps them earn a lot of money from the services and products they are selling on their websites. I’m so far I have been able to earn $7,711 with this website. You need PR 3 plus blogs to get accepted. would be interested in your thoughts on this
Content Development | | ClaireH-1848860 -
Can you help me with my options on publishing others' news releases on my site?
I wish to add a "News" section to a highly-read, highly ranked blog I have. The News pieces will not be in the same flow as my regular posts. I'm contemplating what the best way to do this is, and would like some advice, please. I see these options: Option 1. Pay textbroker type people to rewrite news releases and post them into the news flow. Pro: indexable content. Con: expense. Option 2: Have a Submit News form on the site for vendors to submit their news stories. I would have to ask them to rewrite their stories to avoid dup content. Pros: Easy for me, no cost. Cons: Will still get dup content I bet, a lot of companies won't take the time to do it, and I will have no control over quality. (I really doubt this option will work). Option 3: Post news releases from companies in their raw format, and mark them as no index (even if I don't noindex, they won't move up the SERPs anyway, so why not just noindex them). Pros: very easy, all the news I want. Cons: not creating any indexable content. Bonus question: If I do Option #3, and I place an adsense ad on the page, will it work the same as if it was an indexed, non-duplicate content page? Your thoughts?
Content Development | | bizzer0 -
Press Releases and Duplicate Content on Event Related Site
I have a site that lists events. I ask those submitting events to submit original content if possible, but frequently they submit press releases which are already published elsewhere. I rewrite some of the press releases, but do not have time to rewrite every press release that comes my way. I want my users to get a comprehensive list of events, but I don't want get a penalty for duplicate content. What is the best solution?
Content Development | | andywozhere0 -
Our blog is indexed by "google web" but does not show up in "google blogs". Why not and how can I fix this?
We have a pretty simple blog http://www.aviawest.com/blog I've noticed our articles arn't showing up in Google blogs on "web", we've submitted to http://blogsearch.google.com/ping a month ago. Anyone have some insight here?
Content Development | | Aviawest0