Index pdf files but redirecto to site
-
Hi,
One of our clients has tons of PDFs (manuals, etc.) and frequently gets good rankings for the direct PDF link. While we're happy about the PDFs attracting users' attention, we'd like to redirect them to the site where the original PDF link is published and avoid that people open the pdf directly.
In short, we'd like to index the PDFs, but show to users the pdf link within a site - how should we proceed to do that?
Thanks,
GM
-
Thanks for the follow-up ... if it weren't for phrases like
- The page displayed to all users who visit from Google must be identical to the content that is shown to Googlebot.
I'd be quite comfortable with that ... in the meantime, however, I might try some pdf2html conversion tools to see if there is a viable way to present PDF-information on a HTML page and block the PDF link for robots.
Regards,
Gert
-
Hi Gret,
After further research, it might not be considered as cloacking that much as the Google First Click Free for Web Search system works the same way and check the HTTP referer.
For more details, read the official Google Webmaster Central blog post about it here :
http://googlewebmastercentral.blogspot.com/2008/10/first-click-free-for-web-search.htmlBest regards,
Guillaume Voyer. -
Thanks for your detailed reply, Guillaume,
I guess the possible "cloaking troubles" with this strategy are probably too risky for our project. However, I like the "click here" idea, we'll check if we can automate that somehow to drag users reading the PDFs back to our site.
-
Hi Gert,
Technically, this is not possible unless you use cloaking to display the PDF to the search engines and redirect the users to a different page.
What you could do to avoid cloacking is to include a banner at the top of your PDF with something like "Click here to see all our related PDFs" that would link to your website, this way users might be interested in going to your website.
Otherwise, you could detect the referer with htaccess and redirect the user to the user if he is coming from google, but this might be considered as cloaking. Here's an example :
RewriteEngine On
RewriteCond %{HTTP_REFERER} (.)google.(.)
RewriteRule ^pdf/(.*).pdf /pdf-list [R=302]If you are running a apache server and you put this in your .htaccess file, the first line activate mod_rewrite, the second line check if the referer matches anythinggoogle.anything and the third line redirect all .pdf files in the pdf folder to the /pdf-list page if the referer matches.
Best regards,
Guillaume Voyer.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Website is not getting indexed
Hi,
Content Development | | Aman0022
Hope you all are doing great!
I have created a dog blog a few weeks back which talks about all things about dogs (http://pawspulse.com/). I am publishing couple of articles everyday which are more than 5k words long with proper keyword research but still Google is not indexing my content. My content is systematically categorized in proper categories related to dog guides, nutrition, accessories, dog breeds etc.
Can anyone help me how to get the website index faster fully. Any help will be much appreciated. Thanks0 -
Google won't index my website because "certain conditions" weren't met
I found the answer on this -- interestingly, I had changed registrars and they didn't pull over the DNS information correctly. This caused the above issues. Once I identified this, I updated the DNS correctly -- at registrar and server -- and things worked fine.
Content Development | | newbyguy0 -
Ecommerce site content upgrade timescale.
I have been upgrading my sites content and structure and I have been wondering how long I should wait for a traffic increase before I should think it has been a failure and try a new plan of attack?
Content Development | | mark_baird0 -
Duplicate content - 6 websites, 1 IP. Is the #1 site knocked down too?
Yes I know, running multiple websites on 1 IP isn't smart. 6 Websites with duplicate content on 1 IP is even worse. It's a technical issue we can't solve quickly. Thing is, our #1 website, which has the highest DA and PR, was the first website with all this content. All other websites we're running were launched a few months, and some a few years, later. All content was copied from the #1 website. I'd say the other websites would get knocked down by Google, because they duplicated the content. Google should see that our #1 website was the first that uploaded this content. Therefore our #1 website should rank normally. Questions is: What does Google think of duplicate content when all websites are on 1 IP? Is, or will our #1 website get punished as well?
Content Development | | Webprint0 -
Press Releases and Duplicate Content on Event Related Site
I have a site that lists events. I ask those submitting events to submit original content if possible, but frequently they submit press releases which are already published elsewhere. I rewrite some of the press releases, but do not have time to rewrite every press release that comes my way. I want my users to get a comprehensive list of events, but I don't want get a penalty for duplicate content. What is the best solution?
Content Development | | andywozhere0 -
Google Site Search
Hi, I was just wondering if anyone had used Google Site Search before and what they thought of it? http://www.google.com/sitesearch/ It seems quite expensive for just returning your own pages but would be interested to find out more. Thanks
Content Development | | ASOS0 -
Blogs: Hosted vs. Off-Site
What is the latest opinion on whether to have a blog hosting on your primary domain or on an outside domain (either Wordpress or just another URL). In the past we have done the former and it's been a great source of fresh content and organic traffic. Wonder if there is good argument now for not having it on the domain. With this new site we're working on it logistically would be more challenging given their tech resources, plus they already have thousands of pages.
Content Development | | BMGSEO0 -
How important is linking out to relevant, authoritative sites?
As I write blog articles for my site I often come across a situation where I'm quoting something from another site, or using a piece of data from that other site to make a point. I know it's nice and courteous to link to the source when I do this but from a pure SEO point of view, does it matter? Is there any benefit to linking from my site to other sites that are related and authoritative on the subject I'm discussing? I know I'll bleed off a little link juice to that external site that would otherwise go towards my internal links on the same page, but are there other benefits to linking out to known good sites? Is that any kind of signal to Google that I'm playing in a good neighborhood?
Content Development | | scanlin0