Index pdf files but redirecto to site
-
Hi,
One of our clients has tons of PDFs (manuals, etc.) and frequently gets good rankings for the direct PDF link. While we're happy about the PDFs attracting users' attention, we'd like to redirect them to the site where the original PDF link is published and avoid that people open the pdf directly.
In short, we'd like to index the PDFs, but show to users the pdf link within a site - how should we proceed to do that?
Thanks,
GM
-
Thanks for the follow-up ... if it weren't for phrases like
- The page displayed to all users who visit from Google must be identical to the content that is shown to Googlebot.
I'd be quite comfortable with that ... in the meantime, however, I might try some pdf2html conversion tools to see if there is a viable way to present PDF-information on a HTML page and block the PDF link for robots.
Regards,
Gert
-
Hi Gret,
After further research, it might not be considered as cloacking that much as the Google First Click Free for Web Search system works the same way and check the HTTP referer.
For more details, read the official Google Webmaster Central blog post about it here :
http://googlewebmastercentral.blogspot.com/2008/10/first-click-free-for-web-search.htmlBest regards,
Guillaume Voyer. -
Thanks for your detailed reply, Guillaume,
I guess the possible "cloaking troubles" with this strategy are probably too risky for our project. However, I like the "click here" idea, we'll check if we can automate that somehow to drag users reading the PDFs back to our site.
-
Hi Gert,
Technically, this is not possible unless you use cloaking to display the PDF to the search engines and redirect the users to a different page.
What you could do to avoid cloacking is to include a banner at the top of your PDF with something like "Click here to see all our related PDFs" that would link to your website, this way users might be interested in going to your website.
Otherwise, you could detect the referer with htaccess and redirect the user to the user if he is coming from google, but this might be considered as cloaking. Here's an example :
RewriteEngine On
RewriteCond %{HTTP_REFERER} (.)google.(.)
RewriteRule ^pdf/(.*).pdf /pdf-list [R=302]If you are running a apache server and you put this in your .htaccess file, the first line activate mod_rewrite, the second line check if the referer matches anythinggoogle.anything and the third line redirect all .pdf files in the pdf folder to the /pdf-list page if the referer matches.
Best regards,
Guillaume Voyer.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Using different sections from all over your site to compile a blog post, bad idea or ok to do?
I have a large site that sells various products, I have been on a kick creating new content relating to the many aspects of upkeep with these products after purchase, I wanted to create a blog post combining all the info for the group of products, but will be reusing some of the FAQs and even tips, since I'm more or less relocating the info. Since this blog post is using many different sources on our site, using a rel=canonical isn't possible. Is there anything I should watch out for, Will rewording / phrasing here and there be enough or should I steer clear of this as a whole?
Content Development | | Deacyde0 -
One story stands out for not getting indexed?
We have all our stories published today ( 20-Jun-2013 ) got indexed by google except this ( http://coed.com/2013/06/20/heres-a-video-of-kate-upton-topless-on-a-horse/ ). Do anyone out there have any clue about that? Thanks in advance
Content Development | | COEDMediaGroup0 -
Should my client copy and paste his blog posts onto other professional sites?
I have a client who is an Attorney and participates in a couple of different Q and A legal sites. One of these legal sites is recommending to my client that he copy and paste his blog posts onto his profile page on the Q and A site. I understand my client wanting to copy and paste his blog posts onto these sites, as it will not only get his content in front of a larger audience, but also lend credibility. However, I've always heard that that could cause some duplicate content problems. Will my client's blog likely get penalized by google for copying and pasting his blog posts onto other relevant sites?
Content Development | | ScottMcPherson0 -
Are there quality blog sites allowing guest blogging ?
hi can someone guide me the right direction for guest blogging ? i m looking for quality blog sites that allow guest blogging especially for ecommerce related sites. Thank you Nick
Content Development | | orion680 -
Any article sites that actually still rank welll?
Hi All! After the havoc of Panda and Penguin - are there any article sites that still rank decently for the articles within? I'm not talking about using articles to get backlinks - I want to attract visitors to the articles themselves because of the message within the article. Yes, I know that for lots of reasons my own blog would be better. In addition to that, if you want to spread the message - which sites are recommended? Thanks! Aviva
Content Development | | debi_zyx0 -
Anyone know of any other guest blogging sites like myblogguest?
Hi, I am a member over at myblogguest, but im wondering is this the only service online for guest blogging? Cheers
Content Development | | activitysuper0 -
How does google react to duplicate shops on ecommerce sites
Surely shopping cart sites are going to have a lot of duplicate content? Does google recognise this? Is there anything I can do let google know?
Content Development | | borderbound0 -
Google still caching old site
Hi all, We just acquired a new domain that was being squatted on by a reseller for a very long time and on the 5th June migrated our site over to it, replacing their advertising holding page. The domain is http://primate.co.uk It's been a week now though and Google hasn't seemed to have updated it's cache. Doing a search for 'primate.co.uk' in Google lists the site but with the old holding page description. Web master tools doesn't report any errors or issues with the site. Does anyone know how we can get Google to index the domain and update it's cache? Cheers, Gordon
Content Development | | Primate0