Does google scrape links from PDF files? do these links pass link juice?
-
Title is pretty much the whole question.
-
I made a test and it seems that yes, the links from pdf count for ranking.
The test is on my Romanian blog http://seogan.ro/link-building-pdf-urile-o-sursa-de-linkuri-test
You can find an English translation here: http://www.seogan.com/pdf-link-building
Hope it helps.
-
Yes it does according to Google tech spec http://code.google.com/apis/searchappliance/documentation/50/admin_crawl/Introduction.html
which specifically states if follows html links in pdf 'It follows HTML links in PDF files, Word documents, and Shockwave documents'. Google's own api docs carry more weight than a comment in a forum_._ If they are licencing this out as an application it would suggest that the same technology is available in the main engine as does Dunamis's comment about a listing in a pdf document being found in search results.
You can test for youself by publishing a pdf with a link to a info page that does not show up in any other links. Include the pdf in your sitemap but not the test page and check if it shows in googles index site:yoursite.com the next time it crawls.
This also gives some insight in an interview with Matt Cutts - http://www.stonetemple.com/articles/interview-matt-cutts-012510.shtml
Eric Enge: What about PDF files?
Matt Cutts: We absolutely do process PDF files. I am not going to talk about whether links in PDF files pass PageRank. But, a good way to think about PDFs is that they are kind of like Flash in that they aren't a file format that's inherent and native to the web, but they can be very useful. In the same way that we try to find useful content within a Flash file, we try to find the useful content within a PDF file. At the same time, users don't always like being sent to a PDF. If you can make your content in a Web-Native format, such as pure HTML, that's often a little more useful to users than just a pure PDF file.
-
This person seems to think no: http://www.google.fr/support/forum/p/Webmasters/thread?tid=14c5fe970fe84361&hl=en
but i'm not sure how much i can trust a random comment from a random source. any evidence for either argument?
EDIT: And this person seems to think they do pass link juice: http://www.whydowork.com/blog/link-building/274/
Could a mod remove the marked as answered? i don't think i am able to remove it, and the question isn't really answered.
-
yes, but do they crawl the links they find in these documents, or do they just index their contents.
-
Hmmm although i thought you had answered my question, i actually feel that you have not... Yes the links you provided state that google scrapes pdfs and even OCRs pdfs to get a better idea what is in them, but i don't see anywhere that they mention crawling the urls they find in these pdf documents.
-
Google definitely does index the contents of pdf files. I found this out the hard way as I had a real estate pdf on my site that I wanted to have listed in the index, but I didn't know that the contents would be crawled. The pdf contained some listings that I was not legally allowed to advertise on my site. (It was legal for me to give someone a report with the listings in it though).
When another realtor was searching for their own listing, my pdf came up. I got in trouble. I'm ok now though.
-
Have a look at this article http://searchenginewatch.com/article/2067225/Google-Does-PDF-Other-Changes it explains some of the doc library search for pdf files and Google's statement here http://googleblog.blogspot.com/2008/10/picture-of-thousand-words.html.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Are these links helping
Hi, We have had a few new links for a client and we are hoping that we can get some feedback on whether these have any link juice. We dont want to waste our time with links that dont benefit the website. Thanks http://thebestmealplancateringbiz.strikingly.com/blog/diet-meal-plan-delivery-services http://mealplanguide.home.blog/2018/11/02/importance-of-diet-meal-plans/ https://besthealthymeals.tumblr.com/post/179669447854/benefits-of-diet-meal-planning https://lauraberbenaj2.wixsite.com/mysite/blog/healthy-diet-meals-delivery
Link Building | | Caffeine_Marketing0 -
Spam links - what would you do?
Hello, A few months back our website was hacked which we noticed quickly and got fixed. However, there still is a lot of dodgy backlinks, linking to the spam pages. They come up in Webmaster Tools, Moz and SEM rush. According to the Moz Spam tool, a few of them have been given 7 flags which Moz suggests gives a 30% chance of being penalised by Google. The website has great rankings (position 1-3 in most targeted keywords) so I am scared of doing something that will harm the rankings, however I also aware that Google may do an update which could take these spam links into account. I have no experience of using the disavow tool but from reading up about it, it should only be used as a last resort. So my question is - what would you do?
Link Building | | N2Digital0 -
Getting a sitewide links from one domain as oppsed to few links from certain pages, a good, bad or horrible idea?
Hello, I have a website that is performing really well in Google for country specific Real Estate Website. I have had this site for just about 2 years and I have 4600+ indexed pages, and about 50 of my main keywords appear in top 10 results on Google. I have now started a second site with a brand new domain, its only about 2 months old. This one is a Travel website and again its country specific (same country that real estate website is performing well in) and since its still building up the content and has barely a couple of backlinks, I am thinking of adding a linkback to it from my Real Estate Site. Is it a good idea to add a site wide link back from header section of the real estate website to my travel website? Both sites has same country and most of the keywords in both relate to this country. Or will I be better off adding a few text links from few pages with best Page Authority? Or - do you suggest that I do not link the two at all? Both sites are hosted on different servers, with different IP's and one is on windows server created in asp.net while another is a magento site. Any advidce / insight will be much appreciated and thank you in advance.
Link Building | | waituk_sanjeev0 -
Can you get "link juice" from an outside company's/organization's intranet?
Will you get the same "link juice benefits" from a University, Government Org, or public company's intranet that are linked to your webpages – vs the benefits that you would get from links from their public webpages? Assume that the external webpages have very high Domain Authority, MozRank/Trust etc.
Link Building | | MLR0 -
Changing links
Hi guys i wanted you views on changing the anchor text of links. I have quality links coming in but with year terms such as 2012 in there, if i want to change them all to 2013 for example would it be badly seen by Google? I cant say i feel comfortable about doing it but they are my links and are related to our products. Any advice much appreciated.
Link Building | | pauledwards0 -
Does the trailing slash really matter in terms of passing link juice?
Does the trailing slash matter? Say my website resolves to: link to: http://www.domain.com/ So if a user went to link to: http://www.domain.com it would 301 them to link to: http://www.domain.com Then does a link to: link to: http://www.domain.com**/** pass the same amount of link juice as a link to: http://www.domain.com ?
Link Building | | adriandg0 -
Link exchanges
I have quite a few legal clients and 90% of my clients top competitors are doing 1:1 link exchanges and they have been doing it for years. Other industries Dont seem as prevalent. I am just boggled every time someone says link exchanges don't work, will get penalized, or value is not passed. Its been working for my competitors....Legal SEO's seem to focus heavily on link exchanges. I have yet to do link exchanges, but am about to get started with a resourceful directory of local businesses on my clients site. Some will be linking back, but not all. Fear of penalization is lurking in my mind. Does anyone have any real data on this? Does anyone have an example of doing this properly and an example of how not to do it? I appreciate any feedback you can give. Thank you,
Link Building | | waqid0 -
New PDF Guide. On My site or linked by blogs?
Hi experts, I created a new worth guide about food. Its purpose it to improve backlink by adding website's URL on PDF's footer (spider crawls that one, is it true?). I want send my guide to trusted blogs so users can download new pdf guide on the web. Is it the best way to gain popularity and let my backlinks grow? I've seen that seomoz published is guide such as html pages.
Link Building | | Greenman
Could it be the worth way for me? Thanks a bunch in advance! G.0