Google indexing PDF's
-
Hello,
We work heavily on E-commerce SEO and recently Google has started to index PDF pages (Datasheets) added to the product pages instead of the actual product pages. Has anyone else noticed this at all?
Seems to have got worse over the last month or so.
Thanks
-
Did you know that some shopping cart buttons can be embedded in pdf documents?
You can also place links in your pdf documents to direct traffic to product pages.
Or you can extract the info from pdf documents and publish it on the bottom of your product pages.
-
Yes, Google crawls PDFs but I don't think they crawl them "instead" of html. If anything, perhaps your PDFs are better optimized for the popular keywords than the pages..?
As for your question, no I have not noticed any changes in the way Google crawls PDFs. I personally made the decision a few months ago to no-crawl our PDF folder to avoid them being indexed as I saw no reason for it.
I friggin hate PDFs and if it wasn't for certain members of certain departments of a certain company using certain browsers like a certain Internet Explorer 7 and demanding certain PDFs to send to certain clients I certainly would never use them.
Whew.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How does Google treat significant content changes to web pages and how should I flag them as such?
I have several pages (~30) that I have plans to overhaul. The URLs will be identical and the theme of the content will be the same (still talking about the same widgets, using the same language) but I will be adding a lot more useful information for users, specifically including things that I think will help with my fairly high bounce rate on these pages. I believe the changes will be significant enough for Google to notice, I was wondering if it goes "this is basically a new page now, I will treat it as such and rank accordingly" or does it go "well this content was rubbish last time I checked so it is probably still not great". My second question is, is there a way I can get Google to specifically crawl a page it already knows about with fresh eyes? I know in the Search Console I can ask Google to index new pages, and I've experimented with if I can ask it to crawl a page I know Google knows (it allows me to) but I couldn't see any evidence of it doing anything with that index. Some background The reason I'm doing this is because I noticed when these pages first ranked, they did very well (almost all first / second page for the terms I wanted). After about two weeks I've noticed them sliding down. It doesn't look like the competition is getting any better so my running theory is they ranked well to begin with because they are well linked internally and the content is good/relevant and one of the main things negatively impacting me (that google couldn't know at the time) is bounce rate.
Search Behavior | | tosbourn0 -
Google Index Issue - Indexing pages that don't exhist
Hi All, I have noticed a weird issue when performing a search on Google to show me all the pages it is indexing of our site. site:www.one2create.co.uk It brings up most of our website pages but then is also brings up a few HTTPS urls (our site has not been converted to HTTPS yet) but also the URL path, Title, and Meta Description are from one of our clients websites (an Automotive Job site). When clicked they take you to a generic 404 server error page, not our branded 404 page. The site that it has taken the url, title and meta description from is on a different server completely so I don't see how it has even managed to get that information and linked it to our site? Has anyone seen anything like this before? And what is the best way to fix it? We have asked Google to re-index the site but still no luck.
Search Behavior | | Jvickery0 -
Google search operator "site:" show different result.
Search operator "site:" show incomplete information. When I search with just domain name it show only 3 link that got crawl in past week, this is the link https://www.google.com/?gfe_rd=cr&ei=mLr3VfrhN4_BuATQuYugBg&gws_rd=cr&fg=1#q=site:sierralivingconcepts.com&safe=off&tbs=qdr: but when i look a specific link it show them in any time (search tools), https://www.google.com/search?q=site:http://www.sierralivingconcepts.com/p-6300-white-silver-regence-louis-xiv-mango-wood-ornate-hall-console-table.aspx&safe=off&biw=1600&bih=775&noj=1&tbas=0&source=lnt&sa=X&ved=0CBUQpwVqFQoTCJ-4iOLK-McCFQFwjgod43gI9A But when i look in cached page it says "appeared on 11 Sep 2015" I am total confused why google not showing all the new link that it crawl from my site.
Search Behavior | | Sierra-Living-Concepts0 -
Google Panda 4.2 Is Here
Most of you guys probably already know - but Google Panda 4.2 has began. I would love to keep an open discussion regarding anyone who was affected by the last Panda update along with changes for the good/bad during this new roll out in addition to what vertical you are in. PANDA TIME! Your web therapist, Chenzo
Search Behavior | | Chenzo1 -
Google Analytics: advanced segment for hour of day
Cioa from 17 Degrees C cloudy Wetherby UK 🙂 In Google analytics I want to report specifically on Blackberry Mobile traffic next to hour if the day. Whilst this customised report I ripped off did the job @ http://bit.ly/hourdays I only resorted to this after battling with advanced segments thinking I could do the same thing. So my question is please how can I get this report http://i216.photobucket.com/albums/cc53/zymurgy_bucket/hrs-day-examplecopy_zps4f15d4a1.jpg by building it via advanced segments and not ripping off via http://bit.ly/hourdays Grazie tanto,
Search Behavior | | Nightwing
David0 -
Google slow to index new domains and subs?
Anyone finding Google slow to index new websites at the moment? Made a new site on Thursday and posted a number off high quality, relevant, backlinks to it the same day and now on Monday it is still not indexed. Have see the same with a couple of sub domains I have created off a website with a moz score of 40. Normally can get new sites indexed within hours but this seems super slow.
Search Behavior | | Grumpy_Carl0 -
Can I use Google Analytics to find out actual times of visits during the day??
Hi, I'm a newbie at all this - I hope someone can help me. We're thinking of running time-specific offers to try and convert as many of our customer site visits as possible e.g. 15% discount if you call between, say, 2 and 5pm. It would be really helpful to me to find out what times of day people are visiting our site. I can't seem to find a way to do this on Google Analytics. Can anyone help? Thanks so much Sue
Search Behavior | | 3Amigos0 -
Google Analytics Benchmarking Newsletter: How does your site perform?
With Google recently releasing benchmarking data I am curious as to what you all see across the various types of website niches that you work with (eCommerce, news, blog, services, small business, etc). And how SEO'd websites compare with this "raw" data provided by google. We have one medium size (12,000 products) strictly eCommerce website that has a bounce rate of 37% and an avg time on site of 5:20 While two other medium size eCommerce/blog sites have a bounce rate of 57% and 59% with average time on site of 2:37 and 2:30 respectively. Finally, I manage a website for a local small business that provides business and home cleaning services. This site has a bounce rate of 45% and 1:40 average time on site. How do your sites perform in these areas? Is it typical to see this great of a disparity between strict eCommerce websites and those sites that are both informational and transactional in nature? What about other kinds of websites? Cheers!
Search Behavior | | prima-2535091