PDFs and indexing
-
Hello and good morning.
I work for a paint manufacturing company in the UK on their seo campaigns across a couple of websites, this is my question. as paint and chemicals require data and tech sheets by law, available to be downloadable for said product, should these be included in the sitemap, we auto generate our sitemaps which they include these files, with low priorities and never change in terms of name etc.
they basically have a name of say 092847.pdf for example which cannot be changed, but from an seo view this doesn't mean a thing? so theres my question should they be included and would they carry any value?
-
thank you, I'm not saying they couldn't be changed it would just cause a lot of stress for our labs and tech guys who create these and work by the number. were as having a naming structure things would become a mess and everything up in the air.
I will look into the back end keywords, authors, company name which may give them some sort of impact from what I read on the link above.
-
-
Hi
Sitemaps - yes, include anything in Sitemaps that you want users to be able to find, so the more ways you can lead a Search Engine to it, the better.
Filename - it would help if you could change the filenames to include keywords, but if that's not an option then there are other things you can do to optimise each PDF.
There's a good overview of optimising PDFs here - How To Optimize PDF Documents For Search
As that post mentions, include links back to your site for maximum value, especially if these documents are shared on other websites. Also, a bit of branding within each PDF (just add a logo) could help you out in some way.
Hope that's helpful
-
Case A:
If the content of the PDF's is valuable, if it contains also some text about the product, I would make them indexable. It will make niche searchers find you.
You might want to make a separate sitemap for these PDF's, just to keep things clean.Case B:
If it's only numbers and very technical jibber jabber, I wouldn't let it index, since Google won't understand it either.Update with an interesting story:
A client of mine also had technical PDF sheets online. He has put a lot of effort in that. There were a few (4-5) competitors using direct links to the PDF's. After a while, we referred all that competitor traffic to a special landing page trying to convince why my client is a better deal. It's still online on some of the sites, since some competitors never really checked the PDF's.
Made my client very happy. -
Hey there
I can't imagine them having any SEO value, but I can't see the PDFs doing any harm either.
PDFs are crawlable and indexable by the search engines, so I would want to keep it in your sitemap for the user. I'm quite familiar with your industry (my dad worked with providing paint and chemical coatings) and I can imagine your target audience being quite specific in their searches, looking for products by code and specifications. A PDF would probably be the ideal solution for this and so having it indexed and sitting on your domain could bring in some organic traffic.
I'd make sure that the PDFs are branded if possible containing clear links back to your site, in order to funnel any long-tail traffic back to your homepage and sales pages.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How to Diagnose "Crawled - Currently Not Indexed" in Google Search Console
The new Google Search Console gives a ton of information about which pages were excluded and why, but one that I'm struggling with is "crawled - currently not indexed". I have some clients that have fallen into this pit and I've identified one reason why it's occurring on some of them - they have multiple websites covering the same information (local businesses) - but others I'm completely flummoxed. Does anyone have any experience figuring this one out?
Reporting & Analytics | | brettmandoes2 -
No Index Meta
Good Morning, So the company who redesigned our website forgot to take off any of the no-index stuff that was put onto the site once it went live. I removed everything in the robots.txt and the privacy settings in wordpress but I am still seeing Any suggestions on changing this or if its even necessary to change would be great! Thank you
Reporting & Analytics | | HashtagHustler0 -
Getting google impressions for a site not in the index...
Hi all Wondering if i could pick the brains of those wise than myself... my client has an https website with tons of pages indexed and all ranking well, however somehow they managed to also set their server up so that non https versions of the pages were getting indexed and thus we had the same page indexed twice in the engine but on slightly different urls (it uses a cms so all the internal links are relative too). The non https is mainly used as a dev testing environment. Upon seeing this we did a google remove request in WMT, and added noindex in the robots and that saw the index pages drop over night. See image 1. However, the site still appears to getting return for a couple of 100 searches a day! The main site gets about 25,000 impressions so it's way down but i'm puzzled as to how a site which has been blocked can appear for that many searches and if we are still liable for duplicate content issues. Any thoughts are most welcome. Sorry, I am unable to share the site name i'm afraid. Client is very strict on this. Thanks, Carl image1.png
Reporting & Analytics | | carl_daedricdigital0 -
Sudden Increase In Number of Pages Indexed By Google Webmaster When No New Pages Added
Greetings MOZ Community: On June 14th Google Webmaster tools indicated an increase in the number of indexed pages, going from 676 to 851 pages. New pages had been added to the domain in the previous month. The number of pages blocked by robots increased at that time from 332 (June 1st) to 551 June 22nd), yet the number of indexed pages still increased to 851. The following changes occurred between June 5th and June 15th: -A new redesigned version of the site was launched on June 4th, with some links to social media and blog removed on some pages, but with no new URLs added. The design platform was and is Wordpress. -Google GTM code was added to the site. -An exception was made by our hosting company to ModSecurity on our server (for i-frames) to allow GTM to function. In the last ten days my web traffic has decline about 15%, however the quality of traffic has declined enormously and the number of new inquiries we get is off by around 65%. Click through rates have declined from about 2.55 pages to about 2 pages. Obviously this is not a good situation. My SEO provider, a reputable firm endorsed by MOZ, believes the extra 175 pages indexed by Google, pages that do not offer much content, may be causing the ranking decline. My developer is examining the issue. They think there may be some tie in with the installation of GTM. They are noticing an additional issue, the sites Contact Us form will not work if the GTM script is enabled. They find it curious that both issues occurred around the same time. Our domain is www.nyc-officespace-leader. Does anyone have any idea why these extra pages are appearing and how they can be removed? Anyone have experience with GTM causing issues with this? Thanks everyone!!!
Reporting & Analytics | | Kingalan1
Alan0 -
Switch to www from non www preference negatively hit # pages indexed
I have a client whose site did not use the www preference but rather the non www form of the url. We were having trouble seeing some high quality inlinks and I wondered if the redirect to the non www site from the links was making it hard for us to track. After some reading, it seemed we should be using the www version for better SEO anyway so I made a change on Monday but had a major hit to the number of pages being indexed by Thursday. Freaking me out mildly. What are people's thoughts? I think I should roll back the www change asap - or am I jumping the gun?
Reporting & Analytics | | BrigitteMN0 -
Not many pages being indexed on google
Hi I am putting in to Google: site:www.mysite.com to see the pages listed on Google - the figure Google is coming back with is much lower than the actual pages, I have no crawer warning etc... What could the problem be? Thanks
Reporting & Analytics | | acumenadagency0 -
Google Webmaster says "0" pages indexed
Built my first Wordpress site. It launched a few months ago. Google has crawled 76 pages so far. But why are 0 indexed?
Reporting & Analytics | | cschwartzel0 -
Correlation between google and yahoo indexed pages
My blog ocpatentlawyer.com has about 130 pages or so. Google has indexed most if not all of the posts and pages. In contrast, yahoo has only indexed about 1/4 of the pages and posts. Are there any actions that can be taken based on this information? For example, if i prepare a blog post should I prepare it so that it will most likely be indexed into yahoo knowing that google will also index it. If so, how can i prepare blog posts that will most likely be indexed into yahoo's index?
Reporting & Analytics | | jamesjd70