PDFs and indexing
-
Hello and good morning.
I work for a paint manufacturing company in the UK, on their SEO campaigns across a couple of websites. Here is my question: by law, paint and chemical products must have data sheets and tech sheets available to download. Should these files be included in the sitemap? We auto-generate our sitemaps, and they currently include these files with low priorities. The files never change in terms of name etc.
Each one has a name like 092847.pdf, which cannot be changed, but from an SEO point of view that name doesn't mean a thing. So, should they be included, and would they carry any value?
-
Thank you. I'm not saying they couldn't be changed; it would just cause a lot of stress for our labs and tech guys, who create these and work by the number. With a keyword naming structure, things would become a mess and everything would be up in the air.
I will look into the back-end keywords, author and company name fields, which may give them some sort of impact, from what I read at the link above.
-
-
Hi
Sitemaps - yes, include anything in your sitemaps that you want users to be able to find. The more ways you can lead a search engine to it, the better.
Filename - it would help if you could change the filenames to include keywords, but if that's not an option then there are other things you can do to optimise each PDF.
There's a good overview of optimising PDFs here - How To Optimize PDF Documents For Search
As that post mentions, include links back to your site for maximum value, especially if these documents are shared on other websites. Also, a bit of branding within each PDF (just add a logo) could help you out in some way.
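For what it's worth, here is a minimal sketch of what a sitemap listing the PDFs could look like if you generated it by hand. The `build_pdf_sitemap` helper, the `https://www.example.com/datasheets` base URL and the `0.3` priority are made up for illustration; only the `092847.pdf` style of filename comes from the question, and your auto-generator will have its own way of doing this.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_pdf_sitemap(base_url, pdf_names, priority="0.3"):
    """Return sitemap XML (as a string) with one <url> entry per PDF.

    base_url and priority are illustrative assumptions, not values
    taken from the original poster's site.
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for name in pdf_names:
        url = ET.SubElement(urlset, "url")
        # Numeric filenames are kept exactly as they are.
        ET.SubElement(url, "loc").text = f"{base_url.rstrip('/')}/{name}"
        ET.SubElement(url, "priority").text = priority
    return ET.tostring(urlset, encoding="unicode")


if __name__ == "__main__":
    print(build_pdf_sitemap("https://www.example.com/datasheets", ["092847.pdf"]))
```

The numeric filenames need no changes for this; the sitemap only has to point at the URL where each file lives.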
Hope that's helpful
-
Case A:
If the content of the PDFs is valuable, and it also contains some text about the product, I would make them indexable. It will help niche searchers find you.
You might want to make a separate sitemap for these PDFs, just to keep things clean.
Case B:
If it's only numbers and very technical jibber-jabber, I wouldn't let it be indexed, since Google won't understand it either.
Update with an interesting story:
A client of mine also had technical PDF sheets online, and he had put a lot of effort into them. A few (4-5) competitors were linking directly to the PDFs. After a while, we redirected all that competitor traffic to a special landing page explaining why my client is the better deal. It's still live on some of those sites, since some competitors never really checked the PDFs.
Made my client very happy.
-
Hey there
I can't imagine them having any SEO value, but I can't see the PDFs doing any harm either.
PDFs are crawlable and indexable by the search engines, so I would want to keep it in your sitemap for the user. I'm quite familiar with your industry (my dad worked with providing paint and chemical coatings) and I can imagine your target audience being quite specific in their searches, looking for products by code and specifications. A PDF would probably be the ideal solution for this and so having it indexed and sitting on your domain could bring in some organic traffic.
I'd make sure that the PDFs are branded if possible and contain clear links back to your site, in order to funnel any long-tail traffic back to your homepage and sales pages.
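If you decide some sheets are better kept out of the index (Case B above), the usual route for non-HTML files is an `X-Robots-Tag: noindex` response header, since a PDF can't carry a meta robots tag. Below is a rough sketch as a Python WSGI wrapper; `noindex_pdfs` is a hypothetical name, and it assumes your site is served by a WSGI app, which may well not be the case. Most setups would do the same thing in the web server config instead.

```python
def noindex_pdfs(app):
    """Wrap a WSGI app so responses for .pdf paths carry X-Robots-Tag: noindex.

    Hypothetical helper for illustration; it matches on the URL path only.
    """
    def wrapped(environ, start_response):
        is_pdf = environ.get("PATH_INFO", "").lower().endswith(".pdf")

        def patched_start_response(status, headers, exc_info=None):
            if is_pdf:
                # Copy the header list and append the noindex instruction.
                headers = list(headers) + [("X-Robots-Tag", "noindex")]
            return start_response(status, headers, exc_info)

        return app(environ, patched_start_response)

    return wrapped
```

You would only wrap the part of the site serving the sheets you don't want indexed; anything you do want found should be left alone.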