Is there a way to prevent Google Alerts from picking up old press releases?
-
I have a client that wants a lot of old press releases (pdfs) added to their news page, but they don't want these to show up in Google Alerts. Is there a way for me to prevent this?
-
Thanks for the post Keri.
Yep, the OCR option would still make the image option for hiding "moo"
-
Harder, but certainly not impossible. I had Google Alerts come up on scanned PDF copies of newsletters from the 1980s and 1990s that were images.
The files recently moved and aren't showing up for the query, but I did see something else interesting. When I went to view one of the newsletters (https://docs.google.com/file/d/0B2S0WP3ixBdTVWg3RmFadF91ek0/edit?pli=1), it said "extracting text" for a few moments, then had a search box where I could search the document. On the fly, Google was doing OCR work and seemed decently accurate in the couple of tests I had done. There's a whole bunch of these newsletters at http://www.modelwarshipcombat.com/howto.shtml#hullbusters if you want to mess around with it at all.
-
Well that is how to exclude them from an alert that they setup, but I think they are talking about anyone who would setup an alert that might find the PDFs.
One other idea I had, that I think may help. If you setup the PDFs as images vs text then it would be harder for Google to "read" the PDFs and therefore not catalog them properly for the alert, but then this would have the same net effect of not having the PDFs in the index at all.
Danielle, my other question would be - why do they give a crap about Google Alerts specifically. There has been all kinds of issues with the service and if someone is really interested in finding out info on the company, there are other ways to monitor a website than Google Alerts. I used to use services that simply monitor a page (say the news release page) and lets me know when it is updated, this was often faster than Google Alerts and I would find stuff on a page before others who did only use Google Alerts. I think they are being kind of myopic about the whole approach and that blocking for Google Alerts may not help them as much as they think. Way more people simply search on Google vs using Alerts.
-
The easiest thing to do in this situation would be to add negative keywords or advanced operators to your google alert that prevent the new pages from triggering the alert. You can do this be adding advanced operators that exclude an exact match phrase, a file type, the clients domain or just a specific directory. If all the new pdf files will be in the same directory or share a common url structure you can exclude using the "inurl:-" operator.
-
That also presumes Google Alerts is anything near accurate. I've had it come up with things that have been on the web for years and for whatever reason, Google thinks they are new.
-
That was what I was thinking would have to be done... It's a little complicated on why they don't want them showing up in Alerts. They do want them showing up on the web, just not as an Alert. I'll let them know they can't have it both ways!
-
Robots.txt and exclude those files. Note that this takes them out of the web index in general so they will not show up in searches.
You need to ask your client why they are putting things on the web if they do not want them to be found. If they do not want them found, dont put them up on the web.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How do I hide comment reply's from google? Do I need to?
Reason for asking is Moz reports them as URL too long. Should these even be indexed by google if not, how do I hide them? Example URL : https://www.tansleyphotography.co.uk/farnham-castle-wedding-claire-chris/?reply-to=1876 This really doesn't need indexing, as it's just a comment on a blog post. Does it matter?
On-Page Optimization | | paultansley1 -
Where does Google get its meta descriptions from?
We have a new client and they don't have meta descriptions yet. However, Google has assigned descriptions for them now appearing on the SERPs. The problem is that Google added a phone number that's totally not the client's and goes to a different unrelated business. Our plan is to update the meta to reflect the correct information, however, we're just perplexed as to how Google came up with the incorrect phone number. Where does it get its information from? The page currently has all the correct phone number, hours, and content. I've read that Google sometimes also doesn't recognise our meta descriptions if it thinks they could serve up a better one. My next question is, what if Google insists on showing the incorrect phone number. Is there a way we can fix this? Thanks!
On-Page Optimization | | nhhernandez2 -
Google Authorship Problems
Hi, I seem to be having a few problems with getting google authors set up on Wordpress. I've set up my G+ account, put the link to my blog http://appointedd.com/blog/ and then registered it on the yoast plugin. However, I'm not sure it's set up correctly and I can't seem to be able to get it to work. I'm hoping a fine someone here has experience in this as I'm a little flustered. thanks.
On-Page Optimization | | LeahHutcheon0 -
Google Indexing
Hi, We recently launched a new version of our site on the Magento platform. I submitted a new sitemap and on the first crawl only 7 pages out of 132 were indexed...a few days later and we now have 107 indexed (phew). My question is this....how on earth do i find out which pages are indexed and more importantly not indexed? For all i know they might be really important ones so I need to be able to identify the missing pages so i can work on getting them indexed. Nic
On-Page Optimization | | nicc19760 -
Did anyone Rankings drop massively last weekend ...Is this new google update ?
Hi all, My Site rankings took a battering in the past week , From what I have read, I know that google have updated their algorithm and supposedly it only affects less than 1% of queries but I was very surprised to have fallen in that list... Just wondered, am i the only site to have been hammered this past week ? or is it more of a case that maybe the new algorithm means I need to be more savvy with our SEO. Just posting a quick general consensus question ... thanks Sarah
On-Page Optimization | | SarahCollins0 -
Anyone know how long it takes Google to Index new site?
Could anyone let me know how long it takes for a NEW site to be indexed in Google please? Am having some robots.txt issues and am keen to see if it got indexed. Thanks!
On-Page Optimization | | Wallander0 -
How to make google not index quotes from other sites?
Hey guys, I have a site where we post quite a lot of info from other sites. We don't want google to de-index our pages because parts of it are quotes from other sites. What would you use to make it so Google sees it's a quote from another site? Or to just make Google not index the quote? Thanks!
On-Page Optimization | | StefanJDorresteijn0