Is there a way to prevent Google Alerts from picking up old press releases?
-
I have a client that wants a lot of old press releases (pdfs) added to their news page, but they don't want these to show up in Google Alerts. Is there a way for me to prevent this?
-
Thanks for the post Keri.
Yep, the OCR option would still make the image option for hiding "moo"
-
Harder, but certainly not impossible. I had Google Alerts come up on scanned PDF copies of newsletters from the 1980s and 1990s that were images.
The files recently moved and aren't showing up for the query, but I did see something else interesting. When I went to view one of the newsletters (https://docs.google.com/file/d/0B2S0WP3ixBdTVWg3RmFadF91ek0/edit?pli=1), it said "extracting text" for a few moments, then had a search box where I could search the document. On the fly, Google was doing OCR work and seemed decently accurate in the couple of tests I had done. There's a whole bunch of these newsletters at http://www.modelwarshipcombat.com/howto.shtml#hullbusters if you want to mess around with it at all.
-
Well that is how to exclude them from an alert that they setup, but I think they are talking about anyone who would setup an alert that might find the PDFs.
One other idea I had, that I think may help. If you setup the PDFs as images vs text then it would be harder for Google to "read" the PDFs and therefore not catalog them properly for the alert, but then this would have the same net effect of not having the PDFs in the index at all.
Danielle, my other question would be - why do they give a crap about Google Alerts specifically. There has been all kinds of issues with the service and if someone is really interested in finding out info on the company, there are other ways to monitor a website than Google Alerts. I used to use services that simply monitor a page (say the news release page) and lets me know when it is updated, this was often faster than Google Alerts and I would find stuff on a page before others who did only use Google Alerts. I think they are being kind of myopic about the whole approach and that blocking for Google Alerts may not help them as much as they think. Way more people simply search on Google vs using Alerts.
-
The easiest thing to do in this situation would be to add negative keywords or advanced operators to your google alert that prevent the new pages from triggering the alert. You can do this be adding advanced operators that exclude an exact match phrase, a file type, the clients domain or just a specific directory. If all the new pdf files will be in the same directory or share a common url structure you can exclude using the "inurl:-" operator.
-
That also presumes Google Alerts is anything near accurate. I've had it come up with things that have been on the web for years and for whatever reason, Google thinks they are new.
-
That was what I was thinking would have to be done... It's a little complicated on why they don't want them showing up in Alerts. They do want them showing up on the web, just not as an Alert. I'll let them know they can't have it both ways!
-
Robots.txt and exclude those files. Note that this takes them out of the web index in general so they will not show up in searches.
You need to ask your client why they are putting things on the web if they do not want them to be found. If they do not want them found, dont put them up on the web.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google index new data from my website page
Hi All, We have pages which are created few weeks before hand for Movie reviews in those pages we add value with adding the Movie cast and crew info and what ever info possible before the movie releases. The the movie releases we watch the movies and write reviews which is 500+ words. Now the issue is the pages are indexed a week before... How can i have these review pages scanned immediately when i have the complete review as the review content is not indexed for 3 to 5 days and the first day or 2 is when its important for the reviews to be seen in Google. Regards
On-Page Optimization | | AlexisWithers0 -
Using Google structured Data for SEO benefit
Hi there I run www.isacleanse.com.au and I've set up some Structured data using Google Webmaster Tools which says it will be picked up during the next Google update (has been set up over 4 weeks ago), however I dont seem to see any of the structured data for the products/reviews/ratings etc coming through in search results. Question at hand: Is there additional things I need to do in the code of the website or should this be sufficient? (see attached screenshot) szpFUpX
On-Page Optimization | | IsaCleanse1 -
Telling Google SERP's my correct currency.
Hi, I'm having a problem with Google SERP results showing my currency as USD, when it should be CAD. An example of a page with this problem is - http://www.absoluteautomation.ca/fgd400-sensaphone400-p/fgd400.htm - can anyone see where Google is getting USD from on there? I don't see it anywhere in the coding. Thanks in advance!
On-Page Optimization | | absoauto0 -
Is the HTML content inside an image slideshow of a website crawled by Google?
I am building a website for a client and i am in a dilemma whether to go for an image slideshow with HTML content on the slides or go for a static full size image on the homepage. My concern is that HTML content on the slideshow may not get crawled by Google and hence may not be SEO friendly.
On-Page Optimization | | aravinn0 -
Dropping Old Site After Too Many Penalties. What Do You Think About the New One?
I finally decided to drop my website after it kept losing traffic even though I spent hundreds of hours and lots of $$$ trying to recover from both Panda and Penguin. I should've started a new website a lot earlier. So here's my new website, let me know if it's worthy of Big G: http://www.webhostinghero.com/ Thank you in advance for your constructive comments!
On-Page Optimization | | sbrault740 -
Should I Remove This Subdirectory From Google?
On my site, I have a subdirectory. It posts articles from a bunch of websites that my readers are interested in & links back to all of those sites. There is no original content in it. There are over 1700 indexed pages in this subdirectory. The rest of my site has about 500 (all original content). The search engine traffic for this subdirectory only accounts for 3.9% of my sites overall visits. Should I consider removing this subdirectory? Could all the duplicate content be hurting the rankings of my legit pages? What do you all think?
On-Page Optimization | | PedroAndJobu0 -
Google SERPS showing wrong page.
I am new to SEO and trying to rank for keyword 'corporate entertainment' and my site is currently at 26. However google is showing the homepage http://www.musicliveuk.com in SERPS as opposed to my optomised page http://www.musicliveuk.com/home/corporate-entertainment. Any ideas why it is choosing so show the home page as the most relevant result?
On-Page Optimization | | SamCUK0