Is there a way to prevent Google Alerts from picking up old press releases?
-
I have a client that wants a lot of old press releases (pdfs) added to their news page, but they don't want these to show up in Google Alerts. Is there a way for me to prevent this?
-
Thanks for the post Keri.
Yep, the OCR option would still make the image option for hiding "moo"
-
Harder, but certainly not impossible. I had Google Alerts come up on scanned PDF copies of newsletters from the 1980s and 1990s that were images.
The files recently moved and aren't showing up for the query, but I did see something else interesting. When I went to view one of the newsletters (https://docs.google.com/file/d/0B2S0WP3ixBdTVWg3RmFadF91ek0/edit?pli=1), it said "extracting text" for a few moments, then had a search box where I could search the document. On the fly, Google was doing OCR work and seemed decently accurate in the couple of tests I had done. There's a whole bunch of these newsletters at http://www.modelwarshipcombat.com/howto.shtml#hullbusters if you want to mess around with it at all.
-
Well that is how to exclude them from an alert that they setup, but I think they are talking about anyone who would setup an alert that might find the PDFs.
One other idea I had, that I think may help. If you setup the PDFs as images vs text then it would be harder for Google to "read" the PDFs and therefore not catalog them properly for the alert, but then this would have the same net effect of not having the PDFs in the index at all.
Danielle, my other question would be - why do they give a crap about Google Alerts specifically. There has been all kinds of issues with the service and if someone is really interested in finding out info on the company, there are other ways to monitor a website than Google Alerts. I used to use services that simply monitor a page (say the news release page) and lets me know when it is updated, this was often faster than Google Alerts and I would find stuff on a page before others who did only use Google Alerts. I think they are being kind of myopic about the whole approach and that blocking for Google Alerts may not help them as much as they think. Way more people simply search on Google vs using Alerts.
-
The easiest thing to do in this situation would be to add negative keywords or advanced operators to your google alert that prevent the new pages from triggering the alert. You can do this be adding advanced operators that exclude an exact match phrase, a file type, the clients domain or just a specific directory. If all the new pdf files will be in the same directory or share a common url structure you can exclude using the "inurl:-" operator.
-
That also presumes Google Alerts is anything near accurate. I've had it come up with things that have been on the web for years and for whatever reason, Google thinks they are new.
-
That was what I was thinking would have to be done... It's a little complicated on why they don't want them showing up in Alerts. They do want them showing up on the web, just not as an Alert. I'll let them know they can't have it both ways!
-
Robots.txt and exclude those files. Note that this takes them out of the web index in general so they will not show up in searches.
You need to ask your client why they are putting things on the web if they do not want them to be found. If they do not want them found, dont put them up on the web.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Metadescription not being pulled by Google? Yoast v SmartCrawl?
Hey guys, For whatever reason, Google isn't pulling the metadescriptions I've provided for a wordpress site I'm working on. We had both Yoast and SmartCrawl installed, so I thought maybe they were confusing Google and deactivated Yoast. Unfortunately, that didn't fix the issue. Instead of using the text I've plugged into SmartCrawl, Google is just using snippets from the blog posts... And it's happening for every single post, leading to a huge uptick in metadata issues in moz. Any idea how to fix this?? Thank you!
On-Page Optimization | | laurendavidson0 -
Update old article or publish new content and redirect old post?
Hi all, I'm targetting a keyword and we used to rank quite good for it. Last couple of months traffic of that keyword (and variations) is going down a bit. I wrote an extensive new post on the same topic, much more in dept and from 600 to 1800 words covering the same topic. Is it better to update the old article and mention that it's updated recently, or publish a new post and redirect the old post to the new post?
On-Page Optimization | | jorisbrabants0 -
Does Google Analytics' Enhanced Link Attribution cause any SEO problems?
We are looking to implement Google Analytics Enhanced Link Attribution on our site. Our tech person says that this will cause SEO problems because of "duplicate URLS." I am not technical, so I don't understand this at all and can't find any research on the topic. I would like to know if there are any known SEO problems caused by putting in Enhanced Link Attribution.
On-Page Optimization | | DGM0 -
How does Google treat Dynamic Titles?
Let's say my website can be accessed in only 3 states Colorado, Arizona and Ohio. I want to display different information to each visitor based on where they are located. For this I would also like the title to change based on their location. Not quite sure how Google we treat the title and rank the site.... Any resources you can provide would be helpful. Thanks
On-Page Optimization | | Firestarter-SEO0 -
Why Isn't Google Authorship Showing My Picture?
I have several clients and the Google Authorship images used display in the search results for all of them. About a month ago all of the images disappeared, however it still displays "by <name>, indicating that Google Authorship is working -- it just doesn't show the image (see screenshots). The image follows the guidelines, and we've got the rel author tag in place, with a link back to Google. </name> When I use the Google Structured Data Testing Tool it shows that authorship is properly functioning. I'm completely stumped. Does anyone have any ideas why this may not be working? Here's two examples of the sites with Authorship not working properly (screenshots below): criminalattorneylongislandny.com
On-Page Optimization | | socialfirestarter
https://dl.dropboxusercontent.com/u/3786946/Screen Shot 2014-01-03 at 12.53.10 PM.png
https://dl.dropboxusercontent.com/u/3786946/Screen Shot 2014-01-03 at 12.44.12 PM.png attorneytonyadderley.com https://dl.dropboxusercontent.com/u/3786946/Screen Shot 2014-01-03 at 12.52.36 PM.png
https://dl.dropboxusercontent.com/u/3786946/Screen Shot 2014-01-03 at 12.52.52 PM.png0 -
404 errors in wordpress... Pages have never existed so why is google trying to crawl them?
I've just logged into webmaster tools and have over 100 404 errors. I'm running wordpress and I recently added child pages to 2 of my categories like so. www.mydomain.com/category1/lincolnshire www.mydomain.com/category1/cambridgeshire etc... The 404 errors though are for pages or categories I've never created though. I have over 20 root categories but decided to test adding child pages to only two of them. The 404 errors are for www.mydomain.com/category5/cambridgeshire .... It seem that gogle has tried to crawl these pages that don't exist. Can anyone explain what's going on? When I click 'linked from' in webmaster tools it's showing links from pages on my site that don't exist also.
On-Page Optimization | | SamCUK0 -
Google's view on geolocated results
Hello everyone, I am working on a project so the website is not online at the moment. My question is about Google's view on geolocated results : on the mainpage of the website, a bloc will be displaying local classifieds according to where the visitor is located. What will be Google's view on this bloc as it has no location ? A white empty bloc ? Bonus question : do you have any experience regarding this kind of situation ? How do you best deal with it in your opinion ? Thanks for your help ! Best Regards, Raphael
On-Page Optimization | | Pureshore0 -
Does Google give weight or importance to scholarly articles such as those found in pubmed?
Does Google give weight or importance to scholarly articles such as those found in pubmed? www.ncbi.nlm.nih.gov/pubmed Do you think it matters to Google if you format and word your contents so that they look like research articles?
On-Page Optimization | | monchconch0