Is there a way to prevent Google Alerts from picking up old press releases?
-
I have a client that wants a lot of old press releases (pdfs) added to their news page, but they don't want these to show up in Google Alerts. Is there a way for me to prevent this?
-
Thanks for the post Keri.
Yep, the OCR option would still make the image option for hiding "moo"
-
Harder, but certainly not impossible. I had Google Alerts come up on scanned PDF copies of newsletters from the 1980s and 1990s that were images.
The files recently moved and aren't showing up for the query, but I did see something else interesting. When I went to view one of the newsletters (https://docs.google.com/file/d/0B2S0WP3ixBdTVWg3RmFadF91ek0/edit?pli=1), it said "extracting text" for a few moments, then had a search box where I could search the document. On the fly, Google was doing OCR work and seemed decently accurate in the couple of tests I had done. There's a whole bunch of these newsletters at http://www.modelwarshipcombat.com/howto.shtml#hullbusters if you want to mess around with it at all.
-
Well that is how to exclude them from an alert that they setup, but I think they are talking about anyone who would setup an alert that might find the PDFs.
One other idea I had, that I think may help. If you setup the PDFs as images vs text then it would be harder for Google to "read" the PDFs and therefore not catalog them properly for the alert, but then this would have the same net effect of not having the PDFs in the index at all.
Danielle, my other question would be - why do they give a crap about Google Alerts specifically. There has been all kinds of issues with the service and if someone is really interested in finding out info on the company, there are other ways to monitor a website than Google Alerts. I used to use services that simply monitor a page (say the news release page) and lets me know when it is updated, this was often faster than Google Alerts and I would find stuff on a page before others who did only use Google Alerts. I think they are being kind of myopic about the whole approach and that blocking for Google Alerts may not help them as much as they think. Way more people simply search on Google vs using Alerts.
-
The easiest thing to do in this situation would be to add negative keywords or advanced operators to your google alert that prevent the new pages from triggering the alert. You can do this be adding advanced operators that exclude an exact match phrase, a file type, the clients domain or just a specific directory. If all the new pdf files will be in the same directory or share a common url structure you can exclude using the "inurl:-" operator.
-
That also presumes Google Alerts is anything near accurate. I've had it come up with things that have been on the web for years and for whatever reason, Google thinks they are new.
-
That was what I was thinking would have to be done... It's a little complicated on why they don't want them showing up in Alerts. They do want them showing up on the web, just not as an Alert. I'll let them know they can't have it both ways!
-
Robots.txt and exclude those files. Note that this takes them out of the web index in general so they will not show up in searches.
You need to ask your client why they are putting things on the web if they do not want them to be found. If they do not want them found, dont put them up on the web.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Fixing Index Errors in the new Google Search Console - Help
Hi, So I have started using the new Search Console and for one of my clients, there are a few 'Index Coverage Errors'. In the old version you could simply, analyse, test and then mark any URLs as fixed - does anyone know if that is possible in the new version? There are options to validate errors but no 'mark as fixed' options. Do you need to validate the errors before you can fix them?
On-Page Optimization | | daniel-brooks0 -
Google Page Rank has no any rankings as of now. what to do?
my domain and page authority is working well right now but my Google Page Rank has no any rankings as we speak. What to do now? can some of you give me advice on this? Thank you very much in advance.
On-Page Optimization | | Panoramictrip0 -
How do you check if press release images are different enough?
We're helping a Sydney blog called Happy develop their local following and we're starting by ensuring their posts are optimized. They're doing a great job with reviews and content but the one thing we noticed is that all the images they use (because they review music) are from bands and artists that are used tens if not hundreds of times in other places. We're trying to set up a simple way for them to tweak these images to ensure they're crawled and seen as original. Anyone had to deal with this and found a solution that makes sense?
On-Page Optimization | | wearehappymedia0 -
What URL Should I use in Google Place Page?
Alright, I have a client that has 1 website and 14 locations. We want to create place pages for each of their locations but my question is which URL should I put in the place page and why? I can put in the root domain into each place page, or should I put in the URL that lands on the actual location on the root. example: domain.com/location1 Thanks!
On-Page Optimization | | tcseopro0 -
Why isn't Google indexing me?
Recently got handed off a .org site for a quasi state agency here in Michigan. Turns out the developer had the site live for the past six months but left the noindex, nofollow tag on everything so the site was invisible to search engines. Obviously we wiped all of those things a couple weeks ago when we got started, added all of our sitemaps to bing/yahoo/google webmaster tools and we've already started getting indexed by yahoo and bing and showing up for branded terms...but NOTHING from Google. WMT says our pages are all indexed, but we aren't showing up for anything in search and we don't seem to be indexed at all. Granted, if this site was brand new and didn't have any links I could see us taking a little time to get found, but this site has very good .gov and .edu links, plus we've built some other solid links to it since we've launched and Google continues to ignore it. I haven't seen this before, but could Google still be ignoring us from the months of noindex, nofollowing? If so, any tips on how to get back in teh Google's good graces here?
On-Page Optimization | | NetvantageMarketing0 -
Google webmaster tools data update frequency?
What is the lag time for changes to a site to be reflected in Google webmaster tools Diagnostics section? They pointed out some duplicate titles which I fixed a week ago and yet they still show up as an HTML Suggestion. What has your experience been with making changes and then seeing them reflected in the HTML Suggestions section? My site is crawled every day, including the pages I have updated with new titles. Seems like it takes a while for the data to trickle into Webmaster tools, no?
On-Page Optimization | | scanlin0 -
Google place 7 -> 40, why??
Hi, my new site http://www.ie-mac.com/ just dropped 33 places from place 7 to place 40 on goolge.com , for the two word combo: ie mac Did I screw up? How? Background Info: 1 Two weeks ago I moved my whole site from my old domain http://ie4mac.com/ to http://www.ie-mac.com/ with the goal of obtaining a good ranking for the keyword combo: ie mac. Apparantly tis worked- The site showed up on place 7. 2. I changed the design of the site and put the video on the front page. Good so far, still place 7, but: The text that google was showing was half the ALT-Tag of the Video first-slide image and the other half was our trademark disclaimer. 3. I changed the ALT tag and the disclaimer to give users a more inviting text on google. THis worked, google now shows the text as intended, but: For the desired combo: ie mac the site dropped to palce 40!! My best guesses at this point: 1. I'm using wordpress as a CMS and the all-in-one-seo-pack plugin to set custom titles etc., and the google XML sitemap plugin to buid an XML sitemap and notify google. During the couple of days, I made a lot of chnages to the site. Could be that the plugin pinged google a lot of times. Could this be part of the problem? 2. The site is hosted at http://www.ixwebhosting.com/ , because they give users dedicated IPs and a good price. However, the loadlevel on the server I'm on is always very high (10 - 20). I'm using a CDN for images and a caching plugin so the site loads in less than 2 seconds according to http://tools.pingdom.com/ . Unless the cache is empty, then it's 9 seconds. This is not great, but it's also no new, so: What could have caused the sudden drop from 7 to 40?? Thank you and kind regards
On-Page Optimization | | ie4mac0 -
Whats the best way to rank high for several different keywords?
I Have a print website www.print.dor2dor.com and we print 100's of products. I was wandering what is the best way to rank high for severall keywords as we dont want to just rank high for printing because when people are searching they normally type in the product they are looking for with printing at the end of it.
On-Page Optimization | | WillFrank0