Is there a way to prevent Google Alerts from picking up old press releases?
-
I have a client that wants a lot of old press releases (pdfs) added to their news page, but they don't want these to show up in Google Alerts. Is there a way for me to prevent this?
-
Thanks for the post Keri.
Yep, the OCR option would still make the image option for hiding "moo"
-
Harder, but certainly not impossible. I had Google Alerts come up on scanned PDF copies of newsletters from the 1980s and 1990s that were images.
The files recently moved and aren't showing up for the query, but I did see something else interesting. When I went to view one of the newsletters (https://docs.google.com/file/d/0B2S0WP3ixBdTVWg3RmFadF91ek0/edit?pli=1), it said "extracting text" for a few moments, then had a search box where I could search the document. On the fly, Google was doing OCR work and seemed decently accurate in the couple of tests I had done. There's a whole bunch of these newsletters at http://www.modelwarshipcombat.com/howto.shtml#hullbusters if you want to mess around with it at all.
-
Well that is how to exclude them from an alert that they setup, but I think they are talking about anyone who would setup an alert that might find the PDFs.
One other idea I had, that I think may help. If you setup the PDFs as images vs text then it would be harder for Google to "read" the PDFs and therefore not catalog them properly for the alert, but then this would have the same net effect of not having the PDFs in the index at all.
Danielle, my other question would be - why do they give a crap about Google Alerts specifically. There has been all kinds of issues with the service and if someone is really interested in finding out info on the company, there are other ways to monitor a website than Google Alerts. I used to use services that simply monitor a page (say the news release page) and lets me know when it is updated, this was often faster than Google Alerts and I would find stuff on a page before others who did only use Google Alerts. I think they are being kind of myopic about the whole approach and that blocking for Google Alerts may not help them as much as they think. Way more people simply search on Google vs using Alerts.
-
The easiest thing to do in this situation would be to add negative keywords or advanced operators to your google alert that prevent the new pages from triggering the alert. You can do this be adding advanced operators that exclude an exact match phrase, a file type, the clients domain or just a specific directory. If all the new pdf files will be in the same directory or share a common url structure you can exclude using the "inurl:-" operator.
-
That also presumes Google Alerts is anything near accurate. I've had it come up with things that have been on the web for years and for whatever reason, Google thinks they are new.
-
That was what I was thinking would have to be done... It's a little complicated on why they don't want them showing up in Alerts. They do want them showing up on the web, just not as an Alert. I'll let them know they can't have it both ways!
-
Robots.txt and exclude those files. Note that this takes them out of the web index in general so they will not show up in searches.
You need to ask your client why they are putting things on the web if they do not want them to be found. If they do not want them found, dont put them up on the web.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Snippet showed in google search is not from metaDescription
This is my page https://www.collegehippo.com/graduate-school/programs/top-ranked-masters-degree-museum-museology-and-curatorial-studies The metaDescription is | |
On-Page Optimization | | etattva
| name="description" content="Master's degree in Museum, Museology and Curatorial Studies is offered by 49 American universities. New York University had highest number of international students receiving a Master's degree. Johns Hopkins University had the most women graduates in this program. Job outlook for Museum, Museology and Curatorial Studies Museum, Museology and Curatorial Studies is projected to grow 13 percent from 2016 to 2026, faster than average for all occupations. Median pay for Museum, Museology and Curatorial Studies in 2018 was $53,360. The number of jobs were 11170. Check out best universities offering online Master's program in Museum, Museology and Curatorial Studies "/> |
| | | But when I see the page in google search results for (museum studies graduate programs), This is how it appears in the search results. It is showing the breadcrumbs from the page. I am not sure why is google is treating the page like this. It was not like that 5 months back. Nothing much changed in page and google is displaying the page content like this . How can I fix this?0 -
The correct way to rel=canonical
When adding the rel=canonical tag to a landing page inside a folder, should the tag read: or With or without the index.php? TY KJr
On-Page Optimization | | KevnJr1 -
What is the best way to rank for variations of your business name?
Our business name is APVelocity. We are ranking when it is typed APVelocity, however we are finding many people are typing it AP Velocity. When it is typed with a space, we are not ranking. What is the best way to ensure we show up either way?
On-Page Optimization | | mbrowntci2 -
Google's mobile-friendly update. How significant is the impact for us?
Hi guys. Recently I got an email from Webmaster-tools saying our site is poorly optimised for mobile devices, and that it’s going to heavily affect rankings from April 21st. I’m worried to say the least. We literary cannot afford a hit on traffic at the moment 😞 We rank well for niche terms like ‘customised diary’ and ‘personalised diary’. So question... Because we rank well for these very specific searches will we still take a hit on rankings after the update? Won’t our high relevancy for those search terms be enough to keep us high in the results? Also, do you know if this change is specific to the users device? E.g) Someone on a mobile device will get mobile-friendly results, whilst users on a laptop will get different results altogether? I'm just trying to get a sense of how much this update will effect us. Any isights, suggestion, or thoughts would be greatly appreciated. Our site. Thanks in advance. This community is invaluable to us 🙂 Isaac - TOAD Diaries.
On-Page Optimization | | isaac6630 -
Google Xml Sitemaps
Which plugin is good to use to create and submit my sitemap: sitemap from yoast or google xml sitemap plugin?
On-Page Optimization | | Sebastyan22
Which one is better? I already saw this video but I get an error when I submited it to webmaster tools and I don't know why:http://www.quicksprout.com/university/how-to-set-up-and-optimize-a-sitemap/_''Your Sitemap appears to be an HTML page. Please use a supported sitemap format instead.''_Thank you !0 -
Google's Page Layout Algorythm
It seems that Google have been or will penalizing websites with too many ads above the fold. Is it me or Google's search result layout is a perfect example of what NOT to do?
On-Page Optimization | | sbrault741 -
Does Google index dynamically generated content/headers, etc.?
To avoid dupe content, we are moving away from a model where we have 30,000 pages, each with a separate URL that looks like /prices/<product-name>/<city><state>, often with dupe content because the product overlaps from city to city, and it's hard to keep 30,000 pages unique, where sometimes the only distinction is the price & the city/state.</state></city></product-name> We are moving to a model with around 300 unique pages, where some of the info that used to be in the url will move to the page itself (headers, etc.) to cut down on dupe content on those unique 300 pages. My question is this. If we have 300 unique-content pages with unique URL's, and we then put some dynamic info (year, city, state) into the page itself, will Google index this dynamic content? The question behind this one is, how do we continue to rank for searches for that product in the city-state being searched without having that info in the URL? Any best practices we should know about?
On-Page Optimization | | editabletext0 -
How long would it take for On-Page Optimization to have an effect on Google Rankings?
Hi there, I have a page on our website with an Interview with the author Tess Gerritsen. There has been a reasonable amount of Social Media buzz related to the page and lots of links. According to SEOMoz we are an A grade for the keyword Tess Gerritsen, we currently rank 29th on Google.co.uk for a 'tess gerritsen' search. My question is - how long would it take for any new changes to have an effect? I presume the answer would be whenever the page is crawled again. But is it wise to change one thing, then get crawled and see what the effect is, then the next day change something else and see what the effect is. Or is it wise to change one thing and then leave it a week or so to see the full effect of the change? Apologies for the vague question, if you need any more clarification just let me know. Thanks. Benj
On-Page Optimization | | Benj250