Blocking Google from telemetry requests
-
At Magnet.me we track the items people are viewing in order to optimize our recommendations. As such we fire POST requests back to our backends every few seconds when enough user initiated actions have happened (think about scrolling for example). In order to eliminate bots from distorting statistics we ignore their values serverside.
Based on some internal logging, we see that Googlebot is also performing these POST requests in its javascript crawling. In a 7 day period, that amounts to around 800k POST requests. As we are ignoring that data anyhow, and it is quite a number, we considered reducing this for bots.
Though, we had several questions about this:
1. Do these requests count towards crawl budgets?
2. If they do, and we'd want to prevent this from happening: what would be the preferred option? Either preventing the request in the frontend code, or blocking the request using a robots.txt line?The latter question is given by the fact that a in-app block for the request could lead to different behaviour for users and bots, and may be Google could penalize that as cloaking. The latter is slightly less convenient from a development perspective, as all logic is spread throughout the application.
I'm aware one should not cloak, or makes pages appear differently to search engine crawlers. However these requests do not change anything in the pages behaviour, and purely send some anonymous data so we can improve future recommendations.
-
Hi Rogier,
- Yes, this is usually counting towards crawl budgets as Googlebot is doing this per request.
- It depends on how your request is being set up obviously, otherwise, I would advise going with the exclusion for the robots.txt that you're already heading towards.
Hope this helps!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google Search console says 'sitemap is blocked by robots?
Google Search console is telling me "Sitemap contains URLs which are blocked by robots.txt." I don't understand why my sitemap is being blocked? My robots.txt look like this: User-Agent: *
Technical SEO | | Extima-Christian
Disallow: Sitemap: http://www.website.com/sitemap_index.xml It's a WordPress site, with Yoast SEO installed. Is anyone else having this issue with Google Search console? Does anyone know how I can fix this issue?1 -
Google how deal with licensed content when this placed on vendor & client's website too. Will Google penalize the client's site for this ?
One of my client bought licensed content from top vendor of Health Industry. This same content is on the vendor's website & my client's site also but on my site there is a link back to vendor is placed which clearly tells to anyone that this is a licensed content & we bought from this vendor. My client bought paid top quality content for best source of industry but at this same this is placed on vendor's website also. Will Google penalize my client's website for this ? Niche is HEALTH
Technical SEO | | sourabhrana1 -
Does Google Parse The Anchor Text while Indexing
Hey moz fanz, I'm here to ask a bit technical and open-minding question.
Technical SEO | | atakala
In the Google's paper http://infolab.stanford.edu/~backrub/google.html
They say they parse the page into hits which is basically word occurences.
But I want to know that they also do the same thing while keeping the anchor text database.
I mean do they parse the anchor text or keep it as it is .
For example, let's say my anchor text is "real car games".
When they indexing my link with anchor text, do they parse my anchor text as hits like
"real" distinct hits
"car" distinct hits
"games" distinct hits.
OR do they just use it as it is. As "real car games"0 -
Why google indexed pages are decreasing?
Hi, my website had around 400 pages indexed but from February, i noticed a huge decrease in indexed numbers and it is continually decreasing. can anyone help me to find out the reason. where i can get solution for that? will it effect my web page ranking ?
Technical SEO | | SierraPCB0 -
Google Structured Data Problem
Hello everyone, About 1-2 weeks ago, I have implemented rich snippets (microdata) for the product pages of my e-commerce site. However, in the web masters tools, google is saying that the crawlers did not detect any structured data in my site. I have also checked my pages using Structured Data Testing Tool. You can see an example test result in the following address. http://www.google.com/webmasters/tools/richsnippets?q=http%3A%2F%2Fwww.tarzimon.com%2Fproduct%2Fnaif-tasarim-torr-aydinlatma-1031 What may cause this problem? Thank you for your help
Technical SEO | | hknkynr0 -
Google Local Gone Loco
I am a bankruptcy attorney in Southern California. I have been doing my own SEO since I had a couple of bad experiences paying someone to "do" it in the past. If you want it done right, do it yourself I suppose. Anyway, I have been ranking well in Google local results. At first I peeked in at 3/3 showing on the first page of the searches. Then I climbed to Number 2 in local searches, probably as a result of finding sites and making sure my addresses, phone numbers and business names were all correct. However, this week (as I climbed to #3 spot in the local search for my city+ bankruptcy attorney, my Google local result dropped to page 2. One of my employees rated me on google local and gave me a google + which is gone and the pictures that I uploaded to Local Google are gone. I don't know if this is some kind of penalty because an employee gave me a rating (they were completely up front about working for me) or if something else is going on. I was also trying to claim my business on Yahoo (which resulted in some kind of "Account Suspension"). I have no idea what is going on. You can take a look at my site if it helps: http://ashcraftfirm.com We are trying to rank for "murrieta bankruptcy attorney" Thanks for any help you can provide.
Technical SEO | | gcashcraft0 -
Ads above the fold penalty. Should I request reinclusion?
HI! My site has been losing traffic slowly for about 18 months. But it was in January 19 that was hit big time. My site has a lot of ads, including two 300x250 above the fold ads that were very lucrative for me. After January 19, I decided to remove only one ad of those two, but no change was reflected in the traffic. It is obvious that I needed to remove the other ad, but I didn't do it for two reasons. I still earn money from that ad and removing it would result in serious problems. A webmaster friend of mine that was hit too by this penalty, removed the ads and tried all sort of stuff to regain the lost traffic with NO LUCK in several months. He has unique and excellent content. So, after seeing his experience I didn't want to touch my biggest source of income and leave it as it is. My site has other problems that concerns Panda and maybe Penguin, and since yesterday I've been starting to fix them. Is it a good idea to request a reinclusion to check if I was manually penalized, without being previously notified by GWMT of any problem in my site? Thanks in advance, Enrique
Technical SEO | | enriquef0