Does Google have a separate crawler for Javascript and Content?
-
Someone told me this is true.
-
Much similar to many of the rules and guidelines by which Google calculates search rankings, they don't openly provide that type of information to the general public, because obviously their users would take advantage of it and manipulate the crawlers for the purpose of altering their rankings.
If you wanted to know something like that for sure, you would probably need to conduct field research doing something extreme like having 100% javascript content in one segment of your site and 100% HTML on another and track which IPs and user agents hit the pages.
However an educated guess of mine believes this:
- The bots that crawl HTML also crawl Javascript. To make separate bots to do individual tasks would be stupid.
- there would be absolutely no benefit nor would the sanitation of the data the crawlers obtain be increased using seperate bots. Because it can be clearly concluded the difference between html and Javascript, and at an automated level as well.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google not Indexing images on CDN.
My URL is: http://bit.ly/1H2TArH We have set up a CDN on our own domain: http://bit.ly/292GkZC We have an image sitemap: http://bit.ly/29ca5s3 The image sitemap uses the CDN URLs. We verified the CDN subdomain in GWT. The robots.txt does not restrict any of the photos: http://bit.ly/29eNSXv. We used to have a disallow to /thumb/ which had a 301 redirect to our CDN but we removed both the disallow in the robots.txt as well as the 301. Yet, GWT still reports none of our images on the CDN are indexed. The above screenshot is from the GWT of our main domain.The GWT from the CDN subdomain just shows 0. We did not submit a sitemap to the verified subdomain property because we already have a sitemap submitted to the property on the main domain name. While making a search of images indexed from our CDN, nothing comes up: http://bit.ly/293ZbC1While checking the GWT of the CDN subdomain, I have been getting crawling errors, mainly 500 level errors. Not that many in comparison to the number of images and traffic that we get on our website. Google is crawling, but it seems like it just doesn't index the pictures!? Can anyone help? I have followed all the information that I was able to find on the web but yet, our images on the CDN still can't seem to get indexed.
Intermediate & Advanced SEO | | alphonseha0 -
Removing duplicate content
Due to URL changes and parameters on our ecommerce sites, we have a massive amount of duplicate pages indexed by google, sometimes up to 5 duplicate pages with different URLs. 1. We've instituted canonical tags site wide. 2. We are using the parameters function in Webmaster Tools. 3. We are using 301 redirects on all of the obsolete URLs 4. I have had many of the pages fetched so that Google can see and index the 301s and canonicals. 5. I created HTML sitemaps with the duplicate URLs, and had Google fetch and index the sitemap so that the dupes would get crawled and deindexed. None of these seems to be terribly effective. Google is indexing pages with parameters in spite of the parameter (clicksource) being called out in GWT. Pages with obsolete URLs are indexed in spite of them having 301 redirects. Google also appears to be ignoring many of our canonical tags as well, despite the pages being identical. Any ideas on how to clean up the mess?
Intermediate & Advanced SEO | | AMHC0 -
Google snippet chosen why?
We have a page about buying property in the Megeve area of the Alps in France. We are No.2 on Google.co.uk for the term "megeve property for sale" and No.1 for "megeve property". http://www.prestigeproperty.co.uk/MegeveProperty/Properties.asp If you search for "megeve property for sale", Google serves our META description as the snippet: Ski chalets, homes and apartments for sale in this exclusive, prestigious Rhone Alpes village - 520000-16500000 EUR. However, we noticed that searching for just "megeve property" serves up a much better snippet taken from the text on the page: A crucial factor for potential property buyers is that there is a strong rental market in Megève and this remains high all year around with properties close to the ... Does anyone know why Google would serve this particular snippet instead of the META description. Is it the number of strong and descriptive words used, or some other reason?
Intermediate & Advanced SEO | | PPGUKLTD0 -
Google Authorship: Having others write content and authorship link to/from G+ profiles Impact Ranking?
Hi all! I am considering having several others write content for a new website and authorship link each to/from G+ profiles. Any idea of how that will Impact page/website ranking? I would think it would give more credibility to each page, and the website as a whole. No?
Intermediate & Advanced SEO | | BBuck0 -
Is this ok for content on our site?
We run a printing company and as an example the grey box (at the bottom of the page) is what we have on each page http://www.discountbannerprinting.co.uk/banners/vinyl-pvc-banners.html We used to use this but tried to get most of the content on the page, but we now want to add a bit more in-depth information to each page. The question i have is - would a 1200 word document be ok in there and not look bad to Google.
Intermediate & Advanced SEO | | BobAnderson0 -
Missing Suite Number on Google
I realized that we are missing a suite number. It is not on the website or the recently updated Google/Bing/Yahoo revisions I did. Should I go and fix? Or should I go and adjust old listings. Does a suite number matter in the NAP?
Intermediate & Advanced SEO | | greenhornet770 -
Google Local oddity
So I spotted something a little weird... one of my client's Google Local placements in blended results has the domain name - complete with the .com extension appearing where the business name typically appears: Businessxyz.com www. businessxyz .com of Google reviews Has anyone seen this? I setup their Google Places account quite some time ago and used the business name - not the url. I also setup their Google+ and Local page - using the name. None of the page titles on the website contain the url. I simply can not pinpoint where G is pulling this from or why for that matter. All competitors are appearing with business name - only my client has the domain name visible for the particular local search query. Any ideas?
Intermediate & Advanced SEO | | SCW0 -
Ajax Content Indexed
I used the following guide to implement the endless scroll https://developers.google.com/webmasters/ajax-crawling/docs/getting-started crawlers and correctly reads all URLs the command "site:" show me all indexed Url with #!key=value I want it to be indexed only the first URL, for the other Urls I would be scanned but not indexed like if there were the robots meta tag "noindex, follow" how I can do?
Intermediate & Advanced SEO | | wwmind1