How much content does Google Crawl on your site?
-
Hi,
We've had a debate around the office where some people believe that Google only crawls the first 150-200 words on a page and some people believe that they priority content that is above the fold and other people believe that all content has the same priority. Can you help us?
Thanks,
Matt -
Google actually crawls 150kb, excluding css files, images, etc.
150kb is much more than 200 words, and the experiment suggested by Mr Bennett proves it.
-
They definitely crawl more than that, and it's easy to prove as well.
Pick a long page, such as the Wikipedia page about London. Choose a block of text from near the bottom of that page, I've selected this:
in the south-western suburb of Wimbledon.[252] Other key events are the annual mass-participation London Marathon which sees some 35,000 runners
If you search for that text you will see the Wikipedia page in the results. If they only crawled the first 200 words they wouldn't have been able to find that result.
Prioritising is harder to demonstrate (and probably also to define!). However it is generally believe that greater importance is given to text towards the top of the page. That is logical if you consider how the majority of documents are structured.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Fetch as Google is showing this, help!
Our Fetch as Google in Google Webmaster Tools is showing this. What is this?? Thanks! https://imgur.com/k6KOQZz
On-Page Optimization | | bluejay78780 -
Dissapeared from Google - Urgent
We are new to the whole SEO thing and could be described as total numpties as we have hidden away from it, scared of what it means and what might happen if we open that can of worms. But after seeing a 25% drop in visitors over a year, and a continued fall, we thought we better try and get our heads around what we need to do to improve our chances on google. Consequently we have signed up to MOZ and are exploring its crawl results and trying to learn and action things. Today we noticed that we have dropped fairly completely from Google for many of our top key words. We are based in Italy,our site is in Italian and we only target Italy. Our homepage is http://www.shoechic,it and two of our top keywords that have dissapeared are: "scarpe sexy" & "Scarpe Pleaser" We would love to hear if anyone can throw some light in why this dramatic change may have taken place and what we can do to address it. Many thanks in advance for anyones help and advice. Philip
On-Page Optimization | | shoechic0 -
I've just manually edited all the page titles and meta descriptions on a site, when will this show in Google results?
I've just manually edited all of the page titles, meta descriptions and optimised the copy on a client's site. I submitted this for a new crawl on Google via Webmaster Tools but when I do a Google search the old versions are still showing. Will it still take a few weeks for the new versions to show even though Google has crawled it via Webmaster?
On-Page Optimization | | aoifep0 -
tagged as duplicate content?
Hello folks, I'm new to SEOmoz . I was looking at our Crawl Diagnostics and found that some of our blog posts that have been commented on were tagged as duplicate content. For example: http://thankyouregistry.com/blog/remarriages-and-gift-registries/ http://thankyouregistry.com/blog/remarriages-and-gift-registries/comment-page-1/ I'm unsure how to fix these, so any ideas would be appreciated. Thanks a lot!
On-Page Optimization | | GiftReg0 -
Checking for content duplication against content on your own site.
We are currently trying to rewrite our product descriptions and I'm afraid some of the salespeople that are writing the descriptions are plagiarizing one-another's writing. Is there a content duplication checker that will allow you to check a piece of writing against a specific site rather than all of the web?
On-Page Optimization | | MichealGooden0 -
Duplicate content? Not sure.
Good news! I have my first real SEO gig and now I have to be able to actually deliver. I'm up for it but I want to be sure I'm seeing what I think I am before suggesting any changes. I'm working my way throught Danny Dover's excellent book SEO Secrets and learning tons! To see if there is duplicate content on the site, I've taken a sentence from one of the pages on the site and searched for it: i.e., site:storybooksforhealing.com "Some of the most quiet moments are often the most difficult after a loss. Mornings, late nights, time alone." The SERPs show 7 pages that have this text on it. It seems like this is duplicate content, right? This is a Wordpress website so what's happening is the actual page is here: www.storybooksforhealing.com/publish-cup-of-joy/ but there are several archive pages that show excerpts of this text, too. If this is duplicate content (first question) then how would I go about remedying it? Should I set the canonical reference to /publish-cup-of-joy page? Thank you for being patient with my NOOB questions.
On-Page Optimization | | ChristiMc0 -
Optimisation for Google Images
What techniques do you use for the optimisation of images on Google. Alt tag , image title Surrounding text Anyone tested actual linking to the image url and not the page url. I have achieved hundreds of top listed images but when it gets competitive what is the most useful technique you have used. Thanks
On-Page Optimization | | onlinemediadirect0 -
Does Google still see masked domains as duplicate content?
Older reads state the domain forwarding or masking will create duplicate content but Google has evolved quite a bit and I'm wondering if that is still the case? Not suggesting that a 301 is not the proper way to redirect something but my question is: Does Google still see masked domains as duplicate content? Is there any viable use for domain masking other than for affiliates?
On-Page Optimization | | TracyWeb0